
dc.creator Viloria, Amelec
dc.creator Pineda Lezama, Omar Bonerge
dc.creator Chang, Eduardo
dc.description.abstract One problem in classification tasks is the handling of the features that characterize classes. When the list of features is long, one can either use an algorithm that is resistant to the noise of irrelevant features or reduce the number of features. Authorship attribution, the task of assigning an anonymous text to one of a list of possible authors, has been widely addressed as an automatic text classification task. In this task, n-grams can produce long lists of features even in small corpora. Despite this, little research has examined the effects of using noise-resistant algorithms, reducing features, or combining both options. This paper addresses that gap using contributions to discussion forums related to organized crime. The results show that the classifiers evaluated generally benefit from feature reduction and that, thanks to such reduction, even classical algorithms outperform state-of-the-art classifiers considered highly noise resistant.
dc.publisher Corporación Universidad de la Costa [spa]
dc.rights CC0 1.0 Universal [*]
dc.source Procedia Computer Science [spa]
dc.subject Authorship attribution [spa]
dc.subject Classification features [spa]
dc.subject Noise resistant algorithms [spa]
dc.subject Feature reduction [spa]
dc.title Classification of authors for an automatic recommendation process for criminal responsibility [spa]
dcterms.references [1] Vorobeva, A. A. (2016, April). Examining the performance of classification algorithms for imbalanced data sets in web author identification. In 2016 18th Conference of Open Innovations Association and Seminar on Information Security and Protection of Information Technology (FRUCT-ISPIT) (pp. 385-390).
dcterms.references [2] Rocha, A., Scheirer, W. J., Forstall, C. W., Cavalcante, T., Theophilo, A., Shen, B., ... & Stamatatos, E. (2016). Authorship attribution for social media forensics. IEEE Transactions on Information Forensics and Security, 12(1),
dcterms.references [3] Rico-Sulayes, A. (2017). Reducing Vector Space Dimensionality in Automatic Classification for Authorship Attribution. Revista Científica de Ingeniería Electrónica, Automática y Comunicaciones, 38(3),
dcterms.references [4] Win, K. N., Li, K., Chen, J., Viger, P. F., & Li, K. (2019). Fingerprint classification and identification algorithms for criminal investigation: A survey. Future Generation Computer
dcterms.references [5] Tarmizi, N., Saee, S., & Ibrahim, D. H. A. (2020). Author identification for under-resourced language Kadazandusun. Indonesian Journal of Electrical Engineering and Computer Science, 17(1),
dcterms.references [6] Sun, S. (2019). Application of Fuzzy Image Restoration in Criminal Investigation. Journal of Visual Communication and Image Representation,
dcterms.references [7] Boenninghoff, B., Nickel, R. M., Zeiler, S., & Kolossa, D. (2019, May). Similarity learning for authorship verification in social media. In ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 2457-2461).
dcterms.references [8] Watson, D. (2019). Source Code Stylometry and Authorship Attribution for Open Source (Master's thesis, University of Waterloo). [spa]
dcterms.references [9] Juola, P., Milička, J., & Zemánek, P. (2018). Authorship and time attribution of Arabic texts using JGAAP. In Intelligent Natural Language Processing: Trends and Applications (pp. 325-349). Springer,
dcterms.references [10] Hannah-Moffat, K. (2019). Algorithmic risk governance: Big data analytics, race and information activism in criminal justice debates. Theoretical Criminology, 23(4),
dcterms.references [11] Mutanen, T. P., Metsomaa, J., Liljander, S., & Ilmoniemi, R. J. (2018). Automatic and robust noise suppression in EEG and MEG: The SOUND algorithm. Neuroimage, 166,
dcterms.references [12] Usha, A., & Thampi, S. M. (2017, December). Authorship Analysis of Social Media Contents Using Tone and Personality Features. In International Conference on Security, Privacy and Anonymity in Computation, Communication and Storage (pp. 212-228). Springer,
dcterms.references [13] Hasanov, A., & Mukanova, B. (2017). Fourier Collocation Algorithm for identification of a spacewise dependent source in wave equation from Neumann-type measured data. Applied Numerical Mathematics, 111,
dcterms.references [14] Reddy, T. R., Vardhan, B. V., & Reddy, P. V. (2016). A survey on authorship profiling techniques. International Journal of Applied Engineering Research, 11(5),
dcterms.references [15] Sun, F., Gu, Y., Cao, Y., Lu, Q., Bai, Y., Li, L., ... & Li, T. (2019). Novel flexible pressure sensor combining with dynamic-time-warping algorithm for handwriting identification. Sensors and Actuators A: Physical, 293,

Except where otherwise noted, this item's license is described as CC0 1.0 Universal