DSpace logo

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16381
Full metadata record
DC FieldValueLanguage
dc.contributor.authorSharma, Yashvardhan-
dc.date.accessioned2024-11-14T10:41:10Z-
dc.date.available2024-11-14T10:41:10Z-
dc.date.issued2021-
dc.identifier.urihttps://ceur-ws.org/Vol-3159/T3-14.pdf-
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16381-
dc.description.abstractThis paper presents the method adopted for completing Task 1 of Dravidian-CodeMix-HASOC (Hate Speech and Offensive Content Identification in English and Indo-European Languages) Shared Task proposed by the Forum of Information Retrieval Evaluation in 2021, for offensive language detection. For detecting offensive language, a custom model architecture using convolutional neural networks was created using Keras for supervised learning, and trained on a dataset of YouTube comments, written in code-mixed Tamil in both Roman and Tamil scripts. The 5 layer neural network was built only using Keras, and required simple tokenized data, padded to an appropriate length. Recurrent neural networks and transfer learning were not used, and an F-score of 0.835 was achieved with the created CNN model.en_US
dc.language.isoenen_US
dc.publisherCEUR-WSen_US
dc.subjectComputer Scienceen_US
dc.subjectOffensive language detectionen_US
dc.subjectCode-Mixed texten_US
dc.subjectTamilen_US
dc.subjectHASOCen_US
dc.titleOffensive Language Classification of Code-Mixed Tamil with Kerasen_US
dc.typeArticleen_US
Appears in Collections:Department of Computer Science and Information Systems

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.