Applying TF-IDF and BERT-based Variants under Multilabel Classification for Emotion Detection in Urdu Language

dc.contributor.authorSharma, Yashvardhan
dc.date.accessioned2024-11-14T06:35:35Z
dc.date.available2024-11-14T06:35:35Z
dc.date.issued2022
dc.description.abstractNowadays, the use of emojis is very common to show our emotions with just a single image instead of long sentences describing our emotions. Each emoji describes a particular emotion, such as anger, disgust, fear, sadness, surprise, and happiness. Now if we are given a task to identify emotions in a text, that means we have to tag a text with multiple emojis, each pointing to a different emotion. This paper aims to check for multiple emotions in an Urdu text, which comes under the category of multi-label classification. We have used pre-trained BERT models to add basic knowledge about a language (Urdu in our case). Over the pre-trained model, we added the classification layer using PyTorch. The output layer has seven nodes, six of which are for six emotions, and the seventh is for neutral. FIRE 2022 provided the Urdu tweet dataset used here as part of the subtask ”Multi-label emotion classification in Urdu” of the main task ”Emothreat: Emotion and Threat detection in Urdu.”en_US
dc.identifier.urihttps://ceur-ws.org/Vol-3395/T4-7.pdf
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16368
dc.language.isoenen_US
dc.publisherCEUR-WSen_US
dc.subjectComputer Scienceen_US
dc.subjectSocial mediaen_US
dc.subjectUrduHacken_US
dc.subjectBERTen_US
dc.subjectMulti-label classificationen_US
dc.subjectNegative weighten_US
dc.subjectPositive weighten_US
dc.subjectTransformers modelen_US
dc.subjectText classificationen_US
dc.titleApplying TF-IDF and BERT-based Variants under Multilabel Classification for Emotion Detection in Urdu Languageen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: