BITS Pilani at HinglishEval: Quality Evaluation for Code-Mixed Hinglish Text Using Transformers

dc.contributor.authorSharma, Yashvardhan
dc.date.accessioned2024-11-13T08:59:17Z
dc.date.available2024-11-13T08:59:17Z
dc.date.issued2022
dc.description.abstractCode-Mixed text data consists of sentences having words or phrases from more than one language. Most multi-lingual communities worldwide communicate using multiple languages, with English usually one of them. Hinglish is a Code-Mixed text composed of Hindi and English but written in Roman script. This paper aims to determine the factors influencing the quality of Code-Mixed text data generated by the system. For the HinglishEval task, the proposed model uses multilingual BERT to find the similarity between synthetically generated and human-generated sentences to predict the quality of synthetically generated Hinglish sentences.en_US
dc.identifier.urihttps://aclanthology.org/2022.inlg-genchal.6/
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16358
dc.language.isoenen_US
dc.publisherAssociation for Computational Linguisticsen_US
dc.subjectComputer Scienceen_US
dc.subjectCode-Mixed texten_US
dc.subjectHinglishEval tasken_US
dc.subjectBERTen_US
dc.titleBITS Pilani at HinglishEval: Quality Evaluation for Code-Mixed Hinglish Text Using Transformersen_US
dc.typeArticleen_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: