DSpace Repository

Language Identification and Context-based Analysis of Code-switching Behaviors in Social Media Discussions

Show simple item record

dc.contributor.author Sharma, Yashvardhan
dc.date.accessioned 2024-11-15T09:11:47Z
dc.date.available 2024-11-15T09:11:47Z
dc.date.issued 2019
dc.identifier.uri https://ieeexplore.ieee.org/abstract/document/9006032
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16392
dc.description.abstract Social media discussions see the participation of multilingual individuals: who tend to utilize alternate languages in a single post (code-switching) for effective communication in a discussion. This paper attempts to characterize such discussions to analyze contextual factors related to multilingual communities. Features extracted from the posts are used to train a CRF-based sequence labeling algorithm for language identification in an intra-sentential code-switching scenario. The context of a sentence in a discussion is modeled in defining relevance through Term Frequency Inverse Document Frequency (TF-IDF). Further context of a multilingual sentence with respect to the discussion such as agreement and questioning between pairs of posts is also modeled. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.subject Computer Science en_US
dc.subject Code-switching en_US
dc.subject Data mining en_US
dc.subject Language identification en_US
dc.subject CRF en_US
dc.title Language Identification and Context-based Analysis of Code-switching Behaviors in Social Media Discussions en_US
dc.type Article en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account