dc.contributor.author | Sharma, Yashvardhan | |
dc.date.accessioned | 2024-11-15T09:11:47Z | |
dc.date.available | 2024-11-15T09:11:47Z | |
dc.date.issued | 2019 | |
dc.identifier.uri | https://ieeexplore.ieee.org/abstract/document/9006032 | |
dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16392 | |
dc.description.abstract | Social media discussions see the participation of multilingual individuals: who tend to utilize alternate languages in a single post (code-switching) for effective communication in a discussion. This paper attempts to characterize such discussions to analyze contextual factors related to multilingual communities. Features extracted from the posts are used to train a CRF-based sequence labeling algorithm for language identification in an intra-sentential code-switching scenario. The context of a sentence in a discussion is modeled in defining relevance through Term Frequency Inverse Document Frequency (TF-IDF). Further context of a multilingual sentence with respect to the discussion such as agreement and questioning between pairs of posts is also modeled. | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.subject | Computer Science | en_US |
dc.subject | Code-switching | en_US |
dc.subject | Data mining | en_US |
dc.subject | Language identification | en_US |
dc.subject | CRF | en_US |
dc.title | Language Identification and Context-based Analysis of Code-switching Behaviors in Social Media Discussions | en_US |
dc.type | Article | en_US |
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |