Please use this identifier to cite or link to this item:
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16392| Title: | Language Identification and Context-based Analysis of Code-switching Behaviors in Social Media Discussions |
| Authors: | Sharma, Yashvardhan |
| Keywords: | Computer Science Code-switching Data mining Language identification CRF |
| Issue Date: | 2019 |
| Publisher: | IEEE |
| Abstract: | Social media discussions see the participation of multilingual individuals: who tend to utilize alternate languages in a single post (code-switching) for effective communication in a discussion. This paper attempts to characterize such discussions to analyze contextual factors related to multilingual communities. Features extracted from the posts are used to train a CRF-based sequence labeling algorithm for language identification in an intra-sentential code-switching scenario. The context of a sentence in a discussion is modeled in defining relevance through Term Frequency Inverse Document Frequency (TF-IDF). Further context of a multilingual sentence with respect to the discussion such as agreement and questioning between pairs of posts is also modeled. |
| URI: | https://ieeexplore.ieee.org/abstract/document/9006032 http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16392 |
| Appears in Collections: | Department of Computer Science and Information Systems |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.