Please use this identifier to cite or link to this item:
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16566
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chalapathi, G.S.S. | - |
dc.date.accessioned | 2024-12-03T09:04:24Z | - |
dc.date.available | 2024-12-03T09:04:24Z | - |
dc.date.issued | 2023 | - |
dc.identifier.uri | https://ieeexplore.ieee.org/document/10337279/authors#authors | - |
dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16566 | - |
dc.description.abstract | This paper investigates various code-switching properties of conversational speech from bilingual English-Malay Singaporean speakers with data obtained from the National Speech Corpus (NSC) and provides baseline language models for various combinations of English-Malay monolingual and code-switching transcripts. Specifically, the study analyzed the correlation between code-switching patterns and (i) trigger words and code-switched word pairs at code-switching points, and (ii) word-wise POS and pairwise POS tags. Our analysis shows that a certain set of words frequently “triggered” code-switching behavior, and that speakers tend to code-switch more frequently around nouns. Additionally, we provide perplexities for language models built on the selected datasets. These perplexities could serve as baselines for future language models for Singaporean speech, especially English-Malay code-switched speech. | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.subject | EEE | en_US |
dc.subject | Code-switching | en_US |
dc.subject | Language Modeling | en_US |
dc.subject | Part-of-Speech | en_US |
dc.title | Singaporean Conversational English-Malay Code-Switching Speech: An Analysis Based on Code-switching Points and Part-of-Speech | en_US |
dc.type | Article | en_US |
Appears in Collections: | Department of Electrical and Electronics Engineering |