Bits_Pilani@INLI-FIRE-2017: Indian Native Language Identification using Deep Learning

Date

2017

Publisher

CEUR

Abstract

The task of Native Language Identification involves identifying the prior or first-learnt language of a user from their writing style and/or from an analysis of their speech and phonetics in a second language. Such data is abundant on social media sites and in organised datasets from bodies such as the Educational Testing Service (ETS), and it can be exploited to develop language learning systems and to support forensic linguistics. In this paper we propose a deep neural network for the INLI task at FIRE 2017, using a hierarchical paragraph encoder with an attention mechanism to identify relevant features among the tendencies and errors a user exhibits in the second language. The task involves six Indian languages as the set of prior/native languages and English as the second language, with text collected from users' social media accounts.
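The idea of a hierarchical encoder with attention can be illustrated with a minimal sketch: word vectors are pooled into a sentence vector, and sentence vectors are pooled into a paragraph vector, each time with attention weights. This is not the authors' implementation. The abstract does not specify the layers, so the sketch assumes plain dot-product attention against fixed context vectors and omits any learned encoder; the names `attention_pool`, `encode_paragraph`, `word_context`, and `sentence_context` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, context):
    """Return the attention-weighted average of `vectors` and the weights.

    Each vector is scored by its dot product with `context` (an assumed,
    fixed stand-in for a learned context vector).
    """
    scores = [sum(a * b for a, b in zip(vec, context)) for vec in vectors]
    weights = softmax(scores)
    dim = len(vectors[0])
    pooled = [sum(w * vec[d] for w, vec in zip(weights, vectors))
              for d in range(dim)]
    return pooled, weights


def encode_paragraph(paragraph, word_context, sentence_context):
    """Two-level pooling: words -> sentence vectors -> paragraph vector.

    `paragraph` is a list of sentences; each sentence is a list of
    word vectors of equal length.
    """
    sentence_vectors = [attention_pool(sentence, word_context)[0]
                        for sentence in paragraph]
    return attention_pool(sentence_vectors, sentence_context)


# Toy input: two sentences with 2-dimensional word vectors.
paragraph = [[[1.0, 0.0], [0.0, 1.0]], [[1.0, 1.0]]]
paragraph_vector, sentence_weights = encode_paragraph(
    paragraph, word_context=[1.0, 0.0], sentence_context=[0.5, 0.5])
```

In a full model the resulting paragraph vector would be passed to a classifier over the six native languages, and the context vectors would be learned during training rather than fixed by hand.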

Keywords

Computer Science, Native Language Identification, Natural Language Processing, Deep Learning, Neural networks
