Please use this identifier to cite or link to this item:
http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16342
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Sharma, Yashvardhan | - |
dc.date.accessioned | 2024-11-12T08:29:25Z | - |
dc.date.available | 2024-11-12T08:29:25Z | - |
dc.date.issued | 2023 | - |
dc.identifier.uri | https://aclanthology.org/2023.wat-1.3/ | - |
dc.identifier.uri | http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16342 | - |
dc.description.abstract | This paper describes the proposed system for mutlimodal machine translation. We have participated in multimodal translation tasks for English into three Indic languages: Hindi, Bengali, and Malayalam. We leverage the inherent richness of multimodal data to bridge the gap of ambiguity in translation. We fine-tuned the ‘No Language Left Behind’ (NLLB) machine translation model for multimodal translation, further enhancing the model accuracy by image data augmentation using latent diffusion. Our submission achieves the best BLEU score for English-Hindi, English-Bengali, and English-Malayalam language pairs for both Evaluation and Challenge test sets. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Association for Computational Linguistics | en_US |
dc.subject | Computer Science | en_US |
dc.subject | Machine Translation (MT) | en_US |
dc.subject | Multimodal Machine Translation (MMT) | en_US |
dc.subject | Image Augmentation | en_US |
dc.subject | Multimodal Machine Translation (MMT) | en_US |
dc.title | BITS-P at WAT 2023: Improving Indic Language Multimodal Translation by Image Augmentation using Diffusion Models | en_US |
dc.type | Article | en_US |
Appears in Collections: | Department of Computer Science and Information Systems |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.