DSpace Repository

BITS-P at WAT 2023: Improving Indic Language Multimodal Translation by Image Augmentation using Diffusion Models

Show simple item record

dc.contributor.author Sharma, Yashvardhan
dc.date.accessioned 2024-11-12T08:29:25Z
dc.date.available 2024-11-12T08:29:25Z
dc.date.issued 2023
dc.identifier.uri https://aclanthology.org/2023.wat-1.3/
dc.identifier.uri http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16342
dc.description.abstract This paper describes the proposed system for mutlimodal machine translation. We have participated in multimodal translation tasks for English into three Indic languages: Hindi, Bengali, and Malayalam. We leverage the inherent richness of multimodal data to bridge the gap of ambiguity in translation. We fine-tuned the ‘No Language Left Behind’ (NLLB) machine translation model for multimodal translation, further enhancing the model accuracy by image data augmentation using latent diffusion. Our submission achieves the best BLEU score for English-Hindi, English-Bengali, and English-Malayalam language pairs for both Evaluation and Challenge test sets. en_US
dc.language.iso en en_US
dc.publisher Association for Computational Linguistics en_US
dc.subject Computer Science en_US
dc.subject Machine Translation (MT) en_US
dc.subject Multimodal Machine Translation (MMT) en_US
dc.subject Image Augmentation en_US
dc.subject Multimodal Machine Translation (MMT) en_US
dc.title BITS-P at WAT 2023: Improving Indic Language Multimodal Translation by Image Augmentation using Diffusion Models en_US
dc.type Article en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account