Explorative Application of Fusion Techniques for Multimodal Hate Speech Detection

Sharma, Yashvardhan

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16361

Full metadata record

DC Field	Value	Language
dc.contributor.author	Sharma, Yashvardhan	-
dc.date.accessioned	2024-11-13T10:21:47Z	-
dc.date.available	2024-11-13T10:21:47Z	-
dc.date.issued	2022	-
dc.identifier.uri	https://link.springer.com/article/10.1007/s42979-021-01007-7	-
dc.identifier.uri	http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16361	-
dc.description.abstract	Hate speech detection is an important research area owing to the severe effects of hate speech on society. Hence, automated hate speech detection based on textual data has assumed a pivotal role among research groups. Moreover, the exponential growth of multimodal content on social media, such as hateful memes, creates the need for efficient machine learning models that can handle such content. In this work, we explore different fusion techniques and compare their performance on the multimodal hate speech identification task. In particular, we test new combinations of textual and visual models to improve performance on the MMHS150K dataset. We apply the corresponding preprocessing techniques to the text and images of tweets. Then, we use a pre-trained BERT model for textual feature extraction and InceptionV3, Inception-ResNet, and ResNeXt to extract features from the images. To fuse the vision and text modalities efficiently, we apply early fusion techniques (concatenation and the product rule) and late fusion techniques (distribution summation, performance weighting, logarithmic opinion pooling, and rules learned from training on probabilities). We also employ the SMOTE oversampling technique and random undersampling to deal with the class imbalance in the MMHS150K dataset. Our proposed model achieves an accuracy of 67.7%, which is comparable to the state of the art.	en_US
dc.language.iso	en	en_US
dc.publisher	Springer	en_US
dc.subject	Computer Science	en_US
dc.subject	SMOTE	en_US
dc.subject	Speech Detection	en_US
dc.subject	Fusion Techniques	en_US
dc.title	Explorative Application of Fusion Techniques for Multimodal Hate Speech Detection	en_US
dc.type	Article	en_US
Appears in Collections:	Department of Computer Science and Information Systems
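
Illustrative note: the abstract names several fusion rules without defining them. The sketch below shows one common reading of concatenation (early fusion) and of distribution summation, performance weighting, and the logarithmic opinion pool (late fusion). It is not the authors' implementation. All function names, the use of validation accuracy as the performance weight, and the toy probabilities are assumptions made for illustration only.

```python
import numpy as np


def early_fusion_concat(text_feats, image_feats):
    """Early fusion: join per-sample feature vectors along the feature axis."""
    return np.concatenate([text_feats, image_feats], axis=1)


def _normalize(scores):
    """Rescale each row so that the class scores sum to 1."""
    return scores / scores.sum(axis=1, keepdims=True)


def distribution_summation(probs):
    """Late fusion: sum the class distributions of all models, then renormalize."""
    return _normalize(np.sum(np.asarray(probs, dtype=float), axis=0))


def performance_weighting(probs, accuracies):
    """Late fusion: weight each model's distribution by its (assumed) validation accuracy."""
    weights = np.asarray(accuracies, dtype=float)
    weights = weights / weights.sum()
    # Contract the model axis: (m,) with (m, n, c) gives (n, c).
    return _normalize(np.tensordot(weights, np.asarray(probs, dtype=float), axes=1))


def log_opinion_pool(probs, weights=None, eps=1e-12):
    """Late fusion: weighted geometric mean of the distributions, computed in log space."""
    probs = np.asarray(probs, dtype=float)
    if weights is None:
        weights = np.full(len(probs), 1.0 / len(probs))
    log_mix = np.tensordot(np.asarray(weights, dtype=float), np.log(probs + eps), axes=1)
    # Subtract the row maximum before exponentiating for numerical stability.
    log_mix = log_mix - log_mix.max(axis=1, keepdims=True)
    return _normalize(np.exp(log_mix))


# Toy class probabilities (not-hate, hate) from a hypothetical text and image model.
text_probs = np.array([[0.8, 0.2], [0.4, 0.6]])
image_probs = np.array([[0.6, 0.4], [0.1, 0.9]])

summed = distribution_summation([text_probs, image_probs])
weighted = performance_weighting([text_probs, image_probs], accuracies=[0.6, 0.4])
pooled = log_opinion_pool([text_probs, image_probs])
```

Each late fusion function returns one probability row per sample, so the predicted class is the row-wise `argmax`. Early fusion instead produces a single joint feature matrix that a downstream classifier would be trained on.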
