A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

Yadav, Ashima; Vishwakarma, Dinesh Kumar

doi:10.1145/3517139

Computer Science > Multimedia

arXiv:2012.08256 (cs)

[Submitted on 15 Dec 2020]

Title:A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

Authors:Ashima Yadav, Dinesh Kumar Vishwakarma

View PDF

Abstract:Multimodal sentiment analysis has attracted increasing attention with broad application prospects. The existing methods focuses on single modality, which fails to capture the social media content for multiple modalities. Moreover, in multi-modal learning, most of the works have focused on simply combining the two modalities, without exploring the complicated correlations between them. This resulted in dissatisfying performance for multimodal sentiment classification. Motivated by the status quo, we propose a Deep Multi-Level Attentive network, which exploits the correlation between image and text modalities to improve multimodal learning. Specifically, we generate the bi-attentive visual map along the spatial and channel dimensions to magnify CNNs representation power. Then we model the correlation between the image regions and semantics of the word by extracting the textual features related to the bi-attentive visual features by applying semantic attention. Finally, self-attention is employed to automatically fetch the sentiment-rich multimodal features for the classification. We conduct extensive evaluations on four real-world datasets, namely, MVSA-Single, MVSA-Multiple, Flickr, and Getty Images, which verifies the superiority of our method.

Comments:	11 pages, 7 figures
Subjects:	Multimedia (cs.MM)
Cite as:	arXiv:2012.08256 [cs.MM]
	(or arXiv:2012.08256v1 [cs.MM] for this version)
	https://doi.org/10.48550/arXiv.2012.08256
Journal reference:	ACM Transactions on Multimedia Computing, Communications, and Applications, 2022
Related DOI:	https://doi.org/10.1145/3517139

Submission history

From: Dinesh Kumar Vishwakarma Dr [view email]
[v1] Tue, 15 Dec 2020 12:47:17 UTC (1,646 KB)

Computer Science > Multimedia

Title:A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Multimedia

Title:A Deep Multi-Level Attentive network for Multimodal Sentiment Analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators