Title
Fusing audio, visual and textual clues for sentiment analysis from multimodal content.
Abstract
A huge number of videos are posted every day on social media platforms such as Facebook and YouTube, making the Internet a virtually unlimited source of information. In the coming decades, coping with such information and mining useful knowledge from it will become an increasingly difficult task. In this paper, we propose a novel methodology for multimodal sentiment analysis, which consists of harvesting sentiments from Web videos using a model that exploits the audio, visual, and textual modalities as sources of information. We use both feature-level and decision-level fusion methods to merge affective information extracted from the multiple modalities. A thorough comparison with existing work in this area is carried out throughout the paper, demonstrating the novelty of our approach. Preliminary comparative experiments on the YouTube dataset show that the proposed multimodal system achieves an accuracy of nearly 80%, outperforming all state-of-the-art systems by more than 20%.
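To make the two fusion strategies named in the abstract concrete, below is a minimal Python sketch contrasting feature-level fusion (concatenating per-modality feature vectors and training a single classifier) with decision-level fusion (training one classifier per modality and merging their output probabilities). The feature dimensions, the choice of scikit-learn logistic regression, and the synthetic data are illustrative assumptions, not the paper's actual pipeline.

# Minimal sketch of feature- vs decision-level fusion on synthetic data.
# Dimensions, classifier, and data are assumptions for illustration only.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 200
audio  = rng.normal(size=(n, 20))   # stand-in for acoustic features
visual = rng.normal(size=(n, 30))   # stand-in for facial-expression features
text   = rng.normal(size=(n, 50))   # stand-in for textual/lexical features
y = rng.integers(0, 2, size=n)      # binary sentiment labels

# Feature-level fusion: concatenate modality features, train one classifier.
X_fused = np.hstack([audio, visual, text])
feature_level_clf = LogisticRegression(max_iter=1000).fit(X_fused, y)

# Decision-level fusion: one classifier per modality, then combine their
# posterior probabilities (here an unweighted average, thresholded at 0.5).
modalities = (audio, visual, text)
clfs = [LogisticRegression(max_iter=1000).fit(X, y) for X in modalities]
avg_prob = np.mean([c.predict_proba(X)[:, 1] for c, X in zip(clfs, modalities)], axis=0)
decision_level_pred = (avg_prob >= 0.5).astype(int)

As a design note, feature-level fusion lets a single model capture cross-modal correlations directly, while decision-level fusion keeps the per-modality models independent and merges only their confidence scores, which makes it robust when one modality is missing or noisy.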
Year: 2016
DOI: 10.1016/j.neucom.2015.01.095
Venue: Neurocomputing
Keywords: Multimodal fusion, Big social data analysis, Opinion mining, Multimodal sentiment analysis, Sentic computing
Field: Modalities, Social media, Information retrieval, Multiple modalities, Computer science, Sentiment analysis, Novelty, Merge (version control), Sentic computing, The Internet
DocType: Journal
Volume: 174
Issue: PA
ISSN: 0925-2312
Citations: 77
PageRank: 1.98
References: 40
Authors (5):
Name             Order  Citations  PageRank
Soujanya Poria   1      1336       60.98
Erik Cambria     2      3873       183.70
Newton Howard    3      253        22.15
Guang-Bin Huang  4      11303      470.52
Amir Hussain     5      705        29.16