Title
Multi-stream Fusion Model for Social Relation Recognition from Videos.
Abstract
Social relations are ubiquitous in people's daily life. Especially, the widespread of video in social media and intelligent surveillance gives us a new chance to discover the social relations among people. Previous researches mostly focus on the recognition of social relations from texts, blogs, or images. However, these methods are only concentrated on limited social relations and incapable of dealing with video data. In this paper, we address the challenges of social relation recognition by employing a multi-stream model to exploit the abundant multimodal information in videos. First of all, we build a video dataset with 16 categories of social relations annotation according to psychology and sociology studies, named Social Relation In Videos (SRIV), which comprises of 3,124 videos. According to our knowledge, it is the first video dataset for the social relation recognition. Secondly, we propose a multi-stream deep learning model as a benchmark for recognizing social relations, which learns high level semantic information of spatial, temporal, and audio of people's social interactions in videos. Finally, we fuse them with logical regression to achieve accurate recognition. Experimental results show that the multi-stream deep model is effective for social relation recognition on the proposed dataset.
Year
DOI
Venue
2018
10.1007/978-3-319-73603-7_29
Lecture Notes in Computer Science
Keywords
Field
DocType
Social relation,Video analysis,Deep learning
Social relation,Annotation,Social media,Information retrieval,Pattern recognition,Computer science,Semantic information,Exploit,Artificial intelligence,Deep learning
Conference
Volume
ISSN
Citations 
10704
0302-9743
4
PageRank 
References 
Authors
0.40
13
5
Name
Order
Citations
PageRank
Jinna Lv1124.20
Wu Liu227534.53
Lili Zhou340.74
Bin Wu429052.43
Huadong Ma52020179.93