Title
Automatic Generation of Spatial Tactile Effects by Analyzing Cross-modality Features of a Video
Abstract
Tactile effects can enhance the user experience of multimedia content. However, generating appropriate tactile stimuli without any human intervention remains a challenge. While visual or audio information alone has been used to automatically generate tactile effects, utilizing cross-modal information may further improve the spatiotemporal synchronization and the user experience of those effects. In this paper, we present a pipeline for the automatic generation of vibrotactile effects that extracts both visual and audio features from a video. Two neural network models are used to extract the diegetic audio content and to localize the sounding object in the scene; these models then determine the spatial distribution and the intensity of the tactile effects. To evaluate our method, we conducted a user study comparing videos with tactile effects generated by our method against both the original videos without any tactile stimuli and videos with tactile effects generated from visual features only. The results show that our cross-modal method produces tactile effects with better spatiotemporal synchronization than the existing visual-based method and provides a more immersive user experience.
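The abstract describes the pipeline only at a high level: a separated diegetic audio track drives the intensity of the vibrotactile effect, while the localized sounding object drives its spatial distribution across the actuators. The Python sketch below illustrates one plausible form of that mapping; the function names, the 4x4 actuator grid, the RMS audio envelope, and the Gaussian spatial falloff are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of a cross-modal tactile mapping like the one the
# abstract describes. All names, shapes, and the mapping itself are
# illustrative assumptions, not the paper's actual method.
import numpy as np

def audio_intensity(diegetic_audio: np.ndarray, frame_len: int = 1024) -> np.ndarray:
    """Per-frame RMS envelope of an (assumed already separated) diegetic track."""
    n_frames = len(diegetic_audio) // frame_len
    frames = diegetic_audio[: n_frames * frame_len].reshape(n_frames, frame_len)
    return np.sqrt((frames ** 2).mean(axis=1))

def spatial_weights(obj_xy: tuple, grid_shape: tuple = (4, 4), sigma: float = 0.2) -> np.ndarray:
    """Gaussian falloff around the localized sounding object (normalized image coords)."""
    ys, xs = np.meshgrid(np.linspace(0, 1, grid_shape[0]),
                         np.linspace(0, 1, grid_shape[1]), indexing="ij")
    d2 = (xs - obj_xy[0]) ** 2 + (ys - obj_xy[1]) ** 2
    w = np.exp(-d2 / (2 * sigma ** 2))
    return w / w.max()

def tactile_frame(intensity: float, obj_xy: tuple) -> np.ndarray:
    """Per-actuator vibration amplitude for one video frame."""
    return intensity * spatial_weights(obj_xy)

# Example: a loud event localized near the upper-left of the frame.
audio = np.random.default_rng(0).standard_normal(48000)   # stand-in for separated audio
env = audio_intensity(audio)
print(tactile_frame(env[0], obj_xy=(0.25, 0.2)).round(2))
```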
Year: 2020
DOI: 10.1145/3385959.3418459
Venue: SUI
DocType: Conference
Citations: 0
PageRank: 0.34
References: 0
Authors: 4
Name             Order  Citations  PageRank
Kai Zhang        1      1145       6.69
Lawrence H. Kim  2      68         6.47
Yipeng Guo       3      0          0.34
Sean Follmer     4      8535       6.83