Title
Audio Content Based Geotagging In Multimedia
Abstract
In this paper we propose methods to extract geographically relevant information in a multimedia recording using its audio content. Our method primarily is based on the fact that urban acoustic environment consists of a variety of sounds. Hence, location information can be inferred from the composition of sound events/classes present in the audio. More specifically, we adopt matrix factorization techniques to obtain semantic content of recording in terms of different sound classes. We use semi-NMF to for to do audio semantic content analysis using MFCCs. These semantic information are then combined to identify the location of recording. We show that these semantic content based geotagging can perform significantly better than state of art methods.
Year
DOI
Venue
2017
10.21437/Interspeech.2017-40
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION
Keywords
DocType
Volume
Location Identification, Geotagging, Matrix Factorization, Audio Analysis
Conference
abs/1606.02816
ISSN
Citations 
PageRank 
2308-457X
1
0.41
References 
Authors
12
3
Name
Order
Citations
PageRank
Anurag Kumar151.61
Benjamin Elizalde235922.38
Raj, Bhiksha32094204.63