Title | ||
---|---|---|
A Robust Audio Similarity Estimation Method for Audio Alignment in Mobile Karaoke Apps |
Abstract | ||
---|---|---|
With smartphones further integrating into our lives, more people start to sing using mobile karaoke apps instead of going to a KTV club. However, the playback and record APIs of Android systems do not respond in real-time when called. Thus, an Android karaoke app will have to align the record music and the original accompaniment when super-posing those two audios. Dynamic time warping (DTW) based algorithms are usually used to find the optimal alignment between two audios and yield best result so far. In this paper, we propose a simple yet robust approach by considering waveform similarities to solve this problem. Experimental results show that our method outperforms the state-of-the-art method in both accuracy and robustness across different genres and devices. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1145/2578726.2578802 | ICMR |
Keywords | Field | DocType |
mobile karaoke,robust audio similarity estimation,mobile karaoke apps,dynamic time warping,different genre,audio alignment,best result,record apis,record music,android system,ktv club,state-of-the-art method,user experience | User experience design,Android (operating system),Dynamic time warping,Computer science,Waveform,Robustness (computer science),Speech recognition | Conference |
Citations | PageRank | References |
0 | 0.34 | 8 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jinghui Mo | 1 | 107 | 3.63 |
Yansong Feng | 2 | 735 | 64.17 |
Dongyan Zhao | 3 | 998 | 96.35 |