Abstract | ||
---|---|---|
Recent advancements in the field of artificial intelligence has resulted in development of natural language processing using speech signals recorded over communication channels. Typically these speech recordings are segmented over a fixed frame length, usually 25ms, and used for training and testing mathematical models. For applications like speaker recognition and verification, this traditional methodology results in inclusion of lot of unvoiced and silence segments. To eliminate these redundancies, in this paper we propose a novel pitch synchronous segmentation algorithm robust to noise content and variations in speech recording parameters like sampling frequency. With two datasets coming from a handheld device and noise proof studio quality recordings, performance of this algorithm has been tested. With a maximum accuracy of 96.5%, this algorithm provides encouraging results. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1109/WTS48268.2020.9198720 | 2020 Wireless Telecommunications Symposium (WTS) |
Keywords | DocType | ISSN |
Speech segmentation,Fourier transform,recognition,tele monitoring | Conference | 1934-5070 |
ISBN | Citations | PageRank |
978-1-7281-4696-6 | 0 | 0.34 |
References | Authors | |
4 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Sai Bharadwaj Appakaya | 1 | 1 | 1.38 |
Seyed Alireza Khoshnevis | 2 | 0 | 0.68 |
Ehsan Sheybani | 3 | 15 | 8.17 |
Ravi Sankar | 4 | 656 | 55.66 |