Abstract | ||
---|---|---|
We propose new data selection approaches based on speaker discriminability features, including kurtosis and, a set of nasality features which exploit spectral properties of nasal speech sounds. Data selected based on the speaker discriminability features are used to implement end-to-end speaker recognition systems, which produce significant improvements when combined with the baseline system (which uses the speech-only data regions determined by a speech/non-speech detector), where the optimal combination of systems produces roughly a 24% improvement over the baseline. Results suggest that focusing the modeling power on data regions selected via the kurtosis and nasality speaker discriminability features, part of which are often discarded in the speech/non-speech detection process, can improvement speaker recognition. |
Year | Venue | Keywords |
---|---|---|
2011 | 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | speaker recognition, kurtosis, nasality features, data selection |
Field | DocType | Citations |
Spectral properties,Nasality,Data selection,Pattern recognition,Computer science,Speech recognition,Speaker recognition,Artificial intelligence,Baseline system,Nasal speech,Detector,Kurtosis | Conference | 1 |
PageRank | References | Authors |
0.35 | 5 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Howard Lei | 1 | 112 | 6.90 |
Nikki Mirghafori | 2 | 209 | 27.18 |