Title | ||
---|---|---|
Speaker identification based on normalized pitch frequency and Mel Frequency Cepstral Coefficients. |
Abstract | ||
---|---|---|
This paper presents an efficient approach for automatic speaker identification based on cepstral features and the Normalized Pitch Frequency (NPF). Most relevant speaker identification methods adopt a cepstral strategy. Inclusion of the pitch frequency as a new feature in the speaker identification process is expected to enhance the speaker identification accuracy. In the proposed framework for speaker identification, a neural classifier with a single hidden layer is used. Different transform domains are investigated for reliable feature extraction from the speech signal. Moreover, a pre-processing noise reduction step, is used prior to the feature extraction process to enhance the performance of the speaker identification system. Simulation results prove that the NPF as a feature in speaker identification enhances the performance of the speaker identification system, especially with the Discrete Cosine Transform (DCT) and wavelet denoising pre-processing step. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1007/s10772-018-9524-7 | I. J. Speech Technology |
Keywords | Field | DocType |
Speaker identification, MFCCs, Normalized pitch frequency, ANNs | Noise reduction,Mel-frequency cepstrum,Speaker identification,Normalization (statistics),Pattern recognition,Computer science,Discrete cosine transform,Cepstrum,Speech recognition,Feature extraction,Artificial intelligence,Classifier (linguistics) | Journal |
Volume | Issue | ISSN |
21 | 4 | 1381-2416 |
Citations | PageRank | References |
0 | 0.34 | 7 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Marwa A. Nasr | 1 | 0 | 0.34 |
Mohammed Abd-Elnaby | 2 | 20 | 9.94 |
Adel S. El-Fishawy | 3 | 8 | 4.54 |
S. El-Rabaie | 4 | 40 | 14.40 |
Fathi E. Abd El-Samie | 5 | 439 | 87.48 |