Abstract | ||
---|---|---|
In this work, we explore prediction of different physical parameters from speech data. We aim to predict shoulder size and waist size of people from speech data in addition to the conventional height and weight parameters. A data-set with this information is created from 207 volunteers. A bag of words representation based on log magnitude spectrum is used as features. A support vector regression predicts the physical parameters from the bag of the words representation. The system is able to achieve a root mean square error of 6.6 cm for height estimation, 2.6cm for shoulder size, 7.1cm for waist size and 8.9 kg for weight estimation. The results of height estimation is on par with state of the art results. |
Year | DOI | Venue |
---|---|---|
2016 | 10.1109/MLSP.2016.7738873 | 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) |
Keywords | Field | DocType |
Physical parameters,Speech forensics,Height,Weight,Shoulder size,Waist size | Bag-of-words model,Phase spectrum,Mel-frequency cepstrum,Pattern recognition,Computer science,Waist,Support vector machine,Mean squared error,Feature extraction,Speech recognition,Artificial intelligence,Machine learning | Conference |
ISSN | ISBN | Citations |
2161-0363 | 978-1-5090-0747-9 | 0 |
PageRank | References | Authors |
0.34 | 4 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Shareef Babu Kalluri | 1 | 0 | 0.68 |
Ashwin K. Vijayakumar | 2 | 32 | 2.24 |
Deepu Vijayasenan | 3 | 73 | 10.91 |
Rita Singh | 4 | 329 | 48.97 |