Abstract | ||
---|---|---|
This paper describes our preliminary study towards a new type of speech enhancement system. To avoid using odd-looking electrolarynx, we used lip-reading function. Our final image is to use a smart phone with camera and audio output to be able to convert the lip motion to speech output. We tested MLP, CNN, and MobileNets image recognition methods. 3k image datasets for training and testing were recorded from five persons. The preliminary experiment indicated that the MobileNets is the most adequate algorithm for smart phone apps. in terms of the recognition accuracy and the calculation cost. |
Year | DOI | Venue |
---|---|---|
2018 | 10.1109/IICAIET.2018.8638466 | 2018 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET) |
Keywords | Field | DocType |
Lips,Speech enhancement,Smart phones,Optical filters,Neural networks,Image recognition | Speech enhancement,Speech output,Computer science,Optical filter,Speech recognition,Mobile device,Smart phone,Electrolarynx,Artificial neural network | Conference |
ISBN | Citations | PageRank |
978-1-5386-7813-8 | 0 | 0.34 |
References | Authors | |
0 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yuta Matsunaga | 1 | 0 | 0.34 |
Kenji Matsui | 2 | 1 | 5.76 |