Title
A hybrid noise suppression filter for accuracy enhancement of commercial speech recognizers in varying noisy conditions
Abstract
Commercial speech recognizers have made possible many speech control applications such as wheelchair, tone-phone, multifunctional robotic arms and remote controls, for the disabled and paraplegic. However, they have a limitation in common in that recognition errors are likely to be produced when background noise surrounds the spoken command, thereby creating potential dangers for the disabled if recognition errors exist in the control systems. In this paper, a hybrid noise suppression filter is proposed to interface with the commercial speech recognizers in order to enhance the recognition accuracy under variant noisy conditions. It intends to decrease the recognition errors when the commercial speech recognizers are working under a noisy environment. It is based on a sigmoid function which can effectively enhance noisy speech using simple computational operations, while a robust estimator based on an adaptive-network-based fuzzy inference system is used to determine the appropriate operational parameters for the sigmoid function in order to produce effective speech enhancement under variant noisy conditions. The proposed hybrid noise suppression filter has the following advantages for commercial speech recognizers: (i) it is not possible to tune the inbuilt parameters on the commercial speech recognizers in order to obtain better accuracy; (ii) existing noise suppression filters are too complicated to be implemented for real-time speech recognition; and (iii) existing sigmoid function based filters can operate only in a single-noisy condition, but not under varying noisy conditions. The performance of the hybrid noise suppression filter was evaluated by interfacing it with a commercial speech recognizer, commonly used in electronic products. Experimental results show that improvement in terms of recognition accuracy and computational time can be achieved by the hybrid noise suppression filter when the commercial recognizer is working under various noisy environments in factories.
Year
DOI
Venue
2014
10.1016/j.asoc.2013.05.017
Appl. Soft Comput.
Keywords
Field
DocType
sigmoid function,varying noisy condition,real-time speech recognition,recognition accuracy,hybrid noise suppression filter,noisy speech,commercial speech recognizers,commercial speech recognizer,recognition error,accuracy enhancement,variant noisy condition,effective speech enhancement,anfis,speech recognition
Speech enhancement,Speech processing,Background noise,Voice activity detection,Speech recognition,Commercial speech,Artificial intelligence,Adaptive neuro fuzzy inference system,Control system,Mathematics,Machine learning,Sigmoid function
Journal
Volume
ISSN
Citations 
14,
1568-4946
0
PageRank 
References 
Authors
0.34
37
5
Name
Order
Citations
PageRank
Kit Yan Chan147045.36
Pei Chee Yong2365.79
Sven Nordholm340562.82
Ka Fai Cedric Yiu417623.70
H. K. Lam53618193.15