Tuning methodology for speech enhancement algorithms using a simulated conversational database and perceptual objective measures - Citegraph

Paper Info

Title
Tuning methodology for speech enhancement algorithms using a simulated conversational database and perceptual objective measures

Abstract
In this paper, we propose a formal methodology for tuning the parameters of a single-microphone speech enhancement system for hands-free devices. The tuning problem is formulated as a large-scale nonlinear programming problem that is solved by a genetic algorithm to determine the global solution. A conversational speech database is automatically generated by modeling the interactivity in telephone conversations, and perceptual objective quality measures are used as the optimization criteria for the automated tuning over the generated database. A subjective listening test is then performed by comparing the automatically tuned system based on objective criteria to the system tuned by expert human listeners. Subjective and objective evaluation result shows that the proposed automated tuning methodology greatly improves the enhanced speech quality, potentially saving resources over manual evaluation, speeding up development and deployment time, and guiding the algorithmic design.

Year	DOI	Venue
2014	10.1109/HSCMA.2014.6843252	Hands-free Speech Communication and Microphone Arrays
Keywords	Field	DocType
audio databases,echo suppression,genetic algorithms,microphones,nonlinear programming,speech enhancement,enhanced speech quality,formal methodology,genetic algorithm,hands-free devices,large-scale nonlinear programming problem,perceptual objective measures,simulated conversational database,single-microphone speech enhancement system,speech database,speech enhancement algorithms,subjective listening test,telephone conversations,tuning methodology,tuning problem,acoustic echo cancellation,conversation analysis,perceptual objective quality,acoustics,databases,speech,tuning,noise	Speech enhancement,Interactivity,Speech processing,Software deployment,Algorithm design,Voice activity detection,Computer science,Nonlinear programming,Speech recognition,Genetic algorithm,Database	Conference
Citations	PageRank	References
2	0.38	18
Authors
4

Authors (4 rows)

Cited by (2 rows)

References (18 rows)

Name	Order	Citations	PageRank
Daniele Giacobello	1	117	12.33
Jason Wung	2	16	4.45
Ramin Pichevar	3	56	9.92
Joshua Atkins	4	7	4.20

1