Abstract | ||
---|---|---|
We propose a Multigranular Automatic Speech Recognizer. The hypothesis is that speech signal contains information distributed on more different time scales. Many works from various scientific fields ranging from neurobiology to speech technologies, seem to concord on this assumption. In a broad sense, it seems that speech recognition in human is optimal because of a partial parallelization process according to which the left-to-right stream of speech is captured in a multilevel grid in which several linguistic analyses take place contemporarily. Our investigation aims, in this view, to apply these new ideas to the project of more robust and efficient recognizers. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1007/11558590_33 | AI*IA |
Keywords | Field | DocType |
speech recognition,left-to-right stream,multigranular scale speech recognizers,speech signal,efficient recognizers,cognitive view,broad sense,different time scale,multilevel grid,multigranular automatic speech recognizer,linguistic analysis,speech technology,parallel processing | Automatic speech,Speech analytics,Computer science,Speech recognition,Ranging,Artificial intelligence,Natural language processing,Cognition,Grid,Linguistic analysis | Conference |
Volume | ISSN | ISBN |
3673 | 0302-9743 | 3-540-29041-9 |
Citations | PageRank | References |
0 | 0.34 | 5 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Francesco Cutugno | 1 | 76 | 18.01 |
Gianpaolo Coro | 2 | 21 | 8.54 |
Massimo Petrillo | 3 | 10 | 1.00 |