Abstract |
---|
The Residue-Driven Architecture presented here is a model of auditory stream segregation from input sounds. A subsystem that extracts auditory streams using some sound attributes is called an agency, and the design of each agency is based on the residue-driven architecture. This architecture consists of three kinds of agents: an event-detector, a tracer-generator, and tracers. The event-detector calculates a residue by subtracting the predicted input from the actual input. When the residue exceeds a threshold value, the tracer-generator generates a tracer that extracts an auditory stream from the residue and returns a predicted input for the next time frame to the event-detector. This approach improves segregation performance, and the resulting system can segregate a woman's voiced stream, a man's voiced stream, and a noise stream from a mixture of these sounds. Binaural segregation is also designed using the architecture. |
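
The loop described in the abstract (residue computation, threshold test, tracer generation, prediction feedback) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Tracer` class, the `segregate` function, the representation of a frame as a non-empty list of per-bin magnitudes, the "each tracer follows one frequency bin and predicts that it persists" rule, and the default threshold of 0.5 are all assumptions chosen only to make the control flow concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Tracer:
    """Hypothetical tracer: follows one frequency bin of the input."""
    bin_index: int
    level: float
    stream: list = field(default_factory=list)

    def predict(self, n_bins):
        # Assumed prediction rule: the tracked bin keeps its last level.
        predicted = [0.0] * n_bins
        predicted[self.bin_index] = self.level
        return predicted

    def update(self, frame):
        # Extract this tracer's part of the frame and remember it
        # as the basis for the next prediction.
        self.level = frame[self.bin_index]
        self.stream.append(self.level)


def segregate(frames, threshold=0.5):
    """Run the residue-driven loop over a sequence of non-empty frames."""
    tracers = []
    for frame in frames:
        n_bins = len(frame)

        # Event-detector: residue = actual input - summed predicted input.
        predicted = [0.0] * n_bins
        for tracer in tracers:
            for i, value in enumerate(tracer.predict(n_bins)):
                predicted[i] += value
        residue = [actual - pred for actual, pred in zip(frame, predicted)]

        # Tracer-generator: if the largest residue exceeds the threshold,
        # create a new tracer for that bin (at most one per frame here).
        peak = max(range(n_bins), key=lambda i: residue[i])
        if residue[peak] > threshold:
            tracers.append(Tracer(bin_index=peak, level=residue[peak]))

        # Tracers: extract their streams and refresh their predictions.
        for tracer in tracers:
            tracer.update(frame)
    return tracers
```

Calling `segregate` on a sequence in which a second component appears one frame after the first yields one tracer per component, because the first component is already explained by a prediction when the second one shows up in the residue. A realistic agency would track richer sound attributes than a single bin; the single-bin rule here is only a placeholder.
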
Year | Venue | Keywords |
---|---|---|
1995 | IJCAI | computational auditory scene analysis, threshold value, resulting system, binaural segregation, actual input, residue-driven architecture, sound attribute, noise stream, auditory stream segregation, next time frame, auditory stream |
Field | DocType | ISSN |
---|---|---|
Architecture, Auditory scene analysis, Time frame, Computer science, Speech recognition, Binaural recording, Computational auditory scene analysis | Conference | 1045-0823 |
ISBN | Citations | PageRank |
---|---|---|
1-55860-363-8 | 19 | 8.44 |
References | Authors |
---|---|
6 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Tomohiro Nakatani | 1 | 1327 | 139.18 |
Hiroshi G. Okuno | 2 | 2092 | 233.19 |
Takeshi Kawabata | 3 | 296 | 51.73 |