Abstract | ||
---|---|---|
This paper discusses a speaker-sampling problem in data collection. In collecting a huge speaker-corpus, the sampling of the speakers becomes a very important problem. In the Voice Across Japan (VAJ) project, we have gathered 8,773 speakers' data through a telephone line in proportion to population as possible as we can, and also have collected the speakers' information such as gender, age, and growing area. We discuss the problem where the parameters of the speakers such as, the "age" or "growing place" influence the speech recognition. If such parameters are not so important, both data collection and product realization will be easier. In our experiments, it was clearly found that the "speakers' age" is the strongest factor and that the "growing area", also, have an influence in the range of 2-4%. |
Year | DOI | Venue |
---|---|---|
1996 | 10.1109/ICASSP.1996.543248 | ICASSP |
Keywords | Field | DocType |
speech recognition,strongest factor,product realization,telephone line,speaker-sampling problem,japan database,huge speaker-corpus,important problem,data collection,sampling methods,telephony,parameter estimation,background noise,feedback,age,databases | Population,Data collection,Computer science,Speech recognition,Speaker recognition,Sampling (statistics),Estimation theory,Telephone line | Conference |
ISBN | Citations | PageRank |
0-7803-3192-3 | 1 | 0.41 |
References | Authors | |
2 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
I. Kudo | 1 | 1 | 0.41 |
T. Nakama | 2 | 1 | 0.41 |
T. Watanabe | 3 | 252 | 51.28 |