Title
An estimation of speaker sampling in Voice Across Japan database.
Abstract
This paper discusses a speaker-sampling problem in data collection. In collecting a huge speaker-corpus, the sampling of the speakers becomes a very important problem. In the Voice Across Japan (VAJ) project, we have gathered 8,773 speakers' data through a telephone line in proportion to population as possible as we can, and also have collected the speakers' information such as gender, age, and growing area. We discuss the problem where the parameters of the speakers such as, the "age" or "growing place" influence the speech recognition. If such parameters are not so important, both data collection and product realization will be easier. In our experiments, it was clearly found that the "speakers' age" is the strongest factor and that the "growing area", also, have an influence in the range of 2-4%.
Year
DOI
Venue
1996
10.1109/ICASSP.1996.543248
ICASSP
Keywords
Field
DocType
speech recognition,strongest factor,product realization,telephone line,speaker-sampling problem,japan database,huge speaker-corpus,important problem,data collection,sampling methods,telephony,parameter estimation,background noise,feedback,age,databases
Population,Data collection,Computer science,Speech recognition,Speaker recognition,Sampling (statistics),Estimation theory,Telephone line
Conference
ISBN
Citations 
PageRank 
0-7803-3192-3
1
0.41
References 
Authors
2
3
Name
Order
Citations
PageRank
I. Kudo110.41
T. Nakama210.41
T. Watanabe325251.28