Title
Bottom-Up Unsupervised Word Discovery via Acoustic Units.
Abstract
Unsupervised term discovery is the task of identifying and grouping recurring word-like patterns in untranscribed audio data. It facilitates unsupervised acoustic model training in zero-resource settings, where little or no transcribed speech is available. In this paper, we investigate two-step bottom-up approaches for unsupervised discovery of word-like units. The first step discovers phone-like acoustic units from the data, and the second step combines these basic acoustic blocks to identify word-like units. We investigate Embedded Segmental K-means (ESK-Means) and the Nested Hierarchical Pitman-Yor (PYR) model as bottom-up strategies. ESK-Means iteratively selects boundaries from an initial set to arrive at the word boundaries, so its final performance depends critically on the quality of the initial boundaries. We use a segmentation method that discovers boundaries much closer to the actual boundaries. The PYR model has previously been used for word segmentation of text with the spaces removed; here we use it for word discovery from unsupervised acoustic units. Term discovery performance is evaluated on the Zero Resource 2017 challenge dataset, which consists of around 70 hours of unlabelled data. Our systems outperformed the baseline systems on all languages without language-specific parameter tuning. We also performed comprehensive experiments on the effect of the system parameters on system performance.
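As a rough illustration of the boundary-selection idea the abstract attributes to ESK-Means, the sketch below picks, by dynamic programming, the subset of candidate boundaries that minimizes a K-means style cost for fixed cluster centroids. It is not the authors' implementation: the mean-of-frames segment embedding, the duration-weighted squared distance, the `max_span` limit, and all function names are assumptions made for illustration only.

```python
import numpy as np


def segment_embedding(frames):
    """Map a variable-length segment to a fixed-size vector.

    Assumption: a plain mean over frames stands in for whatever
    embedding method a real system would use.
    """
    return frames.mean(axis=0)


def select_boundaries(features, candidates, centroids, max_span=4):
    """Choose a subset of candidate boundaries with minimal total cost.

    features:   (T, D) array of frame-level features
    candidates: sorted frame indices; must start at 0 and end at T
    centroids:  (K, D) array of current cluster means
    max_span:   hypothetical limit on how many candidate intervals
                one word-like segment may cover
    """
    n = len(candidates)
    best = np.full(n, np.inf)   # best[j]: minimal cost of segmenting up to candidate j
    best[0] = 0.0
    back = np.zeros(n, dtype=int)  # back[j]: previous boundary on the best path

    for j in range(1, n):
        for i in range(max(0, j - max_span), j):
            start, end = candidates[i], candidates[j]
            emb = segment_embedding(features[start:end])
            # Squared distance to the nearest centroid, weighted by duration.
            dist = np.min(np.sum((centroids - emb) ** 2, axis=1))
            cost = best[i] + (end - start) * dist
            if cost < best[j]:
                best[j], back[j] = cost, i

    # Trace the best path back from the final boundary.
    chosen = [n - 1]
    while chosen[-1] != 0:
        chosen.append(back[chosen[-1]])
    return [candidates[k] for k in reversed(chosen)]
```

In a full iterative scheme this selection step would alternate with re-estimating the centroids from the chosen segments. Because the search only ever considers the supplied candidates, a true word boundary that is missing from the initial set cannot be recovered, which is consistent with the abstract's remark about the importance of the initial boundaries.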
Year: 2019
DOI: 10.1109/GlobalSIP45357.2019.8969225
Venue: GlobalSIP
Field: Pattern recognition, Segmentation, Computer science, Top-down and bottom-up design, Text segmentation, Unsupervised learning, Artificial intelligence, Acoustic model
DocType: Conference
Citations: 0
PageRank: 0.34
References: 0
Authors: 6
Name                 Order  Citations  PageRank
Saurabhchand Bhati   1      0          0.34
Chunxi Liu           2      0          0.34
Jesús A. Villalba    3      6          4.49
Jan Trmal            4      235        20.91
Sanjeev Khudanpur    5      2155       202.00
N. Dehak             6      1269       92.64