Title
Adaptation of a feedforward artificial neural network using a linear transform
Abstract
In this paper we present a novel method for adaptation of a multi-layer perceptron neural network (MLP ANN). Nowadays, the adaptation of the ANN is usually done as an incremental retraining either of a subset or the complete set of the ANN parameters. However, since sometimes the amount of the adaptation data is quite small, there is a fundamental drawback of such approach - during retraining, the network parameters can be easily overfitted to the new data. There certainly are techniques that can help overcome this problem (early-stopping, cross-validation), however application of such techniques leads to more complex and possibly more data hungry training procedure. The proposed method approaches the problem from a different perspective. We use the fact that in many cases we have an additional knowledge about the problem. Such additional knowledge can be used to limit the dimensionality of the adaptation problem. We applied the proposed method on speaker adaptation of a phoneme recognizer based on TRAPS (Temporal Patterns) parameters. We exploited the fact that the employed TRAPS parameters are constructed using log-outputs of mel-filter bank and by virtue of reformulating the first layer weight matrix adaptation problem as a mel-filter bank output adaptation problem, we were able to significantly limit the number of free variables. Adaptation using the proposed method resulted in a substantial improvement of phoneme recognizer accuracy.
Year
DOI
Venue
2010
10.1007/978-3-642-15760-8_54
TSD
Keywords
Field
DocType
adaptation problem,adaptation data,mlp ann,ann parameter,additional knowledge,novel method,mel-filter bank output adaptation,feedforward artificial neural network,layer weight matrix adaptation,speaker adaptation,neural network,cross validation,filter bank,artificial neural network,linear transformation,multi layer perceptron
Matrix (mathematics),Free variables and bound variables,Computer science,Speech recognition,Curse of dimensionality,Time delay neural network,Artificial intelligence,Artificial neural network,Perceptron,Machine learning,Vocal tract,Feed forward
Conference
Volume
ISSN
ISBN
6231
0302-9743
3-642-15759-9
Citations 
PageRank 
References 
9
0.64
4
Authors
3
Name
Order
Citations
PageRank
Jan Trmal123520.91
Jan Zelinka2378.86
Luděk Müller37510.67