Abstract | ||
---|---|---|
The Polish text corpus was analysed to find information about phoneme statistics. We were especially interested in triphones as they are commonly used in many speech processing applications like HTK speech recogniser. An attempt to create the full list of triphones for Polish language is presented. A vast amount of phonetically transcribed text was analysed to obtain the frequency of triphone occurrences. A distibution of frequency of triphones occuring and other phenomena are presented. The standard phonetic alphabet for Polish and methods of providing phonetic transcriptions are described. |
Year | DOI | Venue |
---|---|---|
2007 | 10.1007/978-3-642-04235-5_6 | Human Language Technology. Challenges of the Information Society |
Keywords | Field | DocType |
speech processing application,polish text corpus,polish language,phonetically transcribed text,triphones occuring,htk speech recogniser,standard phonetic alphabet,phonetic transcription,triphone statistics,full list,phoneme statistic,speech processing | Triphone,Speech processing,Transcription (linguistics),Computer science,Polish,Text corpus,Speech recognition,Artificial intelligence,Natural language processing,Statistics,Alphabet | Conference |
Volume | ISSN | Citations |
5603 | 0302-9743 | 3 |
PageRank | References | Authors |
0.50 | 4 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bartosz Ziólko | 1 | 46 | 15.76 |
Jakub Galka | 2 | 44 | 7.47 |
Suresh Manandhar | 3 | 1238 | 88.99 |
Richard C. Wilson | 4 | 1754 | 137.60 |
mariusz ziolko | 5 | 9 | 2.36 |