Title
UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese.
Abstract
In order to construct an annotated diachronic corpus of Japanese, we propose to create a new dictionary for morphological analysis of Early Middle Japanese (Classical Japanese) based on UniDic, a dictionary for Contemporary Japanese. Differences between the Early Middle Japanese and Contemporary Japanese, which prevent a naive adaptation of UniDic to Early Middle Japanese, are found at the levels of lexicon, morphology, grammar, orthography and pronunciation. In order to overcome these problems, we extended dictionary entries and created a training corpus of Early Middle Japanese to adapt UniDic for Contemporary Japanese to Early Middle Japanese. Experimental results show that the proposed UniDic-EMJ, a new dictionary for Early Middle Japanese, achieves as high accuracy (97%) as needed for the linguistic research on lexicon and grammar in Japanese classical text analysis.
Year
Venue
Keywords
2012
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
Morphological Analysis,Classical Japanese,Early Middle Japanese,Historical Corpus of Japanese
Field
DocType
Citations 
Pronunciation,Computer science,Grammar,Orthography,Lexicon,Artificial intelligence,Natural language processing,Morphological analysis
Conference
1
PageRank 
References 
Authors
0.36
3
4
Name
Order
Citations
PageRank
Toshinobu Ogiso1797.42
Mamoru Komachi224144.56
Yasuharu Den314526.23
yuji matsumoto43008300.05