Title
Robust Dependency Parsing of Spontaneous Japanese Spoken Language
Abstract
Spontaneously spoken Japanese includes a lot of grammatically ill-formed linguistic phenomena such as fillers, hesitations, inversions, and so on, which do not appear in written language. This paper proposes a novel method of robust dependency parsing using a large-scale spoken language corpus, and evaluates the availability and robustness of the method using spontaneously spoken dialogue sentences. By utilizing stochastic information about the appearance of ill-formed phenomena, the method can robustly parse spoken Japanese including fillers, inversions, or dependencies over utterance units. Experimental results reveal that the parsing accuracy reached 87.0%, and we confirmed that it is effective to utilize the location information of a bunsetsu, and the distance information between bunsetsus as stochastic information.
Year
DOI
Venue
2005
10.1093/ietisy/e88-d.3.545
IEICE Transactions
Keywords
DocType
Volume
written language,parsing accuracy,stochastic information,novel method,ill-formed phenomenon,location information,robust dependency parsing,dialogue sentence,grammatically ill-formed linguistic phenomenon,spontaneous japanese spoken language,distance information,language corpus,dependency parsing
Journal
E88-D
Issue
ISSN
Citations 
3
1745-1361
5
PageRank 
References 
Authors
0.60
0
4
Name
Order
Citations
PageRank
Tomohiro Ohno13110.06
Shigeki Matsubara217943.41
Nobuo Kawaguchi331364.23
Yasuyoshi Inagaki424344.27