Title
Parsing and Subcategorization Data
Abstract
In this paper, we compare the per- formance of a state-of-the-art statistical parser (Bikel, 2004) in parsing written and spoken language and in generating sub- categorization cues from written and spo- ken language. Although Bikel's parser achieves a higher accuracy for parsing written language, it achieves a higher ac- curacy when extracting subcategorization cues from spoken language. Additionally, we explore the utility of punctuation in helping parsing and extraction of subcat- egorization cues. Our experiments show that punctuation is of little help in pars- ing spoken language and extracting sub- categorization cues from spoken language. This indicates that there is no need to add punctuation in transcribing spoken cor- pora simply in order to help parsers.
Year
Venue
Keywords
2006
ACL
written language,state-of-the-art statistical parser,subcategorization data,subcategorization frame,written text,subcategorization cue,current technology,higher accuracy
Field
DocType
Volume
Transcription (linguistics),Subcategorization,Computer science,Written language,Natural language processing,Artificial intelligence,Parsing,Linguistics,Punctuation,Spoken language
Conference
P06-2
Citations 
PageRank 
References 
1
0.36
20
Authors
2
Name
Order
Citations
PageRank
Jianguo Li110.36
Chris Brew232144.44