Abstract | ||
---|---|---|
In this paper, we compare the per- formance of a state-of-the-art statistical parser (Bikel, 2004) in parsing written and spoken language and in generating sub- categorization cues from written and spo- ken language. Although Bikel's parser achieves a higher accuracy for parsing written language, it achieves a higher ac- curacy when extracting subcategorization cues from spoken language. Additionally, we explore the utility of punctuation in helping parsing and extraction of subcat- egorization cues. Our experiments show that punctuation is of little help in pars- ing spoken language and extracting sub- categorization cues from spoken language. This indicates that there is no need to add punctuation in transcribing spoken cor- pora simply in order to help parsers. |
Year | Venue | Keywords |
---|---|---|
2006 | ACL | written language,state-of-the-art statistical parser,subcategorization data,subcategorization frame,written text,subcategorization cue,current technology,higher accuracy |
Field | DocType | Volume |
Transcription (linguistics),Subcategorization,Computer science,Written language,Natural language processing,Artificial intelligence,Parsing,Linguistics,Punctuation,Spoken language | Conference | P06-2 |
Citations | PageRank | References |
1 | 0.36 | 20 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Jianguo Li | 1 | 1 | 0.36 |
Chris Brew | 2 | 321 | 44.44 |