Title
Construction and analysis of Japanese-English broadcast news corpus with named entity tags
Abstract
We aim to acquire named entity (NE) translation knowledge from non-parallel, content-aligned corpora by using NE extraction techniques. For this research, we are constructing a Japanese-English broadcast news corpus with NE tags. The tags represent not only NE class information but also coreference information, both within a single monolingual document and between corresponding Japanese-English document pairs. Analysis of about 1,100 annotated article pairs shows that NE occurrence information for each language, such as NE classes, number of occurrences, and occurrence order, may provide a good clue for identifying corresponding NEs across languages.
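As a rough illustration of how occurrence information could serve as such a clue, the following is a minimal sketch and not the authors' method. It assumes each document is given as a list of (entity id, NE class) mentions in document order, where the entity id stands for a coreference chain. The class labels, the cost function (difference in occurrence count plus difference in first-occurrence rank), and the greedy pairing are all assumptions made for this example.

```python
def profile(mentions):
    """Summarize mentions given as (entity_id, ne_class) in document order.

    Returns {entity_id: (ne_class, occurrence_count, first_occurrence_rank)}.
    """
    prof = {}
    for entity_id, ne_class in mentions:
        if entity_id not in prof:
            # Rank = how many distinct entities appeared before this one.
            prof[entity_id] = [ne_class, 0, len(prof)]
        prof[entity_id][1] += 1
    return {e: tuple(v) for e, v in prof.items()}


def align(ja_mentions, en_mentions):
    """Greedily pair same-class entities with the most similar occurrence profiles."""
    ja_prof = profile(ja_mentions)
    en_prof = profile(en_mentions)

    candidates = []
    for ja_id, (ja_cls, ja_count, ja_rank) in ja_prof.items():
        for en_id, (en_cls, en_count, en_rank) in en_prof.items():
            if ja_cls != en_cls:
                continue  # only entities of the same NE class may correspond
            # Hypothetical cost: smaller means more similar occurrence pattern.
            cost = abs(ja_count - en_count) + abs(ja_rank - en_rank)
            candidates.append((cost, ja_id, en_id))

    pairs, used_ja, used_en = [], set(), set()
    for cost, ja_id, en_id in sorted(candidates):
        if ja_id not in used_ja and en_id not in used_en:
            pairs.append((ja_id, en_id))
            used_ja.add(ja_id)
            used_en.add(en_id)
    return pairs


# Toy article pair (invented data, not taken from the corpus).
ja_doc = [("J1", "PERSON"), ("J2", "LOCATION"), ("J1", "PERSON"), ("J3", "ORGANIZATION")]
en_doc = [("E1", "PERSON"), ("E2", "LOCATION"), ("E3", "ORGANIZATION"), ("E1", "PERSON")]
print(align(ja_doc, en_doc))
```

On the toy pair, each Japanese entity is matched to the English entity of the same class. Real article pairs are not parallel, so counts and order will often disagree, and a practical system would likely combine such clues with transliteration or dictionary evidence.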
Year: 2003
DOI: 10.3115/1119384.1119387
Venue: NER@ACL
Keywords: japanese-english broadcast news corpus, monolingual document, corresponding nes, ne class information, corresponding japanese-english document pair, utilizing ne extraction technique, entity tag, ne tag, ne occurrence information, occurrence order, coreference information
Field: Entity linking, Broadcasting, Coreference, Information retrieval, Computer science, Named entity, Natural language processing, Artificial intelligence
DocType: Conference
Volume: W03-15
Citations: 2
PageRank: 0.39
References: 11
Authors: 4
Name               Order  Citations  PageRank
Tadashi Kumano     1      20         4.23
Hideki Kashioka    2      380        67.59
Hideki Tanaka      3      80         15.07
Takahiro Fukusima  4      49         9.03