Abstract | ||
---|---|---|
We present an exploratory tool that extracts person names from multilingual
news collections, matches name variants referring to the same person, and
infers relationships between people based on the co-occurrence of their names
in related news. A novel feature is the matching of name variants across
languages and writing systems, including names written with the Greek, Cyrillic
and Arabic writing system. Due to our highly multilingual setting, we use an
internal standard representation for name representation and matching, instead
of adopting the traditional bilingual approach to transliteration. This work is
part of the news analysis system NewsExplorer that clusters an average of
25,000 news articles per day to detect related news within the same and across
different languages. |
Year | Venue | Keywords |
---|---|---|
2006 | Clinical Orthopaedics and Related Research | information retrieval,internal standard |
DocType | Volume | ISSN |
Journal | abs/cs/060 | Journal CORELA - Cognition, Representation, Langage. Numeros
speciaux, Le traitement lexicographique des noms propres. December 2005. ISSN
1638-5748 |
Citations | PageRank | References |
10 | 1.04 | 8 |
Authors | ||
7 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bruno Pouliquen | 1 | 678 | 58.19 |
Ralf Steinberger | 2 | 949 | 79.70 |
Camelia Ignat | 3 | 456 | 42.11 |
Irina Temnikova | 4 | 56 | 5.51 |
Anna Widiger | 5 | 269 | 15.89 |
Wajdi Zaghouani | 6 | 197 | 21.27 |
Jan Zizka | 7 | 49 | 11.47 |