Abstract | ||
---|---|---|
State-of-the-art named entity disambiguation approaches tend to perform poorly on social media content, and microblogs in particular. Tweets are processed individually and the richer, microblog-specific context is largely ignored. This paper focuses specifically on quantifying the impact on entity disambiguation performance when readily available contextual information is included from URL content, hash tag definitions, and Twitter user profiles. In particular, including URL content significantly improves performance. Similarly, user profile information for @mentions improves recall by over 10﾿% with no adverse impact on precision. We also share a new corpus of tweets, which have been hand-annotated with DBpedia URIs, with high inter-annotator agreement. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1007/978-3-319-18818-8_11 | Extended Semantic Web Conference |
Field | DocType | Volume |
Entity linking,Data mining,Contextual information,Social media,User profile,Information retrieval,Computer science,Microblogging,Hash function,Recall | Conference | 9088 |
ISSN | Citations | PageRank |
0302-9743 | 3 | 0.46 |
References | Authors | |
23 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Genevieve Gorrell | 1 | 266 | 22.00 |
Johann Petrak | 2 | 148 | 11.09 |
Kalina Bontcheva | 3 | 2538 | 211.33 |