Title
Gathering and ranking photos of named entities with high precision, high recall, and diversity
Abstract
Knowledge-sharing communities like Wikipedia and automated extraction methods like those of DBpedia enable the construction of large machine-processible knowledge bases with relational facts about entities. These endeavors lack multimodal data like photos and videos of people and places. While photos of famous entities are abundant on the Internet, they are much harder to retrieve for less popular entities such as notable computer scientists or regionally interesting churches. Querying the entity names in image search engines yields large candidate lists, but they often have low precision and unsatisfactory recall. Our goal is to populate a knowledge base with photos of named entities, with high precision, high recall, and diversity of photos for a given entity. We harness relational facts about entities for generating expanded queries to retrieve different candidate lists from image search engines. We use a weighted voting method to determine better rankings of an entity's photos. Appropriate weights are dependent on the type of entity (e.g., scientist vs. politician) and automatically computed from a small set of training entities. We also exploit visual similarity measures based on SIFT features, for higher diversity in the final rankings. Our experiments with photos of persons and landmarks show significant improvements of ranking measures like MAP and NDCG, and also for diversity-aware ranking.
Year
DOI
Venue
2010
10.1145/1718487.1718541
WSDM
Keywords
Field
DocType
popular entity,entity name,high recall,ranking photo,final ranking,different candidate list,harness relational fact,better ranking,high precision,famous entity,diversity-aware ranking,training entity,knowledge base,query expansion,ranking
Data mining,Learning to rank,Query expansion,Information retrieval,Ranking,Computer science,Exploit,Weighted voting,Knowledge base,Recall,The Internet
Conference
Citations 
PageRank 
References 
28
1.05
24
Authors
3
Name
Order
Citations
PageRank
Bilyana Taneva141014.37
Mouna Kacimi225719.82
Gerhard Weikum3127102146.01