Title
Creating a dead poets society: extracting a social network of historical persons from the web
Abstract
We present a simple method to extract information from search engine snippets. Although the techniques presented are domain independent, this work focuses on extracting biographical information about historical persons from multiple unstructured sources on the Web. We first find a list of persons and their periods of life by querying the periods and scanning the retrieved snippets for person names. Subsequently, we find biographical information for the persons extracted. To gain insight into the mutual relations among the persons identified, we create a social network using co-occurrences on the Web. Although we use uncontrolled and unstructured Web sources, the information extracted is reliable. Moreover, we show that Web Information Extraction can be used to create both informative and enjoyable applications.
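The co-occurrence step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `cooccurrence_network`, the plain substring matching, and the toy snippets are all assumptions made for illustration, and no search engine is queried.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_network(snippets, names):
    """Count, for each pair of names, the snippets in which both appear.

    Hypothetical helper: matching is a case-insensitive substring test,
    which is an assumption and not the method described in the paper.
    """
    edges = Counter()
    for snippet in snippets:
        text = snippet.lower()
        # Names found in this snippet, sorted so each pair has one canonical order.
        present = sorted(n for n in names if n.lower() in text)
        for pair in combinations(present, 2):
            edges[pair] += 1
    return edges


# Toy snippets written for illustration; they are not search engine output.
snippets = [
    "Vincent van Gogh and Paul Gauguin shared a house in Arles.",
    "Paul Gauguin corresponded with Vincent van Gogh.",
    "Claude Monet painted water lilies at Giverny.",
]
names = ["Vincent van Gogh", "Paul Gauguin", "Claude Monet"]
network = cooccurrence_network(snippets, names)
```

Each key in `network` is a pair of names and each value is the number of snippets mentioning both, which could serve as an edge weight in a social network graph.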
Year: 2007
DOI: 10.1007/978-3-540-76298-0_12
Venue: ISWC/ASWC
Keywords: social network, unstructured web source, web information extraction, simple method, historical person, dead poets society, enjoyable application, multiple unstructured source, biographical information, person name, search engine snippet, mutual relation, information extraction, search engine
Field: Data mining, World Wide Web, Search engine, Social network, Information retrieval, Computer science, Information extraction, Social Semantic Web, Web information, Database
DocType: Conference
Volume: 4825
ISSN: 0302-9743
ISBN: 3-540-76297-3
Citations: 2
PageRank: 0.40
References: 14
Authors: 2
Name | Order | Citations | PageRank
Gijs Geleijnse | 1 | 174 | 15.55
Jan Korst | 2 | 175 | 19.94