Title
Big Scholarly Data in CiteSeerX: Information Extraction from the Web
Abstract
We examine CiteSeerX, an intelligent system designed with the goal of automatically acquiring and organizing large-scale collections of scholarly documents from the world wide web. From the perspective of automatic information extraction and modes of alternative search, we examine various functional aspects of this complex system with an eye towards ongoing and future research developments.
Year
DOI
Venue
2015
10.1145/2740908.2741736
WWW (Companion Volume)
Keywords
Field
DocType
scholarly big data, citeseerx, information acquisition and extraction, digital library search engine, intelligent systems
Data mining,World Wide Web,Computer science,Information extraction
Conference
Citations 
PageRank 
References 
0
0.34
25
Authors
5
Name
Order
Citations
PageRank
Ororbia II Alexander G.112117.83
Jian Wu2452.92
Madian Khabsa323718.81
Kyle Williams420821.61
C. Lee Giles5111541549.48