Big Scholarly Data in CiteSeerX: Information Extraction from the Web - Citegraph

Paper Info

Title
Big Scholarly Data in CiteSeerX: Information Extraction from the Web

Abstract
We examine CiteSeerX, an intelligent system designed with the goal of automatically acquiring and organizing large-scale collections of scholarly documents from the world wide web. From the perspective of automatic information extraction and modes of alternative search, we examine various functional aspects of this complex system with an eye towards ongoing and future research developments.

Year	DOI	Venue
2015	10.1145/2740908.2741736	WWW (Companion Volume)
Keywords	Field	DocType
scholarly big data, citeseerx, information acquisition and extraction, digital library search engine, intelligent systems	Data mining,World Wide Web,Computer science,Information extraction	Conference
Citations	PageRank	References
0	0.34	25
Authors
5

Authors (5 rows)

Cited by (0 rows)

References (25 rows)

Name	Order	Citations	PageRank
Ororbia II Alexander G.	1	121	17.83
Jian Wu	2	45	2.92
Madian Khabsa	3	237	18.81
Kyle Williams	4	208	21.61
C. Lee Giles	5	11154	1549.48

1