Title
A Strategy for Automatically Extracting References from PDF Documents
Abstract
The number of citations an author receives is becoming more important than the length of their publication list. Automatically extracting bibliographic references from scientific articles remains a difficult problem in Document Engineering, even when the document is originally in digital form. This paper presents a strategy for extracting references from scientific documents in PDF format. The proposed scheme was validated on the Live Memory platform, which was developed to generate digital libraries of the proceedings of technical events.
Year
2012
DOI
10.1109/DAS.2012.12
Venue
Document Analysis Systems
Keywords
scientific document, scientific article, digital library, difficult problem, digital form, document engineering, automatic extraction, bibliographic reference, pdf format, live memory platform, automatically extracting references, pdf documents, image retrieval, accuracy, digital libraries, information extraction, regular expression, document processing, classification algorithms, data mining
Field
Regular expression, World Wide Web, Information retrieval, Computer science, Document management system, Document clustering, Document engineering, Document processing, Image retrieval, Information extraction, Digital library
DocType
Conference
ISBN
978-1-4673-0868-7
Citations
0
PageRank
0.34
References
3
Authors
3
Name
Order
Citations
PageRank
Neide Ferreira Alves161.30
Rafael Dueire Lins257175.79
Maria Lencastre35118.51