Agile text mining with Sherlok - Citegraph

Paper Info

Title
Agile text mining with Sherlok

Abstract
The successful development of an intelligent text mining application requires the collaboration of two main stakeholders: subject matter experts and text miners. In this paper, we describe a new methodology, agile text mining to improve that collaboration. Agile text mining is characterized by short development cycles, frequent tasks redefinition and continuous performance monitoring through integration tests. We introduce Sherlok, a system supporting the development of agile text mining applications and present an application to extract mention of neurons from a very large corpus of scientific articles. The resulting code and models are publicly available.

Year	DOI	Venue
2015	10.1109/BigData.2015.7363910	Big Data
Keywords	Field	DocType
natural language processing, text mining, big data, UIMA, agile data science	Data science,Ontology (information science),Data mining,Co-occurrence networks,Concept mining,Text mining,Integration testing,Computer science,Subject-matter expert,Agile software development,Big data	Conference
Citations	PageRank	References
2	0.43	12
Authors
4

Authors (4 rows)

Cited by (2 rows)

References (12 rows)

Name	Order	Citations	PageRank
Renaud Richardet	1	9	1.27
Jean-Cédric Chappelier	2	107	10.47
Shreejoy Tripathy	3	2	0.43
Sean Hill	4	83	7.22

1