Creation of a new longitudinal corpus of clinical narratives - Citegraph

Paper Info

Title
Creation of a new longitudinal corpus of clinical narratives

Abstract
Display Omitted New corpus of 1304 de-identified medical records.Longitudinal records represent 296 patients.Three patient cohorts: with Coronary Artery Disease (CAD), no CAD, develop CAD.Overview of novel uses for this corpus. The 2014 i2b2/UTHealth Natural Language Processing (NLP) shared task featured a new longitudinal corpus of 1304 records representing 296 diabetic patients. The corpus contains three cohorts: patients who have a diagnosis of coronary artery disease (CAD) in their first record, and continue to have it in subsequent records; patients who do not have a diagnosis of CAD in the first record, but develop it by the last record; patients who do not have a diagnosis of CAD in any record. This paper details the process used to select records for this corpus and provides an overview of novel research uses for this corpus. This corpus is the only annotated corpus of longitudinal clinical narratives currently available for research to the general research community.

Year	DOI	Venue
2015	10.1016/j.jbi.2015.09.018	Journal of Biomedical Informatics
Keywords	Field	DocType
Corpus,Machine learning,Medical records,NLP	CAD,Information retrieval,Narrative,Natural language processing,Medical record,Artificial intelligence,Cohort study,Medicine	Journal
Volume	Issue	ISSN
58	S	1532-0464
Citations	PageRank	References
6	0.56	21
Authors
4

Authors (4 rows)

Cited by (6 rows)

References (21 rows)

Name	Order	Citations	PageRank
vishesh kumar	1	6	0.56
Amber Stubbs	2	115	9.57
stanley shaw	3	6	0.56
Özlem Uzuner	4	1045	67.09

1