Abstract | ||
---|---|---|
Clinical data recorded in modern EHRs are very rich, although their secondary use research and medical decision may be complicated (eg, missing and incorrect data, data spread over several clinical databases, information available only within unstructured narrative documents). We propose to address the issue related to the processing of narrative documents in order to detect and extract numerical values and to associate them with the corresponding concepts (or themes) and units. We propose to use a CRF supervised categorisation for the detection of segments (themes, numerical sequences and units) and a rules-based system for the association of these segments among them in order to build semantically meaningful sequences. The average results obtained are competitive (0.96 precision, 0.78 recall, and 0.86 F-measure) and we plan to use the system with larger clinical data. |
Year | DOI | Venue |
---|---|---|
2015 | 10.3233/978-1-61499-512-8-50 | Studies in Health Technology and Informatics |
Keywords | Field | DocType |
Natural Language Processing,Text Mining,Software Design,Information Storage and retrieval,France | Data mining,Unstructured data,Medicine | Conference |
Volume | ISSN | Citations |
210 | 0926-9630 | 1 |
PageRank | References | Authors |
0.36 | 3 | 5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Elise Bigeard | 1 | 1 | 1.03 |
Vianney Jouhet | 2 | 1 | 0.36 |
F Mougin | 3 | 19 | 3.16 |
Frantz Thiessard | 4 | 53 | 10.57 |
Natalia Grabar | 5 | 1 | 0.70 |