Title | ||
---|---|---|
An Unsupervised Approach For Precise Context Identification From Unstructured Text Documents |
Abstract | ||
---|---|---|
The majority of the documents produced and exchanged through medias and social networks are unstructured. Due to the amount of these unstructured documents on the Web, their exploitation represents a tedious or even impossible task for human beings without assistance by dedicated algorithms and specialized computer systems in document classification or information extraction. To be efficient and relevant, such systems have to understand the content of these unstructured documents. The context (or topic) of a document is one of the basic information essential for the understanding of its content, and the more precise the context of a document, the more relevant its understanding will be. This paper presents a precise context identification approach that is evaluated quantitatively and qualitatively on several reference corpora and compared to other context identification systems. The contexts identified by our model are much more precise than those identified by these others systems. |
Year | DOI | Venue |
---|---|---|
2020 | 10.1109/ICTAI50040.2020.00130 | 2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) |
Keywords | DocType | ISSN |
accurate context extraction, unstructured textual document, text mining, semantic analysis | Conference | 1082-3409 |
Citations | PageRank | References |
0 | 0.34 | 0 |
Authors | ||
5 |
Name | Order | Citations | PageRank |
---|---|---|---|
Maha Mallek | 1 | 0 | 0.34 |
Sébastien Fournier | 2 | 37 | 16.72 |
Ramzi Guetari | 3 | 1 | 1.75 |
Bernard Espinasse | 4 | 123 | 24.92 |
Wided Lejouad Chaari | 5 | 29 | 11.14 |