Abstract | ||
---|---|---|
The problem of data classification goes back to the definition of taxonomies covering knowledge areas. With the advent of the Web, the amount of data available increased several orders of magnitude, making manual data classification impossible. This work presents an approach based on the prototype theory to automatically classify semi-structured data, represented by frames, without any previous knowledge about structured classes. Our approach uses a variation of the K-Means algorithm that organizes a set of frames into classes, structured as a strict hierarchy. |
Year | DOI | Venue |
---|---|---|
2015 | 10.1109/CIT/IUCC/DASC/PICOM.2015.30 | CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING |
Field | DocType | Citations |
Semi-structured data,Data mining,Informatics,Prototype theory,Information retrieval,XML,Computer science,Data classification,Hierarchy | Conference | 0 |
PageRank | References | Authors |
0.34 | 7 | 3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bernardo Pereira Nunes | 1 | 185 | 30.96 |
Giseli Rabello Lopes | 2 | 107 | 16.44 |
Marco Antonio Casanova | 3 | 178 | 24.18 |