Title | ||
---|---|---|
Automatic Generation of Integration and Preprocessing Ontologies for Biomedical Sources in a Distributed Scenario |
Abstract | ||
---|---|---|
Access to a large number of remote data sources has boosted research in biomedicine, where many biological and clinical research projects rely on collaboration among international organizations. In this scenario, the authors have developed various methods and tools for database integration using an ontological approach. This paper describes a method to automatically generate preprocessing structures (ontologies) within an ontology-based KDD model. These ontologies are obtained by analyzing the data sources and searching for: (i) valid numerical ranges (using clustering techniques), (ii) different scales, (iii) synonym transformations based on known dictionaries, and (iv) typographical errors. To test the method, experiments were carried out with four biomedical databases (covering rheumatoid arthritis, gene expression patterns, biological processes and breast cancer patients), which demonstrated the performance of the approach. The method supports experts in data analysis processes by facilitating the detection of inconsistencies. |
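
As a rough illustration of two of the checks listed in the abstract (numerical ranges found by clustering, and typographical errors), the sketch below uses a simple gap-based 1-D clustering and a string-similarity comparison. It is not the authors' implementation: the function names `numeric_ranges` and `likely_typos`, the `gap_factor` and `min_ratio` thresholds, and the sample values are all assumptions made for the example.

```python
from collections import Counter
from difflib import SequenceMatcher
from statistics import median


def numeric_ranges(values, gap_factor=3.0):
    """Group 1-D values into ranges, splitting wherever the gap between
    sorted neighbours exceeds gap_factor times the median gap.
    (Hypothetical stand-in for the clustering step; not the paper's algorithm.)"""
    xs = sorted(values)
    if len(xs) < 2:
        return [(x, x) for x in xs]
    gaps = [b - a for a, b in zip(xs, xs[1:])]
    threshold = gap_factor * median(gaps)
    ranges, start = [], xs[0]
    for a, b in zip(xs, xs[1:]):
        if b - a > threshold:
            ranges.append((start, a))  # close the current range
            start = b                  # open a new one
    ranges.append((start, xs[-1]))
    return ranges


def likely_typos(terms, min_ratio=0.85):
    """Map each term to a more frequent, very similar term, if one exists.
    Similarity is difflib's ratio; the threshold is an assumed value."""
    counts = Counter(terms)
    suggestions = {}
    for rare, n_rare in counts.items():
        candidates = [
            (n, other)
            for other, n in counts.items()
            if n > n_rare
            and SequenceMatcher(None, rare, other).ratio() >= min_ratio
        ]
        if candidates:
            # Prefer the most frequent similar term as the correction.
            suggestions[rare] = max(candidates)[1]
    return suggestions


# Made-up sample data: heights recorded in metres and in centimetres,
# and a diagnosis column containing one misspelling.
heights = [1.60, 1.72, 1.81, 165, 170, 178]
diagnoses = ["carcinoma"] * 5 + ["carcinmoa"] + ["benign"] * 3

height_ranges = numeric_ranges(heights)
typo_map = likely_typos(diagnoses)
```

Two separate ranges for a single attribute, as with the made-up heights here, would hint at mixed scales, which is the kind of inconsistency the abstract says the generated ontologies are meant to surface for expert review.
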
Year | DOI | Venue |
---|---|---|
2008 | 10.1109/CBMS.2008.71 | CBMS |
Keywords | Field | DocType |
---|---|---|
ontological approach,different scale,biological process,remote data source,preprocessing ontologies,automatic generation,breast cancer patient,data analysis process,various method,biomedical databases,clinical research project,biomedical sources,data source,dictionaries,data mining,distributed processing,numerical range,breast cancer,testing,data analysis,gene expression,biological processes,ontologies,preprocessing,database management systems,distributed databases,clinical research,database integration,typographical error,databases | Data integration,Ontology (information science),Ontology,Data mining,Data analysis,Computer science,Preprocessor,Distributed database,Typographical error,Cluster analysis | Conference |
ISSN | Citations | PageRank |
---|---|---|
2372-9198 | 2 | 0.42 |
References | Authors |
---|---|
10 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Alberto Anguita | 1 | 82 | 9.29 |
David Pérez-Rey | 2 | 135 | 20.28 |
José Crespo | 3 | 126 | 24.90 |
Víctor Maojo | 4 | 48 | 4.79 |