Title
Automatic Generation of Integration and Preprocessing Ontologies for Biomedical Sources in a Distributed Scenario
Abstract
Access to a large number of remote data sources has boosted research in biomedicine, where different biological and clinical research projects are based on collaborative efforts among international organizations. In this scenario, the authors have developed various methods and tools in the area of database integration, using an ontological approach. This paper describes a method to automatically generate preprocessing structures (ontologies) within an ontology-based KDD model. These ontologies are obtained from the analysis of data sources, searching for: (i) valid numerical ranges (using clustering techniques), (ii) different scales, (iii) synonym transformations based on known dictionaries and (iv) typographical errors. To test the method, experiments were carried out with four biomedical databases ―containing rheumatoid arthritis, gene expression patterns, biological processes and breast cancer patients― proving the performance of the approach. This method supports experts in data analysis processes, facilitating the detection of inconsistencies.
Year
DOI
Venue
2008
10.1109/CBMS.2008.71
CBMS
Keywords
Field
DocType
ontological approach,different scale,biological process,remote data source,preprocessing ontologies,automatic generation,breast cancer patient,data analysis process,various method,biomedical databases,clinical research project,biomedical sources,data source,dictionaries,data mining,distributed processing,numerical range,breast cancer,testing,data analysis,gene expression,biological processes,ontologies,preprocessing,database management systems,distributed databases,clinical research,database integration,typographical error,databases
Data integration,Ontology (information science),Ontology,Data mining,Data analysis,Computer science,Preprocessor,Distributed database,Typographical error,Cluster analysis
Conference
ISSN
Citations 
PageRank 
2372-9198
2
0.42
References 
Authors
10
4
Name
Order
Citations
PageRank
Alberto Anguita1829.29
david perezrey213520.28
José Crespo312624.90
Víctor Maojo4484.79