Title
Annotating breast cancer microarray samples using ontologies.
Abstract
Breast cancer, the most common cancer among women, results from the accumulation of mutations in essential genes. Recent advances in high-throughput gene expression microarray technology have inspired researchers to use the technology to assist breast cancer diagnosis, prognosis, and treatment prediction. However, the high dimensionality of microarray experiments and public access to data from many experiments have caused inconsistencies, which prompted the development of controlled terminologies and ontologies for annotating microarray experiments, such as the standard Microarray Gene Expression Data (MGED) Ontology (MO). In this paper, we developed BCM-CO, an ontology tailored specifically for indexing clinical annotations of breast cancer microarray samples from the NCI Thesaurus. Our research showed that the coverage of the NCI Thesaurus is very limited with respect to i) terms used by researchers to describe breast cancer histology (covering 22 out of 48 histology terms); ii) breast cancer cell lines (covering one out of 12 cell lines); and iii) classes corresponding to breast cancer grading and staging. By incorporating a wider range of those terms into BCM-CO, we were able to index breast cancer microarray samples from GEO using BCM-CO and the MGED Ontology, and we developed a prototype system with a web interface that allows the retrieval of microarray data based on the ontology annotations.
Year
2008
Venue
AMIA
Keywords
gene expression profiling, natural language processing, documentation
Field
Ontology (information science), Microarray, Information retrieval, Breast cancer, Computer science, Microarray analysis techniques, Computational biology, Gene chip analysis, Microarray databases, Gene expression profiling, Cancer
DocType
Conference
ISSN
1942-597X
Citations
1
PageRank
0.35
References
7
Authors
4
Name | Order | Citations | PageRank
Hongfang Liu | 1 | 1 | 0.35
Xin Li | 2 | 1 | 0.35
Victoria Yoon | 3 | 1 | 0.69
Robert Clarke | 4 | 7 | 1.40