Title
BioWeka---extending the Weka framework for bioinformatics
Abstract
Summary: Given the growing amount of biological data, data mining methods have become an integral part of bioinformatics research. Unfortunately, standard data mining tools are often not sufficiently equipped for handling raw data such as amino acid sequences. One popular and freely available framework that contains many well-known data mining algorithms is the Waikato Environment for Knowledge Analysis (Weka). In the BioWeka project, we introduce various input formats for bioinformatics data and bioinformatics methods such as alignments to Weka. This allows users to easily combine them with Weka's classification, clustering, validation and visualization facilities on a single platform. It therefore reduces the overhead of converting data between different data formats as well as the need to write custom evaluation procedures that can deal with many different programs. We encourage users to participate in this project by adding their own components and data formats to BioWeka.
Availability: The software, documentation and tutorial are available at http://www.bioweka.org.
Contact: support@bioweka.org
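The conversion overhead mentioned in the summary can be illustrated with a minimal sketch. It is not BioWeka's implementation: the function names (`parse_fasta`, `composition`, `fasta_to_arff`) and the choice of amino acid composition as features are assumptions made for illustration only. The sketch reads FASTA-formatted protein sequences and writes them in ARFF, the text format Weka reads natively.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order used for the feature columns.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def parse_fasta(text):
    """Yield (identifier, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            parts = line[1:].split()
            # Keep only the first token of the header line as the identifier.
            header, chunks = (parts[0] if parts else ""), []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)


def composition(sequence):
    """Return the relative frequency of each standard amino acid."""
    counts = Counter(sequence)
    total = sum(counts[a] for a in AMINO_ACIDS)
    return [counts[a] / total if total else 0.0 for a in AMINO_ACIDS]


def fasta_to_arff(text, relation="sequences"):
    """Build an ARFF document with one numeric attribute per amino acid."""
    lines = [f"@relation {relation}", "", "@attribute id string"]
    lines += [f"@attribute freq_{a} numeric" for a in AMINO_ACIDS]
    lines += ["", "@data"]
    for ident, seq in parse_fasta(text):
        # Escape backslashes and quotes so the identifier is a valid ARFF string.
        safe = ident.replace("\\", "\\\\").replace("'", "\\'")
        values = ",".join(f"{v:.4f}" for v in composition(seq))
        lines.append(f"'{safe}',{values}")
    return "\n".join(lines) + "\n"


# Hypothetical toy input: two short peptides, the second split over two lines.
EXAMPLE = ">seq1 toy peptide\nACDA\n>seq2\nGG\nGA\n"
print(fasta_to_arff(EXAMPLE))
```

A script like this has to be written and maintained for every format and feature type, which is the kind of glue code that integrating the formats into a single platform is meant to avoid.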
Year: 2007
DOI: 10.1093/bioinformatics/btl671
Venue: Bioinformatics
Keywords: biological data, amino acid sequence, data mining
Field: Data mining, Biological data, Data stream mining, Visualization, Computer science, Raw data, Software, Bioinformatics, Documentation, Cluster analysis, Data mining algorithm, Database
DocType: Journal
Volume: 23
Issue: 5
ISSN: 1367-4803
Citations: 16
PageRank: 12.39
References: 5
Authors: 3
Name, Order, Citations, PageRank
Jan E. Gewehr, 1, 81, 17.19
Martin Szugat, 2, 18, 13.03
Ralf Zimmer, 3, 16, 12.39