Abstract | ||
---|---|---|
We propose a compression algorithm for the quality scores contained in FASTQ files which are generated in large volumes during high throughput sequencing. The proposed algorithm is a context dependent arithmetic coder which is based on observations of the structure of quality scores in FASTQ files. Simulation results indicate a significantly superior performance of the algorithm to the current state of the art. |
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/DCC.2014.46 | Data Compression Conference |
Keywords | Field | DocType |
Q-factor,arithmetic codes,data compression,FASTQ files,compression algorithm,context dependent arithmetic coder,high throughput sequencing,next generation sequencing,quality factors compression,quality scores,Biological sequence compression,DNA,Quality factor | Compression (physics),FASTQ format,Computer science,Theoretical computer science,DNA sequencing,Data compression | Conference |
ISSN | Citations | PageRank |
1068-0314 | 2 | 0.41 |
References | Authors | |
0 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Özkan U. Nalbantoglu | 1 | 2 | 0.75 |
Khalid Sayood | 2 | 868 | 88.12 |