Title
Compression of Quality Factors in Next Generation Sequencing
Abstract
We propose a compression algorithm for the quality scores contained in FASTQ files which are generated in large volumes during high throughput sequencing. The proposed algorithm is a context dependent arithmetic coder which is based on observations of the structure of quality scores in FASTQ files. Simulation results indicate a significantly superior performance of the algorithm to the current state of the art.
Year
DOI
Venue
2014
10.1109/DCC.2014.46
Data Compression Conference
Keywords
Field
DocType
Q-factor,arithmetic codes,data compression,FASTQ files,compression algorithm,context dependent arithmetic coder,high throughput sequencing,next generation sequencing,quality factors compression,quality scores,Biological sequence compression,DNA,Quality factor
Compression (physics),FASTQ format,Computer science,Theoretical computer science,DNA sequencing,Data compression
Conference
ISSN
Citations 
PageRank 
1068-0314
2
0.41
References 
Authors
0
2
Name
Order
Citations
PageRank
Özkan U. Nalbantoglu120.75
Khalid Sayood286888.12