Fig. 1 | Scientific Reports

Fig. 1

From: Lossless and reference-free compression of FASTQ/A files using GeneSqueeze

GeneSqueeze compression flow diagram. The diagram depicts the overall methodology of the GeneSqueeze algorithm. A blue outline denotes the quality score branch, a red outline denotes the nucleotide sequence branch, and a black outline denotes the read identifier branch. The duplication removal, semi-duplication removal, k-mer removal, and nucleotide sequence regulator steps each output auxiliary information, which comprises all of the information necessary to losslessly decompress the file.
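
The three-branch layout and the role of auxiliary information can be illustrated with a minimal Python sketch. This is not the GeneSqueeze implementation: the function names (`split_fastq`, `remove_duplicates`, `restore_duplicates`) are hypothetical, the input is assumed to be a well-formed four-line-per-record FASTQ with a bare `+` separator, and only exact duplication removal is shown. The index list returned alongside the unique sequences plays the part of auxiliary information, since it is what allows the original read order to be rebuilt without loss.

```python
def split_fastq(text):
    """Split FASTQ text into three parallel streams (hypothetical helper).

    Assumes four lines per record and a bare '+' separator line.
    """
    lines = text.strip().split("\n")
    if len(lines) % 4 != 0:
        raise ValueError("expected 4 lines per FASTQ record")
    ids, seqs, quals = [], [], []
    for i in range(0, len(lines), 4):
        ids.append(lines[i])        # read identifier branch
        seqs.append(lines[i + 1])   # nucleotide sequence branch
        quals.append(lines[i + 3])  # quality score branch
    return ids, seqs, quals


def remove_duplicates(seqs):
    """Keep each distinct sequence once; return (unique, aux).

    aux[i] is the position in `unique` of the i-th original read,
    which is the auxiliary information needed to undo this step.
    """
    unique = []
    index_of = {}
    aux = []
    for s in seqs:
        if s not in index_of:
            index_of[s] = len(unique)
            unique.append(s)
        aux.append(index_of[s])
    return unique, aux


def restore_duplicates(unique, aux):
    """Invert remove_duplicates using the auxiliary index list."""
    return [unique[i] for i in aux]


fastq = "@r1\nACGT\n+\nIIII\n@r2\nACGT\n+\nHHHH\n@r3\nTTGA\n+\nIIHH\n"
ids, seqs, quals = split_fastq(fastq)
unique, aux = remove_duplicates(seqs)
# Round trip: the restored list matches the original sequence stream.
assert restore_duplicates(unique, aux) == seqs
```

Each later stage named in the caption would, under the same idea, emit its own auxiliary record so that decompression can reverse the stages in the opposite order.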
