Commands

A simple example is provided as follows to illustrate the use of "LWFQZip2". To compress the sample FASTQ file SRR1063349.fastq, the command "LWFQZip2 -c" is executed with a reference NC_017634.1.fasta.

LWFQZip2 -c -i SRR1063349.fastq -r NC_017634.1.fasta

where the target FASTQ file is first mapped to the reference obtaining an intermediate output file "SRR1063349.fastq.map.txt"(in SAM format) in the same directory, and then the original data is compressed based on this mapping results. A compressed file "SRR1063349.fastq.lz" is obtained.

To decompress the file, the command "LWFQZip2 -d" should be called.

LWFQZip2 -d -i SRR1063349.fastq.lz -r NC_017634.1.fasta

More parameters can be specified for the mapping and compressiong parts as follows:

COMMANDS AND OPTIONS

LWFQZip2 <mode>...[options]
  Mode:
 
-c compression.
-d decompression.
  Compression/Decompression Options:
 
-i input FASTQ file or compressed file.
-r input Reference file.
-m maximal read length,ranging from 30000 to 300000 (Default: '-m 300000').
-h help.
-g best compression ratio mode (slow), usage: LWFQZip2 -c -i input -r reference -g.
-a assemble-based mode,An optional amount (Default: 0.3% of the original file size) of reads, which contains the predefined prefix (Default: 'CG'), could be combined to be an artificial reference. At the end of the package, this artificial reference is included.
usage:
LWFQZip2 -c -i input.fastq -a 0.003(default)
LWFQZip2 -d -i input.fastq.lz -a.
-s display the counts of the common prefixes in the reference, e.g. CG,AT,ATA...
-v version.
  Mapping Options:
 
-b the number of mapping thread(Default: 10, mininum: 1)
-p specify the kmer prefixes, e.g.,'CG', 'AT', and 'TAG' (Default: '-p CG'). 'AA' is not recommended as a prefix.
-k length of a kmer used in locate local alignment. (Default: '-k 8').
-l the mini length of a legal alignment.(Default: '-l 12').
-o the complementary palindrome mode.(Default: '-o 1' means start,otherwise'-o 0').
-e the tolerance rate of mismatches.(Default: '-e 0.05').