This mode converts a compact file to FASTA format (or to FASTQ, when selected with --output-format). It is implemented by the class edu.cornell.med.icb.goby.modes.CompactToFastaMode.
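
As a rough illustration of how an invocation might be assembled, the sketch below builds an argument list from the options documented on this page. The `java -jar goby.jar --mode compact-to-fasta` launcher form, the helper name `build_command`, and all file names are assumptions made for this example; only the option flags themselves come from the table below.

```python
def build_command(input_path, output_path, pair_output_path,
                  output_format=None, jar="goby.jar"):
    """Return an argv list for a compact-to-fasta run.

    The launcher prefix (java -jar <jar> --mode ...) is an assumption;
    the option flags are those documented for this mode.
    """
    argv = [
        "java", "-jar", jar,
        "--mode", "compact-to-fasta",
        "--input", input_path,
        "--output", output_path,
        "--pair-output", pair_output_path,
    ]
    # --output-format is optional; add it only when a format is requested.
    if output_format is not None:
        argv += ["--output-format", output_format]
    return argv


# Hypothetical file names, for illustration only.
command = build_command("reads.compact-reads", "reads.fastq",
                        "reads-pair.fastq", output_format="Fastq")
print(" ".join(command))
```

Building the command as a list rather than a single string keeps each argument separate, which is the form `subprocess.run` expects if the sketch is adapted to launch the tool.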
Mode Parameters
The following options are available in this mode:
| Flag | Arguments | Required | Description |
|---|---|---|---|
| --index-to-header | n/a | no | When this switch is provided, ignore the sequence description line and write the sequence index as a string in place of the FASTA sequence header. |
| --output-color-space | n/a | no | When this switch is provided, convert the sequence to color space. Only makes sense if the compact file encodes a sequence in fasta format. |
| --output-fake-nt | n/a | no | When this switch is provided, convert the color space sequence to a fake nucleotide representation (e.g. 0123.->ACTGN). Used by the BWA algorithm. |
| --output-fake-quality | n/a | no | When this switch is provided, output fake quality scores, all set to a fixed default constant. |
| --reference | n/a | no | When this switch is provided together with --output-color-space, any transition involving the N character results in writing an ‘N’ base. |
| --trim-adaptor-length | trim-adaptor-length | no | When positive, removes this many bases from the beginning of each sequence. |
| (-i\|--input) | input | yes | Name of the compact input file. |
| (-o\|--output) | output | yes | Name of the FASTA output file. |
| (-p\|--pair-output) | pair-output | yes | Name of the pair output file (where the paired sequences will be written). |
| (-t\|--output-format) | output-format | no | Type of output to generate. Can be Fasta or Fastq. The Fastq format also encodes quality scores. |
| (-a\|--alphabet) | alphabet | no | Alphabet of valid bases/residues. The sequence is filtered to keep only characters in the alphabet. Other characters are replaced by ‘N’. |
| (-f\|--read-index-filter) | read-index-filter | no | The name of a read index filter. When provided, compact-to-fasta only writes a read to the output if its index is contained in the filter. |
| --quality-encoding | quality-encoding | no | The encoding for quality scores. The default is the encoding used by the Illumina pipeline 1.3+. |
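
To make the effect of --trim-adaptor-length, --alphabet and --index-to-header concrete, here is a minimal sketch that mimics the behavior described in the table. It is not the Goby implementation: the function names, the order in which trimming and filtering are applied, and the case-sensitive alphabet comparison are all assumptions chosen for illustration.

```python
def trim_adaptor(sequence, trim_length):
    """Drop trim_length leading bases when the value is positive."""
    return sequence[trim_length:] if trim_length > 0 else sequence


def filter_alphabet(sequence, alphabet):
    """Replace every character that is not in the alphabet with 'N'."""
    allowed = set(alphabet)
    return "".join(c if c in allowed else "N" for c in sequence)


def fasta_record(index, description, sequence, index_to_header=False):
    """Format one FASTA record; optionally use the read index as the header."""
    header = str(index) if index_to_header else description
    return f">{header}\n{sequence}\n"


# Hypothetical read: index 7, two adaptor bases, one ambiguous base (R).
trimmed = trim_adaptor("TTACGRTA", 2)          # "ACGRTA"
filtered = filter_alphabet(trimmed, "ACGT")    # "ACGNTA"
record = fasta_record(7, "read-7 sample", filtered, index_to_header=True)
print(record, end="")
```

With index_to_header left at False, the same call would write the description line (here the made-up "read-7 sample") as the header instead of the index.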
