Filter out malformed reads
This filter is applied automatically by all GATK tools in order to protect them from crashing on reads that are malformed. There are a few types of malformation (such as the absence of sequence bases) that are not filtered out by default and can cause errors, but these cases can be preempted by setting flags that cause the problem reads to also be filtered.
java -jar GenomeAnalysisTk.jar \ -T ToolName \ -R reference.fasta \ -I input.bam \ -o output.file \ -filterNoBases
Note that the MalformedRead filter itself does not need to be specified in the command line because it is set automatically.
This table summarizes the command-line arguments that are specific to this tool. For more details on each argument, see the list further down below the table or click on an argument name to jump directly to that entry in the list.
Argument name(s) | Default value | Summary | |
---|---|---|---|
Optional Flags | |||
--filter_bases_not_stored -filterNoBases |
false | Filter out reads with no stored bases (i.e. '*' where the sequence should be), instead of failing with an error | |
--filter_mismatching_base_and_quals -filterMBQ |
false | Filter out reads with mismatching numbers of bases and base qualities, instead of failing with an error | |
--filter_reads_with_N_cigar -filterRNC |
false | Filter out reads with CIGAR containing the N operator, instead of failing with an error |
Arguments in this list are specific to this tool. Keep in mind that other arguments are available that are shared with other tools (e.g. command-line GATK arguments); see Inherited arguments above.
Filter out reads with no stored bases (i.e. '*' where the sequence should be), instead of failing with an error
boolean false
Filter out reads with mismatching numbers of bases and base qualities, instead of failing with an error
boolean false
Filter out reads with CIGAR containing the N operator, instead of failing with an error
boolean false