Diagnostics and Quality Control Tools

ASEReadCounter

AnalyzeCovariates

CallableLoci

CheckPileup

CompareCallableLoci

ContEst

CountBases

CountIntervals

CountLoci

CountMales

CountRODs

CountRODsByRef

CountReadEvents

CountReads

CountTerminusEvent

DepthOfCoverage

DiagnoseTargets

DiffObjects

ErrorRatePerCycle

FastaStats

FindCoveredIntervals

FlagStat

GCContentByInterval

GatherBqsrReports

Pileup

PrintRODs

QualifyMissingIntervals

ReadClippingStats

ReadGroupProperties

ReadLengthDistribution

SimulateReadsForVariants

Sequence Data Processing Tools

BaseRecalibrator

ClipReads

IndelRealigner

LeftAlignIndels

PrintReads

RealignerTargetCreator

SplitNCigarReads

SplitSamFile

Variant Discovery Tools

ApplyRecalibration

CalculateGenotypePosteriors

GATKPaperGenotyper

GenotypeGVCFs

HaplotypeCaller

MuTect2

RegenotypeVariants

UnifiedGenotyper

VariantRecalibrator

Variant Evaluation Tools

GenotypeConcordance

ValidateVariants

VariantEval

VariantFiltration

Variant Manipulation Tools

CatVariants

CombineGVCFs

CombineVariants

HaplotypeResolver

LeftAlignAndTrimVariants

PhaseByTransmission

RandomlySplitVariants

ReadBackedPhasing

SelectHeaders

SelectVariants

ValidationSiteSelector

VariantAnnotator

VariantsToAllelicPrimitives

VariantsToBinaryPed

VariantsToTable

VariantsToVCF

Annotation Modules

AS_BaseQualityRankSumTest

AS_FisherStrand

AS_InbreedingCoeff

AS_InsertSizeRankSum

AS_MQMateRankSumTest

AS_MappingQualityRankSumTest

AS_QualByDepth

AS_RMSMappingQuality

AS_ReadPosRankSumTest

AS_StrandOddsRatio

AlleleBalance

AlleleBalanceBySample

AlleleCountBySample

BaseCounts

BaseCountsBySample

BaseQualityRankSumTest

BaseQualitySumPerAlleleBySample

ChromosomeCounts

ClippingRankSumTest

ClusteredReadPosition

Coverage

DepthPerAlleleBySample

DepthPerSampleHC

ExcessHet

FisherStrand

FractionInformativeReads

GCContent

GenotypeSummaries

HaplotypeScore

HardyWeinberg

HomopolymerRun

InbreedingCoeff

LikelihoodRankSumTest

LowMQ

MVLikelihoodRatio

MappingQualityRankSumTest

MappingQualityZero

MappingQualityZeroBySample

NBaseCount

OxoGReadCounts

PossibleDeNovo

QualByDepth

RMSMappingQuality

ReadPosRankSumTest

SampleList

SnpEff

SpanningDeletions

StrandAlleleCountsBySample

StrandBiasBySample

StrandOddsRatio

TandemRepeatAnnotator

TransmissionDisequilibriumTest

VariantType

BadCigarFilter

BadMateFilter

CountingFilteringIterator.CountingReadFilter

DuplicateReadFilter

FailsVendorQualityCheckFilter

HCMappingQualityFilter

LibraryReadFilter

MalformedReadFilter

MappingQualityFilter

MappingQualityUnavailableFilter

MappingQualityZeroFilter

MateSameStrandFilter

MaxInsertSizeFilter

MissingReadGroupFilter

NoOriginalQualityScoresFilter

NotPrimaryAlignmentFilter

OverclippedReadFilter

Platform454Filter

PlatformFilter

PlatformUnitFilter

ReadGroupBlackListFilter

ReadLengthFilter

ReadNameFilter

ReadStrandFilter

ReassignMappingQualityFilter

ReassignOneMappingQualityFilter

ReassignOriginalMQAfterIndelRealignmentFilter

SampleFilter

SingleReadGroupFilter

UnmappedReadFilter

Resource File Codecs

BeagleCodec

BedTableCodec

RawHapMapCodec

RefSeqCodec

SAMPileupCodec

SAMReadCodec

TableCodec

Reference Utilities

FastaAlternateReferenceMaker

FastaReferenceMaker

QCRef

Showing docs for version 3.7-0

ReadLengthDistribution

Collect read length statistics

Category Diagnostics and Quality Control Tools

Traversal ReadWalker

PartitionBy READ

Overview

This tool generates a table with the read lengths categorized per sample. If the file has no sample information (no read groups) it considers all reads to come from the same sample.

Input

A BAM file.

Output

A human/R-readable table of tab-separated values with one column per sample and one row per read.

Usage example

    java -jar GenomeAnalysisTK.jar \
      -T ReadLengthDistribution \
      -R reference.fasta \
      -I example.bam \
      -o example.tbl

Additional Information

Read filters

These Read Filters are automatically applied to the data by the Engine before processing by ReadLengthDistribution.

Downsampling settings

This tool does not apply any downsampling by default.

Command-line Arguments

Engine arguments

All tools inherit arguments from the GATK Engine' "CommandLineGATK" argument collection, which can be used to modify various aspects of the tool's function. For example, the -L argument directs the GATK engine to restrict processing to specific genomic intervals; or the -rf argument allows you to apply certain read filters to exclude some of the data from the analysis.

CommandLineGATK

ReadLengthDistribution specific arguments

This table summarizes the command-line arguments that are specific to this tool. For more details on each argument, see the list further down below the table or click on an argument name to jump directly to that entry in the list.

Argument name(s)	Default value	Summary
Optional Outputs
--out -o	stdout	An output file created by the walker. Will overwrite contents if file exists

Argument details

Arguments in this list are specific to this tool. Keep in mind that other arguments are available that are shared with other tools (e.g. command-line GATK arguments); see Inherited arguments above.

--out / -o

An output file created by the walker. Will overwrite contents if file exists

PrintStream stdout