Description

This tool locates and tags duplicate reads in a BAM or SAM file, where duplicate reads are defined as originating from a single fragment of DNA.

Input

name:type
description
pattern

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

bam{:bash}

:file

Sorted BAM file

*.{bam}

fasta{:bash}

:file

Fasta file

*.{fasta}

fasta_fai{:bash}

:file

Fasta index file

*.{fai}

Output

name:type
description
pattern

cram{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*cram{:bash}

:file

Marked duplicates CRAM file

*.{cram}

bam{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*bam{:bash}

:file

Marked duplicates BAM file

*.{bam}

crai{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.crai{:bash}

:file

CRAM index file

*.{cram.crai}

bai{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bai{:bash}

:file

BAM index file

*.{bam.bai}

metrics{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.metrics{:bash}

:file

Duplicate metrics file generated by GATK

*.{metrics.txt}

versions_gatk4{:bash}

${task.process}{:bash}

:string

The name of the process

gatk4{:bash}

:string

The name of the tool

gatk --version | sed -n '/GATK.*v/s/.*v//p'{:bash}

:eval

The expression to obtain the version of the tool

versions_samtools{:bash}

${task.process}{:bash}

:string

The name of the process

samtools{:bash}

:string

The name of the tool

samtools version | sed '1!d;s/.* //'{:bash}

:eval

The expression to obtain the version of the tool

Topics

name:type
description
pattern

versions{:bash}

${task.process}{:bash}

:string

The name of the process

gatk4{:bash}

:string

The name of the tool

gatk --version | sed -n '/GATK.*v/s/.*v//p'{:bash}

:eval

The expression to obtain the version of the tool

${task.process}{:bash}

:string

The name of the process

samtools{:bash}

:string

The name of the tool

samtools version | sed '1!d;s/.* //'{:bash}

:eval

The expression to obtain the version of the tool

Tools

gatk4
MIT

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.