Bioinformatics

Genome Analysis Toolkit (GATK)

GATK is a genomic analysis toolkit focused on variant discovery.

The GATK is the industry standard for identifying SNPs and indels in germline DNA and RNAseq data. Its scope is now expanding to include somatic short variant calling, and to tackle copy number (CNV) and structural variation (SV). In addition to the variant callers themselves, the GATK also includes many utilities to perform related tasks such as processing and quality control of high-throughput sequencing data, and bundles the popular Picard toolkit.

These tools were primarily designed to process exomes and whole genomes generated with Illumina sequencing technology, but they can be adapted to handle a variety of other technologies and experimental designs. And although it was originally developed for human genetics, the GATK has since evolved to handle genome data from any organism, with any level of ploidy.

This is free and open source software.

Website: gatk.broadinstitute.org/hc/en-us
Support: GitHub Code Repository
Developer: Broad Institute, Inc.
License: Apache License 2.0

GATK is written in Java. Learn Java with our recommended free books and free tutorials.


Related Software

Bioinformatics Tools
BioconductorAnalysis and comprehension of high-throughput genomic data
BiopythonTools for biological computation written in Python
UGENESet of integrated bioinformatics software
BioPerlPerl tools for computational molecular biology
GROMACSVersatile package to perform molecular dynamics
IGVHigh-performance visualization genome browser tool
GATKGenomic analysis toolkit focused on variant discovery
BioJavaProvides Java tools for processing biological data
InterMineIntegrate biological data sources
bedtoolsPowerful toolset for genome arithmetic
EMBOSSThe European Molecular Biology Open Software Suite
BLASTAlgorithm for comparing primary biological sequence information
GalaxyWeb-based platform for data-intensive computational research
minimap2Versatile sequence alignment program
JalviewMultiple sequence alignment editing, visualisation and analysis
samtoolsManipulate next-generation sequencing data
BCFtoolsVariant calling and manipulating files in the Variant Call Format
FastQCQuality control tool for high throughput sequence data
SPAdesVersatile toolkit for assembling and analysing sequencing data
GenomeToolsCollection of bioinformatics tools
AliViewAlignment viewer and editor
mothurAnalyze microbial communities
BandageVisualising de novo assembly graphs
craminoBAM/CRAM quality evaluation
abPOAAdaptive banded Partial Order Alignment
Taverna WorkbenchFor designing and executing bioinformatics workflows
geWorkbenchSoftware platform for integrated genomic data analysis
BioclipseRich-client platform chemistry and biology workbench

Read our verdict in the software roundup.


Best Free and Open Source Software Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.

This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk.

You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more.

Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form.
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments