is a General purpose Resource.
SnpEff is a genetic variant annotation and functional effect prediction toolbox widely used in genomics pipelines. It annotates and predicts the effects of genetic variants on genes and proteins (such as amino acid changes), supports over 38,000 genomes, uses standardized Sequence Ontology terms, and implements the VCF annotation standard ANN field. SnpEff is bundled with SnpSift, a companion tool for filtering and manipulating genomic annotated variants. Both tools are integrated with Galaxy and GATK, making them essential components of sequencing data analysis workflows.
biomedical, genomics, precision medicine, biological systems
infores:snpeff
Unknown
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| snpeff.download | SnpEff Download Package | snpEff_latest_core.zip | ProgrammingInterface | ❔ | SnpEff core package with command-line... |
| snpeff.documentation | SnpEff Documentation | introduction | GraphicalInterface | http | Comprehensive documentation website f... |
| snpeff.databases ⚠ | SnpEff Genome Databases | download | Product | ❔ | Over 38,000 pre-built genome database... |
| snpeff.github | SnpEff GitHub Repository | SnpEff | ProgrammingInterface | ❔ | SnpEff source code repository on GitHub |
| snpeff.sourceforge | SnpEff SourceForge Archive | files | Product | ❔ | Historical releases and versions of S... |
| ID | Name | URL | Category | Format | Relation | Description |
|---|---|---|---|---|---|---|
| snpsift.documentation | SnpSift Documentation | introduction | GraphicalInterface | http | had primary source | SnpSift documentation covering tools ... |
SnpEff is a widely-used open-source tool for annotating and predicting the functional effects of genetic variants. It analyzes how genetic variations affect genes, transcripts, and proteins, making it an essential component of genomics and precision medicine workflows. SnpEff is bundled with SnpSift, a complementary toolset for filtering and manipulating annotated variants.
Latest version: 5.3 (released 2025-09-02)
-o gatk)SnpSift is bundled with SnpEff and provides extensive capabilities for working with annotated VCF files:
SnpEff installation is straightforward:
# Download latest version
wget https://snpeff.odsp.astrazeneca.com/versions/snpEff_latest_core.zip
# Unzip
unzip snpEff_latest_core.zip
# Run SnpEff
java -jar snpEff.jar [options] genome_version input.vcf > output.vcf
Databases are automatically downloaded when first used:
# List available databases
java -jar snpEff.jar databases
# Pre-download a specific database (optional)
java -jar snpEff.jar download GRCh38.76
# Annotate variants (database downloaded automatically if needed)
java -jar snpEff.jar GRCh38.76 input.vcf > annotated.vcf
# With summary HTML report
java -jar snpEff.jar -v -stats summary.html GRCh38.76 input.vcf > annotated.vcf
# GATK-compatible output
java -jar snpEff.jar -o gatk GRCh38.76 input.vcf > annotated.vcf
# Filter variants
cat annotated.vcf | java -jar SnpSift.jar filter "(ANN[*].IMPACT = 'HIGH')" > high_impact.vcf
# Extract fields
cat annotated.vcf | java -jar SnpSift.jar extractFields - CHROM POS REF ALT "ANN[*].GENE" "ANN[*].EFFECT" > variants.txt
# Annotate with dbSNP
java -jar SnpSift.jar annotate dbsnp.vcf input.vcf > annotated_dbsnp.vcf
SnpEff adds annotations to the VCF ANN (Annotation) field with the following structure:
Allele | Annotation | Annotation_Impact | Gene_Name | Gene_ID | Feature_Type | Feature_ID | Transcript_BioType | Rank | HGVS.c | HGVS.p | cDNA.pos / cDNA.length | CDS.pos / CDS.length | AA.pos / AA.length | Distance | ERRORS / WARNINGS / INFO
SnpEff generates HTML summary reports including:
SnpEff is fully integrated with the Galaxy platform:
Native support for GATK pipelines:
SnpEff supports building custom genome databases from:
SnpEff and SnpSift are released under the MIT License, making them free for both academic and commercial use.
The project welcomes contributions:
SnpEff: Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012 Apr-Jun;6(2):80-92. PMID: 22728672
SnpSift: Cingolani P, Patel VM, Coon M, Nguyen T, Land SJ, Ruden DM, Lu X. Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift. Frontiers in Genetics, 3, 2012.
The SnpEff project honors the memory of Dr. Xiangyi Lu, a co-author on both the SnpEff and SnpSift papers, who passed away from ovarian cancer in 2017. Users are encouraged to consider donations to the Xiangyi Lu Graduate Student Fellowship in Bioinformatics Fund at Wayne State University.
SnpEff is commonly used alongside:
Created: October 31, 2025 | Last modified: October 31, 2025