Downloads & Links
Installation script
Recommended
Software
Database Update Utility
New!
MySQL Data Tables
Note! Large file, over 4.5 GB
Download User Manual
Download README File
Major revisions
Annotated Call Sets
SNP/SNV
Indel
SV/CNV
Support, Questions, Suggestions
Send email to makarovv at gmail.com
|
What is AnnTools
AnnTools is an efficient, fast and robust bioinformatics tool annotating single nucleotide substitutions (SNP/SNV),
small insertions/deletions (indels), and copy number variations (CNV) calls generated from sequencing and microarray data.
Availability
Source code, supporting files and user manual are freely available for download and use under the BSD License.
The system is under active and continuous development, we provide montly updates for the aggregate database used for annotation
Release
Version 1.1
Operating System
LINUX/UNIX/MAC OSX
Prerequisites
The AnnTools is written in Python and uses MySQL for data storage.
Installation
The easiest way to install the software is to run setup script
- Download setup.sh.gz file from the left navigation tab to any directory on your computer where you have full privileges
and sufficient space (at least 100GB) available. Shell file was zipped as some systems may prevent downloading executable files.
- Unzip the shell
$ gunzip setup.sh.gz and give it execute permission $ chmod 755 setup.sh.
- Open shell file with any text editor and modify MySQL user name and password as appropriate. Account must have DROP, CREATE, INSERT, SELECT privileges to MySQL server in order to accomplish data table installation.
- Run installation
./setup.sh
As some tables are very large, please allow several hours to complete this operation.
- Perform one time configuration as described below
Manual Installation is also possible
- Download AnnTools source code "anntools.tar.gz" from the left navigation bar to any directory on your computer.
- Unzip the application code
$ tar xzvf anntools.tar.gz
- Download MySQL dump file "annotator.sql.gz" from the left navigation bar to any directory on your computer.
- Unzip the MySQL dump
$ bunzip2 annotator.sql.bz2
- Create a new MySQL database
$mysql --user=USER --password=PASSWORD < annotator.sql
Please note that you need an account with DROP, CREATE, INSERT, SELECT privileges to perform this operation.
- Perform one time configuration as described below
Aggregate Database Update
- To ensure easy and timely updates, we provide db_update utility for aggregate database update.
- Unzip it, change to "db_update" directory and modify MySQL login name and password as appropriate. No other modification is required. You must have DROP, CREATE, INSERT, SELECT privileges to MySQL server in order to accomplish update.
- Run database update
./run_db_update.sh
- Only database tables downloaded and installed from Anntools website during Installation step will be updated. No custom annotation table (see Custom annotation section of user manual) will change.
One Time Configuration Before You Start
- cd to "anntools" directory and change permissions to 755 for all *.py files and *.sh file
- Edit the "config.txt" file and specify host, user and password. Database name is already specified, please do not change it.
Please note that whereas for most of the functionality MySQL account with SELECT privileges is adequate,
you need to provide an account with DROP, CREATE, INSERT, SELECT privileges if you wish using custom annotation, as it involved creation of new tables.
Test run
You will find three shell scripts called run_snp.sh, run_indel.sh and run_cnv.sh.
We recommend trying running them to annotate SNP, indel and CNV calls located to "example" subdirectory to make sure that installation went correctly.
Each has about 1000 variants and complete for about two minutes (on our machine). To annotate your own variants, simply substitute our VCF files names to yours.
Environment
Standalone machine or HPCC. Possible run modes:
- As a standalone program by runing the shell scripts provided (run_snp.sh, run_indel.sh, run_cnv.sh). In this mode one vcf file at a time is annotated
- Submitted to High Performance Computer Cluster. Writing submission shell will be reqired based on your HPCC specifications.
Due to multiplicity of possibilities (SGE, PBS, etc.), writing of submission shell is left to the local developer.
- As an API, integrated to your own python code to analyse multiple BAM files. Use "annotate.py" python script as a starting point to include AnnTools to your python application.
Input
VCF (Variant Call Format), SamTools pileup, tabular files (plain text). The latter should be manually converted to VCF using tab2vcf.pl tool
Output
Output is generated in VCF (Variant Call Format).
Annotated Call Sets for single nucleotide substitutions (SNP/SNV),
short insertions/deletions (Indel) and structural variants
(SV/CNV) are available
for download from the left navigaion bar. Examples for each type of variants are available in tabular format below. Information was parced for better viewing.
Note: "." indicates that information is not applicable or unavailable
Limitations
Only human genome build 37/hg19 can be annotated at this time
Future development plans
Our future development plans include adding a graphical user interface for desktop users and developing a web service to maximize user-friendliness and versatility.
Presentations
AnnTools: Genome annotation toolkit for SNP and CNV data from whole exome and whole genome sequencing data and microarrays. V. Makarov, T. O'Grady, G. Cai, J. Lihm, J. Buxbaum, S. Yoon; (Abstract #918W). Presented at the 12th International Congress of Human Genetics/61st Annual Meeting of The American Society of Human Genetics, October 12, 2011, Montreal, Canada.
|