Whole genome SNP based identification of members of the Mycobacterium tuberculosis complex. Based on code originally written by Samuel Lipworth and turned into a package by Philip Fowler.
SNP-IT allows rapid Mycobacterial speciation of VCF or FASTA files aligned to NC000962 (H37rV).
For more information please see the publication:
Lipworth S, Jajou R, de Neeling A, Bradley P, van der Hoek W, Maphalala G, et al. SNP-IT tool for identifying subspecies and associated lineages of Mycobacterium tuberculosis complex. Emerg Infect Dis. 2019 Mar.
DOI:10.3201/eid2503.180894
Please note we are now using version 2 libraries (slightly smaller than those in the paper). If you notice any problems please let us know.
If you are experience difficulties installing the package, please email for help or try SNP-IT (the original non-packaged version).
How to install
From source
First clone the repository on your local machine
> git clone https://github.com/samlipworth/snpit.git
Cloning into 'snpit'...
then enter the directory and install
> cd snpit
> python setup develop --user
The --user flag ensures that it is only installed for the user (avoiding the need to know the root/sudo password). To system-wide install simply omit the flag.
Using conda
NB: The bioconda recipe is called mtb-snp-it, with thanks to @pvanheus. I have not tested yet so please let me know if there are any issues.
The code is Python3 and a snpit class is defined. To demonstrate simple usage, a python script that calls the package (snpit-run.py) which can be found in bin/ folder is installed in your $PATH during installation. To see what it does, a single example VCF is provided in the example/ folder.
Note that, as shown in the paper, sublineages are only available for Lineage 4, hence no sublineage is reported for this sample. To alter how the results are output, please see the bin/snpit-run.py script.
Below is an example for bulk usage:
ls *.fasta.gz | parallel -j10 snpit-run.py {} > snpit_results.tsv
snpit
Whole genome SNP based identification of members of the Mycobacterium tuberculosis complex. Based on code originally written by Samuel Lipworth and turned into a package by Philip Fowler.
SNP-IT allows rapid Mycobacterial speciation of VCF or FASTA files aligned to NC000962 (H37rV).
For more information please see the publication:
Please note we are now using version 2 libraries (slightly smaller than those in the paper). If you notice any problems please let us know.
Please email samuel.lipworth@ndm.ox.ac.uk with any queries.
If you are experience difficulties installing the package, please email for help or try SNP-IT (the original non-packaged version).
How to install
From source
First clone the repository on your local machine
then enter the directory and install
The
--userflag ensures that it is only installed for the user (avoiding the need to know the root/sudo password). To system-wide install simply omit the flag.Using conda
NB: The bioconda recipe is called mtb-snp-it, with thanks to @pvanheus. I have not tested yet so please let me know if there are any issues.
Install Miniconda and setup bioconda channel:
Install SNP-IT
Usage
The code is Python3 and a
snpitclass is defined. To demonstrate simple usage, a python script that calls the package (snpit-run.py) which can be found inbin/folder is installed in your$PATHduring installation. To see what it does, a single example VCF is provided in theexample/folder.To run simply
Note that, as shown in the paper, sublineages are only available for Lineage 4, hence no sublineage is reported for this sample. To alter how the results are output, please see the
bin/snpit-run.pyscript.Below is an example for bulk usage: