In this example, we genotype 100 SNPs on human chromosome 22 with a small subset of 1000 Genome Project Illumina sequencing reads. The whole process should finish in around a minute and requries 34 GB RAM.
go to test data directory
pre-process the reference and SNP list to generate indices:
Chen Sun and Paul Medvedev, Toward fast and accurate SNP genotyping from whole genome sequencing data for bedside diagnostics.
VarGeno’s algorithm is built on top of LAVA’s. Its code is built on top of LAVA’s and it reuses a lot of LAVA’s code. It uses some code from the AllSome project.
VarGeno
Fase SNP genotyping tool for whole genome sequencing data and large SNP database.
Install from Bioconda
VarGeno can be installed from Bioconda with command
conda install vargeno.Go to this link for more information about Bioconda.
If you do not have Bioconda installed, you can install VarGeno from source code.
Quick Usage
VarGeno takes as input:
Before genotyping an individual, you must construct indices for the reference and SNP list using the following commands:
To perform the genotyping:
Here
index_prefixshould be the same string as index generating.Output format: VCF
VarGeno’s genotyping results are in the “FORMAT” column of VCF file.
0/0,0/1or1/1.For details of “GT” and “GQ” fields, please refer to The Variant Call Format(VCF) Version 4.2 Specification.
Install from Source Code
Prerequisites
g++version 4.9 or higher.Install Command
You should then see
vargenoin vargeno directory. To verify that your installation is correct, you can run the toy example below.Example
The example dataset is in https://github.com/medvedevgroup/vargeno/tree/master/test .
In this example, we genotype 100 SNPs on human chromosome 22 with a small subset of 1000 Genome Project Illumina sequencing reads. The whole process should finish in around a minute and requries 34 GB RAM.
go to test data directory
pre-process the reference and SNP list to generate indices:
genotype variants:
The expected output of VarGeno on the example dataset should be https://github.com/medvedevgroup/vargeno/blob/master/test/expected_output.
Memory Lite Version
The memory lite version of VarGeno (VarGeno-Lite) is maintained as an independent project in https://github.com/medvedevgroup/vargeno_lite.
Citation
If you use VarGeno in your research, please cite
VarGeno’s algorithm is built on top of LAVA’s. Its code is built on top of LAVA’s and it reuses a lot of LAVA’s code. It uses some code from the AllSome project.