EsViritu is a read mapping pipeline for detection and measurement of human and animal virus pathogens using sequencing reads from metagenomic environmental or clinical samples.
This approach is sensitive, specific, and ideal for exploring virus presence/absence/diversity within and between metagenomic or clinical samples. Interactive reports make it easy to see the breadth of read coverage for each detected virus. This tool should reliably detect virus reads with 80% ANI or greater to reference genomes.
See documentation and manual on the dedicated ReadTheDocs site
Features
As of EsViritu v1.0.0 (and later):
Highly curated sequence-to-taxonomy database of known human, animal, and plant viruses.
Single read sensitivity for detection (requires reads >= 100 nt).
Assembly-aware genome reconstructions for segmented and non-segmented viruses.
Expression of uncertainty (i.e. average read identity to reference, nucleotide diversity measurement).
Then run the summarize_esv_runs command with the relative path to the output directory as the only argument:
summarize_esv_runs myproject_EsViritu1
This command will generate the tables myproject_EsViritu1.detected_virus.info.tsv, myproject_EsViritu1.detected_virus.assembly_summary.tsv, myproject_EsViritu1.tax_profile.tsv and the reactable myproject_EsViritu1.batch_detected_viruses.html, which summarize information about all the samples in the given directory.
Citation
Wastewater sequencing reveals community and variant dynamics of the collective human virome
Michael Tisza, Sara Javornik Cregeen, Vasanthi Avadhanula, Ping Zhang, Tulin Ayvaz, Karen Feliz, Kristi L. Hoffman, Justin R. Clark, Austen Terwilliger, Matthew C. Ross, Juwan Cormier, David Henke, Catherine Troisi, Fuqing Wu, Janelle Rios, Jennifer Deegan, Blake Hansen, John Balliew, Anna Gitter, Kehe Zhang, Runze Li, Cici X. Bauer, Kristina D. Mena, Pedro A. Piedra, Joseph F. Petrosino, Eric Boerwinkle, Anthony W. Maresso
EsViritu
EsViritu is a read mapping pipeline for detection and measurement of human and animal virus pathogens using sequencing reads from metagenomic environmental or clinical samples.
This approach is sensitive, specific, and ideal for exploring virus presence/absence/diversity within and between metagenomic or clinical samples. Interactive reports make it easy to see the breadth of read coverage for each detected virus. This tool should reliably detect virus reads with 80% ANI or greater to reference genomes.
See documentation and manual on the dedicated ReadTheDocs site
Features
As of EsViritu
v1.0.0(and later):Highly curated sequence-to-taxonomy database of known human, animal, and plant viruses.
Single read sensitivity for detection (requires reads >= 100 nt).
Assembly-aware genome reconstructions for segmented and non-segmented viruses.
Expression of uncertainty (i.e. average read identity to reference, nucleotide diversity measurement).
Attractive HTML reports.
Schematic
Logo by Adrien Assie
Interactive report of detected viruses
Viruses in Database
19,925 high quality virus genome assemblies across 63 taxonomic families:
Installation
Current Versions
Code: v1.2.0
Database: v3.2.4
Stable release via Bioconda (recommended)
NOTE: 2026-01-27 EsViritu v1.1.6 released and available on bioconda.
1) Create conda environment.
mambais preferable tocondafor environment creation.mamba create -n EsViritu bioconda::esviritu2) Activate the environment with
condaconda activate EsVirituyou should be able to run help menu:
EsViritu -h3) Download the database (~400 MB when decompressed). EsViritu v1.0.0 or higher requires DB v3.1.0 or higher!
cdwhere you want the database to residemkdir esviritu_DB && cd esviritu_DBDownload the tarball of DB
v3.2.4(most recent version) from Zenodo:wget https://zenodo.org/records/17716199/files/esviritu_db_v3.2.4.tar.gzCheck that the download was successful:
md5sum esviritu_db_v3.2.4.tar.gzshould return
24d85c1ec3cbffff12e921d2f39c91b2 esviritu_db_v3.2.4.tar.gzUnpack and remove the tarball:
tar -xvf esviritu_db_v3.2.4.tar.gzrm esviritu_db_v3.2.4.tar.gzDB files should be in
v3.2.4/4) Set the database path (optional but recommended):
conda env config vars set ESVIRITU_DB=/path/to/esviritu_DB/v3.2.45) (OPTIONAL BUT RECOMMENDED) Install the
Rpackagedatauimanually in an R session. Withoutdatauireports won’t show genome coverage sparklines.Rthen:
remotes::install_github("timelyportfolio/dataui")**See development and container installation guides in the documentation.
Running the tool
You might run this as part of a bash script, snakemake pipeline, do your own upstream read processing, etc, but these are the basic instructions.
Required inputs:
-r reads file FASTQ format (can be gzipped .gz)-s sample name-o output directory (may be shared with other samples)Activate the conda environment:
conda activate EsVirituIndividual samples can be run with the python script. E.g.:
Basic run with 1 .fastq file:
Using paired end input .fastq files. Must be exactly 2 files.
With pre-filtering steps:
Help menu
Make a Summary for Batch of Reports
Run the batch summary script to collate reports from several sequencing libraries in a project:
Example:
Activate conda environment:
conda activate EsVirituThen run the
summarize_esv_runscommand with the relative path to the output directory as the only argument:This command will generate the tables
myproject_EsViritu1.detected_virus.info.tsv,myproject_EsViritu1.detected_virus.assembly_summary.tsv,myproject_EsViritu1.tax_profile.tsvand the reactablemyproject_EsViritu1.batch_detected_viruses.html, which summarize information about all the samples in the given directory.Citation
Wastewater sequencing reveals community and variant dynamics of the collective human virome
Michael Tisza, Sara Javornik Cregeen, Vasanthi Avadhanula, Ping Zhang, Tulin Ayvaz, Karen Feliz, Kristi L. Hoffman, Justin R. Clark, Austen Terwilliger, Matthew C. Ross, Juwan Cormier, David Henke, Catherine Troisi, Fuqing Wu, Janelle Rios, Jennifer Deegan, Blake Hansen, John Balliew, Anna Gitter, Kehe Zhang, Runze Li, Cici X. Bauer, Kristina D. Mena, Pedro A. Piedra, Joseph F. Petrosino, Eric Boerwinkle, Anthony W. Maresso
https://doi.org/10.1038/s41467-023-42064-1