Convert GTDB taxonomy to NCBI taxdump format in order to
use the GTDB taxonomy with software that requires a
taxonomy in the taxdump format (eg., kraken2 or TaxonKit).
Note that the taxIDs are arbitrarily assigned and don’t
match anything in the NCBI! Running gtdb_to_taxdump on
a different list of taxonomies (e.g., a different GTDB release)
will create different taxIDs.
See GTDB-taxdump for a method
to produce stable taxIDs (recommended!).
WARNING
There was a serious bug with ncbi-gtdb_map.py prior to version 0.1.5.
Many of the taxonomic classifications are likely incorrect.
Please re-run the analysis. I’m sorry for any inconvenience.
gtdb_to_taxdump
Convert GTDB taxonomy to NCBI taxdump format.
Table of Contents
Summary
Convert GTDB taxonomy to NCBI taxdump format in order to use the GTDB taxonomy with software that requires a taxonomy in the taxdump format (eg., kraken2 or TaxonKit).
WARNING
Citation
Install
Dependencies
Package
From pypi
From github
Usage
See
gtdb_to_taxdump.py -hExample (GTDB release202):
Example (GTDB release95):
Example (GTDB release89):
You can add the taxIDs to a GTDB metadata table via the
--tableparam. For example:Extras
ncbi-gtdb_map.pyNCBI => GTDBorGTDB => NCBI(see--query-taxonomy)gtdb_to_diamond.pylineage2taxid.pygtdb_to_taxdump.pyacc2gtdb_tax.py./uniref_utils/unirefxml2clust50-90idx.pyunirefxml2fasta.pyunirefxml2tax.pyGTDB website
https://data.ace.uq.edu.au/public/gtdb/data/releases/