There are a number of ways to install chado-tools and details are provided below. If you encounter an issue when installing chado-tools please contact your local system administrator. If you encounter a bug please log it here or email us at path-help@sanger.ac.uk.
Required dependencies
Python 3.6
PostgreSQL 9.6 or higher
From source
Download the latest release from this github repository, or clone the repository to obtain the most recent updates.
Then install the software:
The program is also available as Bioconda package. Install it with the command
conda install -c bioconda chado-tools
Using a Docker container
The program is also available as a standalone Docker container. The latest build can be downloaded from DockerHub with the command
docker pull sangerpathogens/chado-tools
When running the container with Docker, use the flags --interactive --tty and map all required environment variables (see below) with flag --env.
Usage
The installation will put a single script called chado in your PATH.
The usage is:
chado <command> [<subcommand>] [options]
To list the available commands and brief descriptions, just run chado -h or chado --help.
To display the version of the program, type chado -v or chado --version.
Use chado <command> -h or chado <command> --help to get a detailed description and the usage of that command.
Database connection
You can set up default values for database host, port and user with environment variables. To do so, add the following
lines to your .bashrc (replacing the example values):
The software seeks for these environment variables on your system. If they do not exist, it will use the
default connection settings, which you can edit manually if you really want.
Analogously, you can specify a default password with the environment variable CHADO_PASS.
The flag -p enforces asking the user for a password, which is useful if you don’t want to store a default password in your environment.
Alternatively, you can supply your own YAML configuration file in the same format as the default file
with flag -c (including password). The software will then ignore any environment variables.
Available commands
Command
Description
connect
connect to a CHADO database for an interactive session
query
query a CHADO database and export the result into a text file
execute
execute a function defined in a CHADO database
extract
run a pre-compiled query against the CHADO database
insert
insert a new entity of a specified type into the CHADO database
delete
delete an entity of a specified type from the CHADO database
import
import data from file into the CHADO database
export
export data from the CHADO database to file
admin
perform admin tasks, such as creating or dumping a CHADO database
Examples
Create a new CHADO database called eukaryotes according to the current GMOD schema:
Dump this database into an archive called eukaryotes.dump:
chado admin dump eukaryotes eukaryotes.dump
List all organisms in the eukaryotes database:
chado extract organisms eukaryotes
Query the database to check the meaning of a certain cvterm_id:
chado query -q "SELECT name FROM cvterm WHERE cvterm_id = 25" eukaryotes
Export a FASTA file containing the sequences of the organism Pfalciparum:
chado export fasta -a Pfalciparum -o Pfalciparum.fasta -t contigs eukaryotes
Note concerning tests
Some of the integration tests rely on access to a PostgreSQL server. In order to successfully run those tests,
modify the default connection settings such that they describe an existing
PostgreSQL database server to which you can connect. The tests can then be run as python3 setup.py test.
They create temporary databases on that server and clean those up when finished, so this shouldn’t interfere with
anything you have stored in any database on that server. If you are concerned about this, though, make sure to point
the tool to an empty test server.
License
chado-tools is free software, licensed under GPLv3.
chado-tools
Python3 command line script providing various tools for accessing CHADO databases.
Contents
Installation
There are a number of ways to install chado-tools and details are provided below. If you encounter an issue when installing chado-tools please contact your local system administrator. If you encounter a bug please log it here or email us at path-help@sanger.ac.uk.
Required dependencies
From source
Download the latest release from this github repository, or clone the repository to obtain the most recent updates. Then install the software:
For running tests please see the note below.
Using pip
You can install the program from the Python Package Index (PyPI) using the command
Using Bioconda
The program is also available as Bioconda package. Install it with the command
Using a Docker container
The program is also available as a standalone Docker container. The latest build can be downloaded from DockerHub with the command
When running the container with Docker, use the flags
--interactive --ttyand map all required environment variables (see below) with flag--env.Usage
The installation will put a single script called
chadoin your PATH. The usage is:chado -horchado --help.chado -vorchado --version.chado <command> -horchado <command> --helpto get a detailed description and the usage of that command.Database connection
You can set up default values for database host, port and user with environment variables. To do so, add the following lines to your
.bashrc(replacing the example values):The software seeks for these environment variables on your system. If they do not exist, it will use the default connection settings, which you can edit manually if you really want.
Analogously, you can specify a default password with the environment variable
CHADO_PASS. The flag-penforces asking the user for a password, which is useful if you don’t want to store a default password in your environment.Alternatively, you can supply your own YAML configuration file in the same format as the default file with flag
-c(including password). The software will then ignore any environment variables.Available commands
Examples
Create a new CHADO database called
eukaryotesaccording to the current GMOD schema:Dump this database into an archive called
eukaryotes.dump:List all organisms in the
eukaryotesdatabase:Query the database to check the meaning of a certain
cvterm_id:Export a FASTA file containing the sequences of the organism
Pfalciparum:Note concerning tests
Some of the integration tests rely on access to a PostgreSQL server. In order to successfully run those tests, modify the default connection settings such that they describe an existing PostgreSQL database server to which you can connect. The tests can then be run as
python3 setup.py test. They create temporary databases on that server and clean those up when finished, so this shouldn’t interfere with anything you have stored in any database on that server. If you are concerned about this, though, make sure to point the tool to an empty test server.License
chado-tools is free software, licensed under GPLv3.
Feedback/Issues
Please report any issues to the issues page or email path-help@sanger.ac.uk.