Conservation is a Python package for evolutionary conservation analysis at the codon and amino acid level. It supports comparative analysis using Pfam domain sequences and CDS datasets across multiple species.
🔧 Features
Codon substitution matrix computation
Fisher’s exact test for conservation significance
Automated visualization of alignment and substitution matrices
Supports BLOSUM62 alignment, multi-threading, and multiple species
📦 Installation
You can install the package from PyPI:
pip install conservation
Or from Bioconda:
conda install bioconda::conservation
Or install from source:
git clone https://github.com/hanjunlee21/conservation.git
cd conservation
pip install .
Conservation
Conservation is a Python package for evolutionary conservation analysis at the codon and amino acid level. It supports comparative analysis using Pfam domain sequences and CDS datasets across multiple species.
🔧 Features
📦 Installation
You can install the package from PyPI:
Or from Bioconda:
Or install from source:
🚀 Usage
Command-line Arguments
--domain,-d: Pfam or domain FASTA file--cds,-c: Comma-separated list of CDS FASTA files (one per species)--output,-o: Output directory--threads,-t: Number of parallel threads--fdr,-q: FDR cutoff (optional)--conservedness,-s: Identity ratio threshold (optional)--dpi,-r: DPI for PDF plots📊 Outputs
.tsv.statistics.tsv🧬 Example
Example command for 3 species:
📁 Project Structure
📜 License
MIT License
👤 Author
Hanjun Lee
hanjun_lee@hms.harvard.edu
Project URL: https://github.com/hanjunlee21/conservation