An easy to use tool for identifying and analyzing excludons in genomic data using RNA-seq data.
Outline
From a given RNA-seq data, alignment is performed against reference genome (1) and coverage per nucleotide is calculated (2).
Convergent (-> <-) and divergent (<- ->) pairs of genes are substarcted and median covergage is calculated for each of them (3). Trancriptional units (TUs) for each gene is annotated (4) based on gene coverage. A threshold of the covegare decreasing is set, gene gene expression decays under this threshold transcription start and end sites (TSS and TTS) is set. If TUs of convergen and divergent pairs overlaps, this pair is annotated as Excludon (5).
Features
Fast parallel processing for large datasets
Support for both short and long-read data
Support for paired-end and single-end RNA-seq data
The data/examples directory contains test RNA-seq data from E. coli K12 MG1655. For faster testing and analysis, the dataset is reduced to reads mapping only to the first 50 genes. Expected results can be found in data/examples/output/.
Citation
If you found this tool useful, please cite:
Alvaro Sanmartin, Pablo Iturbe, Jeronimo Rodriguez-Beltran, Iñigo Lasa. ExcludonFinder: Mapping Transcriptional Overlaps Between Neighboring Genes
ExcludonFinder
An easy to use tool for identifying and analyzing excludons in genomic data using RNA-seq data.
Outline
From a given RNA-seq data, alignment is performed against reference genome (1) and coverage per nucleotide is calculated (2). Convergent (-> <-) and divergent (<- ->) pairs of genes are substarcted and median covergage is calculated for each of them (3). Trancriptional units (TUs) for each gene is annotated (4) based on gene coverage. A threshold of the covegare decreasing is set, gene gene expression decays under this threshold transcription start and end sites (TSS and TTS) is set. If TUs of convergen and divergent pairs overlaps, this pair is annotated as Excludon (5).
Features
Installation
Using Conda (Recommended)
From source
Usage
If installed with conda:
If installed from source
Options
Example
Examples
The data/examples directory contains test RNA-seq data from E. coli K12 MG1655. For faster testing and analysis, the dataset is reduced to reads mapping only to the first 50 genes. Expected results can be found in data/examples/output/.
Citation
If you found this tool useful, please cite: