After one year’s tensive development, we have upgraded and extended CIRCexplorer to a new version – CIRCexplorer2, with many improvements and lots of new features. Welcome to use and cite it!
NEWS: CIRCpedia is an integrative database, aiming to annotating alternative back-splicing and alternative splicing in circRNAs across different cell lines. Welcome to use!
A schematic flow shows the pipeline
Notice
CIRCexplorer is now only a circular RNA annotating tool, and it parses fusion junction information from mapping results of other aligners. The result of circular RNA annotating is directly dependent on the mapping strategy of aligners. Different aligners may have different circular RNA annotations. CIRCexplorer is only in charge of annotating circRNA junctions according to the gene annotation. More functions could be found in CIRCexplorer2.
interval (If you use the old version of CIRCexplorer, you require to install the interval package from https://github.com/kepbod/interval. For the latest version of CIRCexplorer, you won’t install it by yourself.)
CIRCexplorer was originally developed as a circular RNA analysis toolkit supporting TopHat & TopHat-Fusion. After version 1.1.0, it also supports STAR.
TopHat & TopHat-Fusion
To obtain junction reads for circular RNAs, two-step mapping strategy was exploited:
You could use fetch_ucsc.py script to download relevant ref.txt (Known Genes, RefSeq or Ensembl) and the genome fasta file for hg19, hg38 or mm10 from UCSC.
fetch_ucsc.py hg19/hg38/mm10 ref/kg/ens/fa out
Example (download hg19 RefSeq gene annotation file):
CIRCexplorer
CIRCexplorer is a combined strategy to identify junction reads from back spliced exons and intron lariats.
Version: 1.1.10
Last Modified: 2016-7-14
Authors: Xiao-Ou Zhang (zhangxiaoou@picb.ac.cn), Li Yang (liyang@picb.ac.cn)
Maintainer: Xu-Kai Ma (maxukai@picb.ac.cn)
Download the latest stable version of CIRCexplorer
To see what has changed in recent versions of CIRCexplorer, see the CHANGELOG.
FAQ
After one year’s tensive development, we have upgraded and extended CIRCexplorer to a new version – CIRCexplorer2, with many improvements and lots of new features. Welcome to use and cite it!
NEWS: CIRCpedia is an integrative database, aiming to annotating alternative back-splicing and alternative splicing in circRNAs across different cell lines. Welcome to use!
A schematic flow shows the pipeline
Notice
CIRCexplorer is now only a circular RNA annotating tool, and it parses fusion junction information from mapping results of other aligners. The result of circular RNA annotating is directly dependent on the mapping strategy of aligners. Different aligners may have different circular RNA annotations. CIRCexplorer is only in charge of annotating circRNA junctions according to the gene annotation. More functions could be found in CIRCexplorer2.
Prerequisites
Software / Package
TopHat or STAR
Others
RNA-seq
The poly(A)−/ribo− RNA-seq is recommended. If you want to obtain more circular RNAs, RNase R treatment could be performed.
Aligner
CIRCexplorer was originally developed as a circular RNA analysis toolkit supporting TopHat & TopHat-Fusion. After version 1.1.0, it also supports STAR.
TopHat & TopHat-Fusion
To obtain junction reads for circular RNAs, two-step mapping strategy was exploited:
STAR
To detect fusion junctions with STAR,
--chimSegmentMinshould be set to a positive value.Example 1 (single reads):
Example 2 (paired-end reads):
For more details about STAR, please refer to STAR manual.
Installation
Option 1: using pip
Option 2: via conda
CIRCexplorer is available as conda package with:
Option 3: in galaxy
If you have access to a Galaxy instance, CIRCexplorer is also available from the Galaxy Tool Shed.
Option 4: from source codes
1 Download CIRCexplorer
2 Install required packages
3 Install CIRCexplorer
Usage
Example
TopHat & TopHat-Fusion
STAR
Chimeric.out.junctiontofusion_junction.txt(star_parse.pywas modified from STAR filterCirc.awk)fusion_junction.txtNote
hg19.fa is genome sequence in FASTA format.
You could use fetch_ucsc.py script to download relevant ref.txt (Known Genes, RefSeq or Ensembl) and the genome fasta file for hg19, hg38 or mm10 from UCSC.
Example (download hg19 RefSeq gene annotation file):
Output
See details in the example file
Note: The first 12 columns are in BED12 format.
Citation
Zhang XO, Wang HB, Zhang Y, Lu X, Chen LL and Yang L. Complementary sequence-mediated exon circularization. Cell, 2014, 159: 134-147
License
Copyright (C) 2014-2017 YangLab. See the LICENSE file for license rights and limitations (MIT).