-h, –help show this help message and exit
-i INPUT, –input INPUT
Input BAM file
-o OUTPUT, –output OUTPUT
Output BAM file
-r REF, –ref REF reference genome used for analyzing RNA-seq data
-g GTF, –gtf GTF gene annotatino file used for analyzing RNA-seq data
-s SPLICE_BIN, –splice_bin SPLICE_BIN
splice site half bin size (default: 20)
-m MAPQ, –mapq MAPQ minimal MAPQ of read from BAM file for supporting
Indel (default: 15)
-l LENGTH, –length LENGTH
Maximum deletion length to be detected (default:1000000)
-v, –version show program’s version number and exit
Input:
input_bam_file :input BAM file is produced by BWA-MEM and is sorted and indexed.
reference_genome_fasta (for RNA-seq) :reference genome in FastA format
gtf_file (for RNA-seq) :gene annotation file in GTF format
Output:
your_output_bam_file :BAM file for CIGAR string redefinement.
transIndel generates the following optional fields in output BAMs
Tag| Meaning
--------------------------------------------------------------------------------------
OA | original representative alignment; format: (pos,CIGAR)
JM | splicing junction reads; infered from GTF or splicing motif (used in RNA-seq BAM)
Introduction
transIndel is used to detect indels (insertions and deletions) from DNA-seq or RNA-seq data by parsing chimiric alignments from BWA-MEM.
Prerequisites
Samtools/1.0 or newer (http://www.htslib.org/) Python 3.6 or newer (https://www.python.org/) Python packages:
Getting Soure Code
Running transIndel
STEP 1: Build new BAM file with redefined CIGAR string
Options:
-h, –help show this help message and exit -i INPUT, –input INPUT -o OUTPUT, –output OUTPUT -r REF, –ref REF reference genome used for analyzing RNA-seq data -g GTF, –gtf GTF gene annotatino file used for analyzing RNA-seq data -s SPLICE_BIN, –splice_bin SPLICE_BIN -m MAPQ, –mapq MAPQ minimal MAPQ of read from BAM file for supporting -l LENGTH, –length LENGTH -v, –version show program’s version number and exitInput:
Output:
STEP 2: Call indel
Options:
-h, –help show this help message and exit -i INPUT, –input INPUT -o OUTPUT, –output OUTPUT -c AO, –ao AO minimal observation count for Indel (default: 4) -d DEPTH, –depth DEPTH -f VAF, –vaf VAF minimal variant allele frequency (default: 0.1) -l LENGTH, –length LENGTH -m MAPQ, –mapq MAPQ minimal MAPQ of read from BAM file to call Indel -t REGION Limit analysis to targets listed in the BED-format -v, –version show program’s version number and exitInput:
Output: