目录

DiNAMO: An exact and efficient method for IUPAC motif discovery in DNA sequences

Build Status Build status

The DiNAMO software implements an exhaustive algorithm to detect over-represented IUPAC motifs in a set of DNA sequences. It has two modes: scanning mode (default), where all windows are parsed, or fixed-position mode (see optional parameter -p ), where only motifs occurring at a specific position in the sequences are taken into account.

DiNAMO can be used in a variety of applications, such as ChIP-seq peak analysis for transcription factor binding sites identification, or finding motifs that induce systematic sequencing errors.

Installation

Binaries

Binaries for Windows, OS X and Linux are available here.

Build from source code

Dependencies

Boost library (if not in the standard includes, please use -I /path/to/boost in the Makefile)

Compilation

make

The executable file is located in the bin/ directory

Usage

DiNAMO takes as input two fasta files, which contain respectively the positive dataset (signal.fa) and the negative dataset (control.fa). You should specify also the motif length ‘-l’. Optionally, you can provide a maximum number of degenerate letters ‘-d’. The output file contains the over-represented motifs found in the positive dataset compared to the negative dataset, in the MEME format (see http://meme-suite.org/doc/meme-format.html).

    dinamo -pf signal.fa  -nf control.fa -l 6

Optional Parameters

  • maximum number i of degenerate letters

      -d <i>
  • Only process motifs at a offset i (0..N) related to the end of each sequence

      -p <i>
  • output file

      -o <file>
  • change Fisher’s exact test p-value threshold f (default: 0.05)

      -t <f>
  • do not display logs

      --no-log

References

how to cite this tool:

C. Saad, L. Noé, H. Richard, J. Leclerc, M.-P. Buisine, H. Touzet, and M. Figeac. Dinamo: highly sensitive dna motif discovery in high-throughput sequencing data. BMC Bioinformatics, 19(1):223, Jun 2018. https://doi.org/10.1186/s12859-018-2215-1

关于

一个用于分子动力学模拟的软件包,提供高性能计算支持。

3.2 MB
邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802032778号