–fa_folder: the directory which you save the reference fa files.
–fa_name: the name of the reference fa file.
–out_dir: the directory to save the output results.
*Optional parameters
–kmer_len , default = 100, the length of generated kmers.
–mapq_thres: default = 50000 (50kb), the MAPQ threshold to keep unique mapping kmers.
—chr_thread , default = 2, number of threads for processing chromosome,
–chr_start, –chr_end: if you only process some chromosomes or only one chromosome. For example: use “–chr_start 1 –chr_end 5” will process chromsomes 1,2,3,4,5. Use “–chr_start 2 –chr_end 2” will only assemlby chromosome 2.
Aquila_Umap
(Please ensure channels are properly setup for bioconda before installing)
Usage
*Required parameters
–fa_folder: the directory which you save the reference fa files.
–fa_name: the name of the reference fa file.
–out_dir: the directory to save the output results.
*Optional parameters
–kmer_len , default = 100, the length of generated kmers.
–mapq_thres: default = 50000 (50kb), the MAPQ threshold to keep unique mapping kmers.
—chr_thread , default = 2, number of threads for processing chromosome,
–chr_start, –chr_end: if you only process some chromosomes or only one chromosome. For example: use “–chr_start 1 –chr_end 5” will process chromsomes 1,2,3,4,5. Use “–chr_start 2 –chr_end 2” will only assemlby chromosome 2.
A practical example
Download Rhesus macaque (Macaca mulatta, ftp://ftp.ensembl.org/pub/release-97/fasta/macaca_mulatta/dna/ ) genome fasta file from HERE.
Your folder structure should be as follows :
“macaca_mulatta.fa” is the fasta file you have just downloaded.
Then edit the second block and run the whole notebook.
When finished, you will get:
The final results are stored in Uniqness_map folder.