gfflu is a Python CLI app that annotates Influenza A virus (IAV) gene segment nucleotide sequences with
BLASTX and Miniprot, using the same protein sequences as the Influenza Virus Sequence Annotation Tool, and
outputs a GFF3 file with the expected genetic features for each of the 8 IAV gene segments.
Usage: gfflu [OPTIONS] FASTA
Annotate Influenza A virus sequences using Miniprot and BLASTX
The Miniprot GFF for a particular reference sequence gene segment will have multiple annotations for the same gene. This script will select the top scoring annotation for each gene and write out a new GFF file that can be used
with SnpEff.
╭─ Arguments ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ * fasta FILE Influenza virus nucleotide sequence FASTA file [default: None] [required] │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --outdir -o PATH Output directory [default: gfflu-outdir] │
│ --force -f Overwrite existing files │
│ --prefix -p TEXT Output file prefix [default: None] │
│ --verbose -v │
│ --version -V Print 'gfflu version 0.0.2' and exit │
│ --install-completion Install completion for the current shell. │
│ --show-completion Show completion for the current shell, to copy it or customize the installation. │
│ --help Show this message and exit. │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
gfflu version 0.0.2; Python 3.10.5
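The help text above notes that Miniprot can report several annotations for the same gene and that only the top-scoring one is kept. The following is a minimal sketch of that selection step, not gfflu's actual code. It assumes tab-separated GFF3 lines in which column 3 is the feature type, column 6 is a numeric score, and the gene is named by the first token of a `Target` attribute. The function names and the sample records are made up for illustration.

```python
from typing import Dict, List


def parse_attributes(field: str) -> Dict[str, str]:
    """Parse a GFF3 attribute column ('key=value;key=value') into a dict."""
    attrs: Dict[str, str] = {}
    for item in field.strip().split(";"):
        if "=" in item:
            key, value = item.split("=", 1)
            attrs[key] = value
    return attrs


def top_scoring_mrnas(gff_lines: List[str]) -> Dict[str, List[str]]:
    """Return the highest-scoring mRNA record (as a list of columns) per gene."""
    best: Dict[str, List[str]] = {}
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines, comments and directives
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != "mRNA":
            continue  # only whole-gene records are compared here
        target = parse_attributes(cols[8]).get("Target", "")
        if not target:
            continue
        gene = target.split()[0]  # assumption: first token names the protein
        score = float(cols[5])    # assumption: the score column is numeric
        if gene not in best or score > float(best[gene][5]):
            best[gene] = cols
    return best


# Made-up records: two candidate alignments for M1 and one for M2.
example = [
    "##gff-version 3",
    "seg7\tminiprot\tmRNA\t26\t784\t1200\t+\t.\tID=MP000001;Rank=1;Target=M1 1 252",
    "seg7\tminiprot\tmRNA\t26\t700\t800\t+\t.\tID=MP000002;Rank=2;Target=M1 1 224",
    "seg7\tminiprot\tmRNA\t26\t1007\t450\t+\t.\tID=MP000003;Rank=1;Target=M2 1 97",
]
selected = top_scoring_mrnas(example)
for gene, cols in selected.items():
    print(gene, parse_attributes(cols[8])["ID"], cols[5])
```

A real implementation would also need to carry along the child features (for example CDS records) of each selected annotation, which this sketch leaves out.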
Installation
Conda
This is the recommended installation method.
conda install -c bioconda gfflu
PyPI
pip install gfflu
This install method assumes that you have BLAST+ and Miniprot installed and on your $PATH.
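Whether both dependencies are reachable can be checked before running gfflu. The sketch below is an illustration and not part of gfflu; it assumes the executables are named `blastx` (from BLAST+) and `miniprot`, and the helper name `missing_tools` is hypothetical.

```python
import shutil
from typing import Iterable, List


def missing_tools(tools: Iterable[str] = ("blastx", "miniprot")) -> List[str]:
    """Return the names of tools that cannot be found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_tools()
if missing:
    print("Not found on PATH: " + ", ".join(missing))
else:
    print("blastx and miniprot are both available")
```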
From Source
It is recommended to use conda to manage the environment, using
the provided environment.yml file.
Usage
Below is an example of typical usage with a FASTA nucleotide sequence file (Segment_4_HA.MH201222.fasta):
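Following the `gfflu [OPTIONS] FASTA` usage line, the simplest invocation passes the FASTA file as the only argument. The sketch below is an illustration rather than part of gfflu: it builds that command from Python using only options listed in the help output, and runs it only if `gfflu` is installed and the input file exists. The helper name `build_gfflu_command` is hypothetical.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List, Optional


def build_gfflu_command(
    fasta: str,
    outdir: Optional[str] = None,
    prefix: Optional[str] = None,
    force: bool = False,
) -> List[str]:
    """Assemble a gfflu argument list from options shown in the help output."""
    cmd = ["gfflu"]
    if outdir is not None:
        cmd += ["--outdir", outdir]
    if prefix is not None:
        cmd += ["--prefix", prefix]
    if force:
        cmd.append("--force")
    cmd.append(fasta)  # the FASTA file is the single positional argument
    return cmd


cmd = build_gfflu_command("Segment_4_HA.MH201222.fasta")
print(" ".join(cmd))

# Run the tool only when it is installed and the input file is present.
if shutil.which("gfflu") is not None and Path(cmd[-1]).is_file():
    subprocess.run(cmd, check=True)
```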
This produces an output directory (gfflu-outdir/ by default).
Annotation
gfflu outputs a SnpEff-compatible GFF with the same features as those identified by the Influenza Virus Sequence Annotation Tool.
Segment 1
Influenza Virus Sequence Annotation Tool output
NCBI Genbank GFF for MH201221.1
gfflu GFF
Segment 2
Influenza Virus Sequence Annotation Tool output
NCBI Genbank GFF for CY147460.1
gfflu GFF
Segment 3
Influenza Virus Sequence Annotation Tool output
NCBI Genbank GFF for CY146806.1
gfflu GFF
Segment 4
Influenza Virus Sequence Annotation Tool output
NCBI Genbank GFF for MH201222.1
gfflu GFF
Segment 5
Influenza Virus Sequence Annotation Tool output
NCBI Genbank GFF for MH085254.1
gfflu GFF
Segment 6
gfflu GFF
Segment 7
Influenza Virus Sequence Annotation Tool output
gfflu GFF
Segment 8
Influenza Virus Sequence Annotation Tool output
gfflu GFF
License
gfflu is distributed under the terms of the MIT license.
References