Bump actions/checkout from 5 to 6 (#28)
Bumps actions/checkout from 5 to 6.
updated-dependencies:
- dependency-name: actions/checkout dependency-version: ‘6’ dependency-type: direct:production update-type: version-update:semver-major …
Signed-off-by: dependabot[bot] support@github.com Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
deRIP2 scans aligned sequences for evidence of un-RIP’d precursor states, allowing for improved RIP-correction across large repeat families in which members are independently RIP’d.
Use deRIP2 to:
Predict ancestral fungal transposon sequences by correcting for RIP-like mutations (CpA –> TpA) and cytosine deamination (C –> T) events.
Mask RIP or deamination events as ambiguous bases to remove RIP signal from phylogenetic analyses.
Table of contents
Installation
Install from PyPi.
Pip install latest development version from GitHub.
Test installation.
Setup Development Environment
If you want to contribute to the project or run the latest development version, you can clone the repository and install the package in editable mode.
Example usage
For aligned sequences in ‘mintest.fa’:
Basic usage with masking
Output:
results/derip_output.fasta- Corrected sequenceresults/derip_output_alignment.fasta- Alignment with masked correctionsresults/derip_output_masked_alignment.fasta- Alignment with masked correctionsWith vizualization
The
--plotoption will create a visualization of the alignment with RIP markup. The--plot-rip-typeoption can be used to specify the type of RIP events to be displayed in the alignment visualizationproduct,substrate, orboth.Output:
results/derip_output.fasta- Corrected sequenceresults/derip_output_masked_alignment.fasta- Alignment with masked correctionsresults/derip_output_visualization.png- Visualization of the alignment with RIP markupUsing maximum GC content for filling
By default uncorrected positions in the output sequence are filled from the sequence with the lowest RIP count. If the
--fill-max-gcoption is set, remaining positions are filled from the sequence with the highest G/C content sequence instead.Alternatively, the
--fill-indexoption can be used to force selection of alignment row to fill uncorrected positions from by row index number (indexed from 0). Note: This will override the--fill-max-gcoption.Correcting all deamination events
If the
--reaminateoption is set, all deamination events will be corrected, regardless of RIP context.--plot-rip-type productis used to highlight the product of RIP events in the visualization. Non-RIP deamination events are also highlighted.Output:
results/derip_reaminated.fasta- Corrected sequence using highest GC content sequence for fillingresults/derip_reaminated_alignment.fasta- Alignment with corrected sequence appendedresults/derip_reaminated_vizualization.png- Visualization of the alignment with RIP markupStandard Options
Algorithm overview
For each column in input alignment:
max-snp-noise= 0.4, then at least 0.6 of positions in column must be C/T or G/A).min-rip-likethreshold, AND at least one substrate and one product motif (i.e. CpA and TpA) is present, perform RIP correction in output sequence.Issues
Submit feedback to the Issue Tracker
License
Software provided under MIT license.