The High Throughput Sequencing Cell Authentication Toolkit (seqCAT) is an
R-package for authenticating, evaluating and characterisation of cells using
single nucleotide variants (SNVs) from sequencing data. Its input data should
be on the form of VCF files, i.e. output from variant callers such as
the Genome Analysis ToolKit and annotated with software such as
SnpEff.
Installation
The seqCAT package is available on both Bioconductor and here on
GitHub. You can install the latest, stable version from Bioconductor like so:
To list the versions of seqCAT available on Conda, you can use the search
functionality:
conda search -c bioconda bioconductor-seqcat
Usage
The general workflow of seqCAT consists of three steps:
1. Creation of SNV profiles
2. Comparisons of SNV profiles
3. Evaluation of profile comparisons
# Load the package
library("seqCAT")
# Path to the example VCF file
vcf <- system.file("extdata", "example.vcf.gz", package = "seqCAT")
# Create SNV profiles
hct116 <- create_profile(vcf, "HCT116")
hke3 <- create_profile(vcf, "HKE3")
rko <- create_profile(vcf, "RKO")
# Compare all profiles to each other
profiles <- list(hct116, hke3, rko)
comparisons <- compare_many(profiles)
# Create an heatmap of comparisons and their similarity scores
plot_heatmap(comparisons[[1]])
For more detailed instructions on how to use seqCAT, please see the
vignette.
Citation
If you are using seqCAT to analyse your data, please cite the
following article:
seqCAT: a Bioconductor R-package for variant analysis of high throughputsequencing data Fasterius E. and Al-Khalili Szigyarto C.
F1000Research (2018), 7:1466
https://f1000research.com/articles/7-1466
License
The seqCAT package is released with a MIT licence and is a free software: you
may redistribute and/or modify it under the terms of the license. For more
information, please see the LICENCE file.
Overview
The High Throughput Sequencing Cell Authentication Toolkit (seqCAT) is an R-package for authenticating, evaluating and characterisation of cells using single nucleotide variants (SNVs) from sequencing data. Its input data should be on the form of VCF files, i.e. output from variant callers such as the Genome Analysis ToolKit and annotated with software such as SnpEff.
Installation
The
seqCATpackage is available on both Bioconductor and here on GitHub. You can install the latest, stable version from Bioconductor like so:If you are interested in the development version of
seqCAT, you can install it from GitHub:You may also install
seqCATusing Conda:To list the versions of
seqCATavailable on Conda, you can use thesearchfunctionality:Usage
The general workflow of
seqCATconsists of three steps:For more detailed instructions on how to use
seqCAT, please see the vignette.Citation
If you are using
seqCATto analyse your data, please cite the following article:License
The
seqCATpackage is released with a MIT licence and is a free software: you may redistribute and/or modify it under the terms of the license. For more information, please see theLICENCEfile.