You can install starCAT and its dependencies via the Python Package Index.
pip install starcatpy
We tested it with scikit-learn 1.3.2, AnnData 0.9.2, and python 3.8. To run the tutorials, you also need jupyter or jupyterlab as well as scanpy and cnmf:
pip install jupyterlab scanpy cnmf
Published and custom references programs
Several gene expression program references are available for annotation with starCAT, including the T cell reference described in our manuscript. Download and learn more about them on Zenodo.
–reference - name of a default reference to download (ex. TCAT.V1) OR filepath containing a reference set of GEPs by genes (*.tsv/.csv/.txt), default is ‘TCAT.V1’
–counts - filepath to input (cell x gene) counts matrix as a matrix market (.mtx.gz), tab delimited text file, or anndata file (.h5ad)
–scores - optional path to yaml file for calculating score add-ons, not necessary for pre-built references
–output-dir - the output directory. all output will be placed in {output-dir}/{name}…’. default directory is ‘.’
–name - the output analysis prefix name, default is ‘starCAT’
For code to reproduce figures and analyses from our manuscript, please refer to the TCAT analysis Github.
starCAT website
For small datasets (smaller than ~50,000 cells or 700 MB), try running starCAT on our website.
starCAT
Implements starCellAnnoTator (AKA starCAT), annotating scRNA-Seq with predefined gene expression programs
Citation
If you use starCAT, please cite our manuscript.
Installation
You can install starCAT and its dependencies via the Python Package Index.
We tested it with scikit-learn 1.3.2, AnnData 0.9.2, and python 3.8. To run the tutorials, you also need jupyter or jupyterlab as well as scanpy and cnmf:
Published and custom references programs
Several gene expression program references are available for annotation with starCAT, including the T cell reference described in our manuscript. Download and learn more about them on Zenodo.
We also provide example scripts for constructing custom starCAT references from a single cNMF run or multiple cNMF runs. Email me at dkotliar@broadinstitute.org if you are interested in making your reference available for others to re-use.
Basic starCAT usage
Please see our tutorials in python and R. A sample pipeline using a pre-built reference programs (TCAT.V1) is shown below.
starCAT also can be run in the command line.
For code to reproduce figures and analyses from our manuscript, please refer to the TCAT analysis Github.
starCAT website
For small datasets (smaller than ~50,000 cells or 700 MB), try running starCAT on our website.