The following two libraries are necessary for pseudo-time estimation based on the shortest path on the PCA space.
** This pseudo-time is only used for initialing SCOUP, and hence, pseudo-time estimates from other methods or experimental time can be substituted for initialization.**
LAPACK
BLAS
How to build
git clone https://github.com/hmatsu1226/SCOUP
cd SCOUP
make
Or download from “Download ZIP” button and unzip it.
Running SP
Estimate pseudo-time based on shortest path on the PCA space.
The Input_file1 is the G x C matrix of expression data (separated with ‘TAB’).
Each row corresponds to each gene, and each column corresponds to each cell.
Output_file1 : Optimized parameters related to genes and lineages
Output_file2 : Optimized parameters related to cells
Output_file3 : Log-likelihood
G : The number of genes
C : The number of cells
Options
-k INT : The number of lineages (default is 1)
-m INT : Upper bound of EM iteration (without pseudo-time optimization). The detailed explanation is described in the supplementary text. (default is 1,000)
-M INT : Upper bound of EM iteration (including pseudo-time optimization) (default is 10,000).
-a DOUBLE : Lower bound of alpha (default is 0.1)
-A DOUBLE : Upper bound of alpha (default is 100)
-t DOUBLE : Lower bound of pseudo-time (default is 0.001)
-T DOUBLE : Upper bound of pseudo-time (default is 2.0)
-s DOUBLE : Lower bound of sigma squared (default is 0.1)
SCOUP
SCOUP : a probabilistic model based on the Ornstein-Uhlenbeck process to analyze single-cell expression data during differentiation.
Reference
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1109-3
Requirements
The following two libraries are necessary for pseudo-time estimation based on the shortest path on the PCA space. ** This pseudo-time is only used for initialing SCOUP, and hence, pseudo-time estimates from other methods or experimental time can be substituted for initialization.**
How to build
Or download from “Download ZIP” button and unzip it.
Running SP
Estimate pseudo-time based on shortest path on the PCA space.
Usage
Format of Input_file1
The Input_file1 is the G x C matrix of expression data (separated with ‘TAB’). Each row corresponds to each gene, and each column corresponds to each cell.
Example of Input_file1
Format of Input_file2
The Input_file2 contains the mean and variance of the initial normal distribution.
Example of Input_file2
Format of Output_file1
The Output_file1 contains the pseudo-time estimates.
Example of Output_file1
Format of Output_file2
The Output_file2 contains the coordinates of PCA.
This file contain (C+1) lines and the last line corresponds to the root cell defined by the mean of the initial distribution.
Example of Output_file2
Running SCOUP
Estimate the parameters of Mixute Ornstein-Uhlenbeck process.
Usage
Options
Example of running SCOUP
Format of Input_file1
This is the expression data matrix data and is the same data as the Input_file1 of SP.
Format of Input_file2
This is initial distribution and is the same data as the Input_file2 of SP.
Format of Input_file3
This is the pseudo-time for initialization and is the same as the Output_file1 of SP.
Format of Output_file1
The Output_file1 contains the optimized parameters related to genes and lineages.
Example of Output_file1
Format of Output_file2
Example of Output_file2
Format of Output_file3
The log-likelihood
Exapmle of Output_file3
Running SCOUP from the middle of the activity
Re-estimate parameters from the middle of the activity.
Usage
Options
It is the same as the Options of “scoup”.
Example of running SCOUP
Format of Input_file1
This is the same as the Input_file1 of “scoup”.
Format of Input_file2
This is the same as the Input_file2 of “scoup”.
Format of Input_file3
This is the parameters related to genes and lineages and is the same as the Output_file1 of SCOUP.
Format of Input_file4
This is the parameters related to cells and is the same as the Output_file2 of “scoup”.
Format of Output_file1, 2, 3
These file are the same as the output files of SCOUP.
Running Correlation analysis
Calculate the correlation between genes after standardization.
Usage
Options
Example of running Correlation analysis
Format of Output_file1
The Output_file1 contains the standardized expression data.
Format of Output_file2
The Output_file2 contains the correlation for the standardized expression data.
License
Copyright (c) 2015 Hirotaka Matsumoto Released under the MIT license