目录

funOmics Package

Active R-CMD-check license GitHub R package version

The funOmics R package is a collection of functions for aggregating omics data into higher-level functional representations such as pathways, protein complexes, and cellular locations. The package provides a tool for aggregating omics data from high-throughput experiments (e.g. transcriptomics, metabolomics, proteomics) into higher-level functional activity scores that can then be used for further analysis and modeling. The package provides different pooling operators, such as aggregation statistics (mean, median, standard deviation, min, max), dimension-reduction derived scores (pca, nmf, mds, pathifier), or test statistics (t-test, Wilcoxon test, Kolmogorov–Smirnov test) with options for adjusting parameters and settings to suit specific research questions and data types. The package is also well-documented, with detailed descriptions of each function and an example of usage.

Installation

Install funOmics from Bioconductor (release 3.19 onwards) via:

if (!require("BiocManager", quietly = TRUE))
    install.packages("BiocManager")

BiocManager::install("funOmics")

or the pre-release and latest development version from GitHub:

if (!require("devtools", quietly = TRUE))
    install.packages("devtools")

devtools::install_github("elisagdelope/funomics") 

Usage

To use the Funomics R package, you can load the package using the following command:

library(funOmics)

You can then access the main function provided by the package, summarize_pathway_level, with the type of pooling operator desired to be applied for each molecular set. The available aggregation operators and other parameters options are described in detail in the package documentation.

This function has several options for adjusting parameters and settings to suit specific research questions and data types. Here is an example usage, with a simulated gene expression matrix X of dimensions p*n, and a list of 100 gene sets pathways. Pathway activity is summarized using the mean pooling aggregation for those sets containing at least 12 genes. Note that you can adjust the minsize and type of aggregation as desired.

# Example usage:
p <- 10000
n <- 20
X <- matrix(rnorm(p * n), nrow = p, dimnames = list(paste0("g", 1:p), paste0("s", 1:n)))
pathways <- as.list(sample(10:100, size = 100, replace = TRUE))
pathways <- lapply(pathways, function(n, p) paste0("g", sample(1:p, size = n, replace = FALSE)), p)
names(pathways) <- paste0("pathways", 1:length(pathways))
pathway_activity <- summarize_pathway_level(X, pathways, type = "mean", minsize = 12)

This example mimics gene expression data and pathway gene sets, but funOmics can be used to aggregate other types of omics data and molecular sets. For example, it can be similarly applied to gene expression data and gene sets of GO terms or protein complexes of the CORUM database. It can also be applied to a metabolomics matrix X and KEGG metabolic pathways.

If you have any questions or issues with the funOmics R package, please contact elisa.gomezdelope@uni.lu. I welcome feedback and suggestions for improving the package.

Disclaimer

The R package funOmics implements functions for aggregating omics data into higher-level functional representations.

Copyright © 2024 Elisa Gómez de Lope, University of Luxembourg, Luxembourg Centre for Systems Biomedicine (LCSB), Biomedical Data Science (BDS)

This program is free software: you can redistribute it and/or modify it under the terms of the MIT License. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; including but not limited to the warranties of MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, and NONINFRINGEMENT. See the terms of the MIT License for more details.

关于

功能组学分析工具,用于整合和解释基因组学数据

120.0 KB
邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802047560号