TomKellyGenetics / graphsim

Licence: GPL-3.0
R package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)

Project Status: Active. The project has reached a stable, usable state and is being actively developed.


graphsim

Version 1.0.2

Simulate Expression Data from 'igraph' Networks

This package provides functions to simulate continuous data (e.g., gene expression) from a covariance matrix (sigma) derived from a graph structure held in an 'igraph' object. It is intended to extend 'mvtnorm' to take 'igraph' structures rather than sigma matrices as input, so that simulated data account for pathway relationships and correlations. The package offers a versatile statistical framework for simulating correlated gene expression data from biological pathways by sampling from a multivariate normal distribution derived from a graph structure. The simulated expression profiles are intended to resemble log-transformed and normalised data from microarray and RNA-Seq experiments.
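
The idea of sampling from a multivariate normal distribution whose covariance comes from a graph can be sketched with 'igraph' and 'mvtnorm' alone. This is a minimal illustration, not the package's own implementation: the toy pathway, the gene names, and the rule that correlation decays geometrically with shortest-path distance are all assumptions made for the example. A tree-shaped graph is used so that the resulting matrix is positive definite.

```r
library(igraph)
library(mvtnorm)

# Toy undirected pathway (a tree): A - B, B - C, B - D, D - E
pathway <- graph_from_literal(A - B, B - C, B - D, D - E)

# Shortest-path distance between every pair of genes
path_dist <- distances(pathway)

# Illustrative covariance: correlation decays geometrically with distance.
# This decay rule is an assumption for the sketch, not graphsim's definition.
base_cor <- 0.8
sigma <- base_cor ^ path_dist

# Draw 200 samples; rmvnorm returns samples in rows and genes in columns
set.seed(1)
n_samples <- 200
expr <- rmvnorm(n_samples, mean = rep(0, vcount(pathway)), sigma = sigma)
colnames(expr) <- V(pathway)$name

# Empirical gene-gene correlations, to compare against sigma
round(cor(expr), 2)
```

Genes that are adjacent in the graph should show stronger empirical correlation than genes that are several edges apart, which is the kind of structure that network inference methods can then be tested against.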

Motivation

Network analysis of molecular biological pathways is important for insights into biology and medical genetics. Gene expression profiles capture the regulatory state of a cell and can be used to analyse complex molecular states with genome-scale data. Biological pathways are more than sets of genes involved in particular functions: they are rich in information about the relationships defined by pathway structure.

Methods to infer biological pathways and gene regulatory networks from gene expression data can be tested on simulated datasets using this framework. This also allows for pathway structures to be considered as a confounding variable when simulating gene expression data to test the performance of genomics analyses.

This package enables the generation of simulated gene expression datasets containing pathway relationships from a known underlying network. These simulated datasets can be used to evaluate various bioinformatics methodologies, including statistical and network inference procedures.

Network analysis techniques play an important role in understanding biological pathways and interpreting genomics studies. Modelling biological pathways allows the evaluation of gene regulatory network inference techniques (which so far rely on experimental validation or resampling). It also enables modelling datasets with correlated pathway structures to assess whether other genomics analysis techniques perform as expected against the background of complex pathways.

Installation

To install the latest release from CRAN:

install.packages("graphsim")

To install the stable release of this package from GitHub:

# install.packages("devtools")
devtools::install_github("TomKellyGenetics/graphsim", ref = "master")

To install the development version of this package from GitHub:

# install.packages("devtools")
devtools::install_github("TomKellyGenetics/graphsim", ref = "dev")

Usage

Please see the vignettes for demonstrations of this package on simple simulated networks and on the Reactome pathway "TGF-β receptor signaling activates SMADs" (R-HSA-2173789). An article with further details has been published in the Journal of Open Source Software.

A help menu can also be accessed within the R environment:

?graphsim
help("graphsim-package")

This will display a help page and link to documentation for each function.
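
As a quick orientation before reading the vignettes, a call might look like the sketch below. It assumes the exported function generate_expression() takes the number of samples and an 'igraph' object, with cor and mean as optional arguments; please confirm the argument list with ?generate_expression for your installed version. The four-gene pathway and its gene names are hypothetical.

```r
library(igraph)
library(graphsim)

# Hypothetical toy pathway: A activates B, which activates C and D
edges <- rbind(c("A", "B"), c("B", "C"), c("B", "D"))
pathway <- graph_from_edgelist(edges, directed = TRUE)

# Simulate 100 samples with a moderate correlation between connected genes
# (argument names assumed from the package documentation)
set.seed(42)
expr <- generate_expression(100, pathway, cor = 0.5, mean = 0)

# Inspect the shape of the simulated expression matrix
dim(expr)
```

The vignettes cover further options, such as inhibiting relationships and the built-in pathway graphs.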


Citation

To cite package 'graphsim' in publications use:

S. Thomas Kelly and Michael A. Black (2020). graphsim: Simulate Expression Data from 'igraph' Networks. R package version 1.0.2. https://github.com/TomKellyGenetics/graphsim doi:10.5281/zenodo.3931288

A BibTeX entry for LaTeX users is:

  @Manual{,
    title = {{graphsim}: Simulate Expression Data from 'igraph' Networks},
    author = {S. Thomas Kelly and Michael A. Black},
    year = {2020},
    note = {R package version 1.0.2},
    url = {https://github.com/TomKellyGenetics/graphsim},
    doi = {10.5281/zenodo.3931288},
  }

Please also cite the publication describing use of this package where appropriate.

Kelly, S.T. and Black, M.A. (2020). graphsim: An R package for simulating gene expression data from graph structures of biological pathways. Journal of Open Source Software, 5(51), 2161, https://doi.org/10.21105/joss.02161

  @article{Kelly2020joss02161,
    doi = {10.21105/joss.02161},
    url ={https://doi.org/10.21105/joss.02161},
    year = {2020},
    publisher = {The Open Journal},
    volume = {5},
    number = {51},
    pages = {2161},
    author = {S. Thomas Kelly and Michael A. Black},
    title = {graphsim: An R package for simulating gene expression data from graph structures of biological pathways},
    journal = {Journal of Open Source Software} }

This article is also available as a preprint.

S. Thomas Kelly, Michael A. Black (2020) graphsim: An R package for simulating gene expression data from graph structures of biological pathways bioRxiv 2020.03.02.972471; doi:https://doi.org/10.1101/2020.03.02.972471

@article {Kelly2020.03.02.972471,
	author = {Kelly, S Thomas and Black, Michael A},
	title = {graphsim: An R package for simulating gene expression data from graph structures of biological pathways},
	elocation-id = {2020.03.02.972471},
	year = {2020},
	doi = {10.1101/2020.03.02.972471},
	publisher = {Cold Spring Harbor Laboratory},
	URL = {https://www.biorxiv.org/content/early/2020/03/04/2020.03.02.972471},
	eprint = {https://www.biorxiv.org/content/early/2020/03/04/2020.03.02.972471.full.pdf},
	journal = {bioRxiv}
}

Contributions and Bug Reports

Please submit issues on GitHub to report problems or suggest features. Pull requests to the dev branch on GitHub are also welcome to add features or correct problems. Please see the contributor guide for more details.