All Projects → fraenkel-lab → OmicsIntegrator2

fraenkel-lab / OmicsIntegrator2

Licence: BSD-3-Clause License
Prize-Collecting Steiner Forests for Interactomes

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to OmicsIntegrator2

OmicLearn
🧪 🖥 Transparent exploration of machine learning for biomarker discovery from proteomics and omics data
Stars: ✭ 46 (+15%)
Mutual labels:  proteomics, omics
PAW pipeline
A Comet-based, best practices proteomics pipeline.
Stars: ✭ 22 (-45%)
Mutual labels:  proteomics
cosmosR
COSMOS (Causal Oriented Search of Multi-Omic Space) is a method that integrates phosphoproteomics, transcriptomics, and metabolomics data sets.
Stars: ✭ 30 (-25%)
Mutual labels:  proteomics
peptide-shaker
Interpretation of proteomics identification results
Stars: ✭ 32 (-20%)
Mutual labels:  proteomics
gneiss
compositional data analysis toolbox
Stars: ✭ 48 (+20%)
Mutual labels:  omics
LFQ-Analyst
The repo for LFQ-Analyst
Stars: ✭ 17 (-57.5%)
Mutual labels:  proteomics
searchgui
Highly adaptable common interface for proteomics search and de novo engines
Stars: ✭ 22 (-45%)
Mutual labels:  proteomics
RforProteomics
Using R and Bioconductor packages for the analysis and comprehension of proteomics data.
Stars: ✭ 52 (+30%)
Mutual labels:  proteomics
pyqms
pyQms, generalized, fast and accurate mass spectrometry data quantification
Stars: ✭ 22 (-45%)
Mutual labels:  proteomics
pia
📚 🔬 PIA - Protein Inference Algorithms
Stars: ✭ 19 (-52.5%)
Mutual labels:  proteomics
pDeep
pDeep: Predicting MS/MS Spectra of Peptides with Deep Learning
Stars: ✭ 24 (-40%)
Mutual labels:  proteomics
PTXQC
A Quality Control (QC) pipeline for Proteomics (PTX) results generated by MaxQuant
Stars: ✭ 34 (-15%)
Mutual labels:  proteomics
lightdock
Protein-protein, protein-peptide and protein-DNA docking framework based on the GSO algorithm
Stars: ✭ 110 (+175%)
Mutual labels:  proteomics
philosopher
A complete toolkit for shotgun proteomics data analysis
Stars: ✭ 82 (+105%)
Mutual labels:  proteomics
QFeatures
Quantitative features for mass spectrometry data
Stars: ✭ 12 (-70%)
Mutual labels:  proteomics
mokapot
Fast and flexible semi-supervised learning for peptide detection in Python
Stars: ✭ 25 (-37.5%)
Mutual labels:  proteomics
pyprophet
PyProphet: Semi-supervised learning and scoring of OpenSWATH results.
Stars: ✭ 23 (-42.5%)
Mutual labels:  proteomics
FALDO
Feature Annotation Location Description Ontology
Stars: ✭ 28 (-30%)
Mutual labels:  proteomics
compomics-utilities
Open source Java library for computational proteomics
Stars: ✭ 24 (-40%)
Mutual labels:  proteomics
pFind3
No description or website provided.
Stars: ✭ 20 (-50%)
Mutual labels:  proteomics

OmicsIntegrator2

Background

Omics Integrator is a package designed to integrate proteomic data, gene expression data and/or epigenetic data using a protein-protein interaction network. It identifies high-confidence, relevant subnetworks from the underlying interactome. It is comprised of two modules, Garnet and Forest. This repository holds the code for Forest in version 2 of Omics Integrator.

Forest first maps your high-throughput data onto the network. Proteins in the network are 'nodes' connected by edges representing physical interactions of two protein nodes. You should assign protein nodes prizes from your high-throughput data, i.e. the prize could be the log fold change of that protein in your system. The edges are assigned costs, often proportional to the confidence in that interaction.

network with prizes

Forest then adds a 'dummy node' to the network with edges to all of the nodes you've assigned prizes, called terminals. There are several parameters you can change in Forest. Omega, the cost of the edges between the root dummy node and the terminals, determines the number of pathways in the final solution. Beta, the relative weighting between node prizes and edge costs, determines the size of the final solution. And alpha adds a penalty to edges based on the degrees of the two nodes that the edge connects. This keeps the network from being biased towards "hub nodes", often highly studied and promiscuous proteins that may not be specific to your system.

network with parameters

Finally, Forest uses the Prize-Collecting Steiner Forest algorithm to whittle the large interactome down to relevant sub-networks, or pathways. These pathways are likely places to look for important cellular functions altered in your system. They will include some, but not all, of your terminals. They may also include "Steiner nodes", nodes that you did not assign a prize to, but that the algorithm is predicting are important to the pathways altered in your system.

final network

With the output of these sub-networks, Omics Integrator allows researchers to go from huge, often contradictory lists of genes, proteins, and metabolites from multiple -omics data sources to a few important cellular pathways to focus on in follow-up studies of their system.

Changelog:

6/20/18: 2.3.0

  • Many breaking changes, e.g. method names, data formats
  • Add tests
  • New annotation file contains process and function gene annotations on top of subcellular localizations.
  • HTML visualization improvements
  • First release with a changelog entry and github release!
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].