All Projects → idekerlab → Network_evaluation_tools

idekerlab / Network_evaluation_tools

Licence: mit
Python 2.7 package with examples for evaluating a network's ability to group a given node set in network proximity.

Projects that are alternatives of or similar to Network evaluation tools

Kaggle Bike Sharing Demand
Stars: ✭ 24 (-4%)
Mutual labels:  jupyter-notebook
Computational Linguistics Class.github.io
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Mlhep2017
MLHEP 2017 slides & seminars
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Aprendaprog
Aprenda a Programar
Stars: ✭ 24 (-4%)
Mutual labels:  jupyter-notebook
Option Pricing Under Uncertainty
By means of stochastic volatility models
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Shapedetector keras coreml
Deep learning installed Shape Detector using Keras and CoreML 📱
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Pythonfundamentos
Repositório do Curso Online Python Fundamentos Para Análise de Dados.
Stars: ✭ 924 (+3596%)
Mutual labels:  jupyter-notebook
Sentence Aspect Category Detection
Aspect-Based Sentiment Analysis
Stars: ✭ 24 (-4%)
Mutual labels:  jupyter-notebook
Happiness Prediction
Sentimental analysis using keras in python
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Databases workshop
RCS Intro to Databases workshop materials
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Jupyter Workings
A collection of Jupyter notebook workings
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Tf Keras Surgeon
Pruning and other network surgery for trained TF.Keras models.
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Openmined demo
This is the code for "How Do We Democratize Access to Data?" by Siraj Raval on Youtube
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Spml4dm
Statistics and Machine Learning for Data Mining
Stars: ✭ 24 (-4%)
Mutual labels:  jupyter-notebook
Threader
Easy Twitter threads with Python
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Lre
NIST Language i-vector Machine Learning Challenge
Stars: ✭ 24 (-4%)
Mutual labels:  jupyter-notebook
Datadoubleconfirm
Simple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.
Stars: ✭ 24 (-4%)
Mutual labels:  jupyter-notebook
Variational Autoencoder
PyTorch implementation of "Auto-Encoding Variational Bayes"
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Oscon Neo4j Recommendations
Jupyter notebook for OSCON 2017 Neo4j recommendations workshop
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook
Cryptocompare Api
Jupyter Notebook with examples of useful CryptoCompare API calls
Stars: ✭ 25 (+0%)
Mutual labels:  jupyter-notebook

Network Evaluation Tools

Network Evaluation Tools is a Python 2.7 package with corresponding examples for evaluating a network's ability to group a given node set in network proximity. This package was developed as a part of the work done in Huang and Carlin et al. 2018.

Modules in this package

  • data_import_tools - This module contains functions for helping import network files and gene set files for analysis.
  • gene_conversion_tools - This module contains functions for helping convert, filter, and save networks from their raw database form. Used in the Network Processing Jupyter Notebooks.
  • miscellaneous_functions - This module contains various functions developed to help with analysis along the way. These functions are not well tested and may contain bugs. These functions were generally used to determine other network performance metrics on network recovery of gene sets.
  • network_evaluation_functions - This module contains many of the core functions of the set-based network evaluation algorithm.
  • network_propagation - This module contains functions to help with network propagation steps used in the set-based network evaluation algorithm.

Version and Dendencies

Currently, the network_evaluation_tools package requires Python 2.7 - Python 2.7.13. Note that some functions in this package may not work with Python 3.0+. network_evaluation_tools requires:

  • Argparse >= 1.1
  • NetworkX >= 2.1
  • Numpy >= 1.11.0
  • Matplotlib >= 1.5.1
  • Pandas >= 0.19.0
  • Requests >= 2.13.0
  • Scipy >= 0.17.0
  • Scikit-learn >= 0.17.1

Note:

  • In Pandas v0.20.0+, the .ixindexer has been deprecated. There may be warning regarding this issue, yet the function still works.

Installation

  1. Clone the repository
  2. cd to new respository
  3. Execute following command:
    python setup.py install

Network analysis

  1. If the network needs to be normalized to a particular naming scheme:
    A Jupyter Notebook describing how each network was processed from the raw download file in the original paper can be found in the Network Processing Notebooks folder.
  2. There are two ways to perform the network evaluation on a gene set:
    The following network analyses can be performed either from a Jupyter Notebook or from the command line (see Network Evaluation Examples folder). Jupyter notebooks are documented within the notebook and the documentation for the python scripts can be seen using the command python [script_name].py -h.

Data provided in this repository (see Data Folder)

  • Database Citations - An Excel file containing details about all of the networks used in the original paper's analysis and affiliated citations for all of the databases used.
  • DisGeNET / Oncogenic Component Gene Sets - Two tab separated files, each line containing a gene set from either DisGeNET or the Oncogenic Component collection. The first column of each file is the name of the gene set followed by the list of genes associated with that given gene set on the same line.
  • Network performance (AUPRCs) on DisGeNET / Oncogenic Component Gene Sets - Two csv files containing the raw Z-normalized AUPRC scores (network performance scores) of each network analyzed on each gene set analyzed from DisGeNET or the Oncogenic Component gene set collection.
  • Network performance effect sizes on DisGeNET / Oncogenic Component Gene Sets - Two csv files containing the relative performance gain of each network's AUPRC score over the median null AUPRC score for each gene set analyzed from DisGeNET or the Oncogenic Component gene set collection.

Issues

Please feel free to post issues/bug reports. Questions can be sent to [email protected]

License

See the LICENSE file for license rights and limitations (MIT).

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].