All Projects → stracquadaniolab → pygna

stracquadaniolab / pygna

Licence: MIT license
A Python package for gene network analysis

Programming Languages

python
139335 projects - #7 most used programming language
Dockerfile
14818 projects

Projects that are alternatives of or similar to pygna

Final Project
Using Twitter Ego Network Analysis to Detect Sources of Fake News
Stars: ✭ 44 (+76%)
Mutual labels:  networkx
LNTopology
A tool to analyze the topology of Bitcoin's Lightning Network
Stars: ✭ 19 (-24%)
Mutual labels:  networkx
geneSCF inactive
GeneSCF moved to a dedicated GitHub page, https://github.com/genescf/GeneSCF
Stars: ✭ 21 (-16%)
Mutual labels:  enrichment-analysis
vrpy
A python framework for solving the VRP and its variants with column generation.
Stars: ✭ 94 (+276%)
Mutual labels:  networkx
Algorithms
Free hands-on course with the implementation (in Python) and description of several computational, mathematical and statistical algorithms.
Stars: ✭ 117 (+368%)
Mutual labels:  networkx
gqlalchemy
GQLAlchemy is a library developed with the purpose of assisting in writing and running queries on Memgraph. GQLAlchemy supports high-level connection to Memgraph as well as modular query builder.
Stars: ✭ 39 (+56%)
Mutual labels:  networkx
tsplib95
Library for working with TSPLIB files.
Stars: ✭ 48 (+92%)
Mutual labels:  networkx
EgoSplitting
A NetworkX implementation of "Ego-splitting Framework: from Non-Overlapping to Overlapping Clusters" (KDD 2017).
Stars: ✭ 78 (+212%)
Mutual labels:  networkx
flownetwork
A python package for flow network analysis
Stars: ✭ 22 (-12%)
Mutual labels:  networkx
Stellargraph
StellarGraph - Machine Learning on Graphs
Stars: ✭ 2,235 (+8840%)
Mutual labels:  networkx
networkx-neo4j
NetworkX API for Neo4j Graph Algorithms.
Stars: ✭ 98 (+292%)
Mutual labels:  networkx
netdiff
Python library for parsing network topology data (eg: dynamic routing protocols, OpenVPN, NetJSON, CNML) and detect changes.
Stars: ✭ 66 (+164%)
Mutual labels:  networkx
Osmnx
OSMnx: Python for street networks. Retrieve, model, analyze, and visualize street networks and other spatial data from OpenStreetMap.
Stars: ✭ 3,357 (+13328%)
Mutual labels:  networkx
lightflow
A lightweight, distributed workflow system
Stars: ✭ 67 (+168%)
Mutual labels:  networkx
Social-Network-Analysis-in-Python
Social Network Facebook Analysis (Python, Networkx)
Stars: ✭ 26 (+4%)
Mutual labels:  networkx
FEATHER
The reference implementation of FEATHER from the CIKM '20 paper "Characteristic Functions on Graphs: Birds of a Feather, from Statistical Descriptors to Parametric Models".
Stars: ✭ 34 (+36%)
Mutual labels:  networkx
disparity filter
Implements a disparity filter in Python, based on graphs in NetworkX, to extract the multiscale backbone of a complex weighted network (Serrano, et al., 2009)
Stars: ✭ 17 (-32%)
Mutual labels:  networkx
tlverse-handbook
🎯 📕 Targeted Learning in R: A Causal Data Science Handbook
Stars: ✭ 50 (+100%)
Mutual labels:  biostatistics
depx
Examine and visualize dependencies used by Python modules 🔍
Stars: ✭ 19 (-24%)
Mutual labels:  networkx
Awesome Community Detection
A curated list of community detection research papers with implementations.
Stars: ✭ 1,874 (+7396%)
Mutual labels:  networkx

PyGNA: a Python framework for geneset network analysis

GitHub tag (latest by date) Anaconda-Server Badge Build Release

PyGNA is a framework for statistical network analysis of high-throughput experiments. It can be used both as a standalone command line application or it can be used as API to develop custom analyses.

For an overview of PyGNA functionalities check the infographic below or dive into our Getting started tour.

Infographic

Installation

The easiest and fastest way to install pygna using conda:

$ conda install -c stracquadaniolab -c bioconda -c conda-forge pygna

Alternatively you can install it through pip:

$ pip install pygna

We also provide a docker image installation with the latest version of PyGNA. It can be easily executed from the command line from DockerHub:

$ docker run stracquadaniolab/pygna/pygna:latest

or GitHub Packages:

$ docker run docker.pkg.github.com/stracquadaniolab/pygna/pygna:latest

which will show the PyGNA command line help.

Getting started

A typical pygna analysis consists of 3 steps:

  1. Generate the RWR and SP matrices for the network you are using ( once they are generated, you won't need to repeat the same step again)
  2. Make sure that the input genesets are in the right format. If a network uses entrez ID, and your file is in HUGO symbols, use the pygna utility for the name conversion.
  3. Run the analysis you are interested into.
  4. Once you have the output tables, you can choose to visualize one or more plots.

Otherwise you can check our snakemake workflow for the full geneset analysis; our workflow contains sample data that you can use to familiarize with our software.

The examples below show some basic analysis that can be carried out with pygna.

Example 1: Running pygna GNT analysis

Running pygna on this input as follows:

$ cd ./your-path/min-working-example/

$ pygna build-rwr-diffusion barabasi.interactome.tsv --output-file interactome_RWR.hdf5

$ pygna test-topology-rwr barabasi.interactome.tsv disgenet_cancer_groups_subset.gmt interactome_RWR.hdf5 table_topology_rwr.csv --number-of-permutations 1000 --cores 4

$ pygna paint-datasets-stats table_topology_rwr.csv barplot_rwr.pdf

You can look at the plot of the results in the barplot_rwr.pdf file, and the corresponding table in table_topology_rwr.csv.

Example 2: Running pygna GNA analysis

$ cd ./your-path/min-working-example/

skip this step if the matrix is already computed

$ pygna build-RWR-diffusion barabasi.interactome.tsv --output-file interactome_RWR.hdf5

The association analysis is run N x M times (N number of genesets, M number of pathways), we use only 50 permutations in this example to avoid long computations; however, the recommended value is 1000.

$ pygna test-association-rwr barabasi.interactome.tsv disgenet_cancer_groups_subset.gmt interactome_RWR.hdf5 table_association_rwr.csv -B disgenet_cancer_groups_subset.gmt --keep --number-of-permutations 100 --cores 4

If you don't include the --results-figure flag at the comparison step, plot the matrix as follows

$ pygna paint-comparison-matrix table_association_rwr.csv heatmap_association_rwr.png --rwr --annotate

The -k flag, keeps the -B geneset and permutes only on the set A.

If setname B is not passed, the analysis is run between each couple of setnames in the geneset.

$ pygna test-association-rwr barabasi.interactome.tsv disgenet_cancer_groups_subset.gmt interactome_RWR.hdf5 table_within_comparison_rwr.csv --number-of-permutations 100 --cores 4

$ pygna paint-comparison-matrix table_within_comparison_rwr.csv heatmap_within_comparison_rwr.png --rwr --single-geneset

You can look at the plot of the results in the heatmap_within_comparison_rwr.png file, and the corresponding table in table_within_comparison_rwr.csv.

Documentation

The official documentation for pygna can be found on readthedocs.

Authors

Citation

V. Fanfani, F. Cassano, and G. Stracquadanio, “PyGNA: a unified framework for geneset network analysis,” BMC Bioinformatics, vol. 21, no. 1, 2020. DOI: https://doi.org/10.1186/s12859-020-03801-1

@article{Fanfani2020,
author = {Fanfani, Viola and Cassano, Fabio and Stracquadanio, Giovanni},
doi = {10.1186/s12859-020-03801-1},
issn = {14712105},
journal = {BMC Bioinformatics},
number = {1},
pmid = {33092528},
title = {{PyGNA: a unified framework for geneset network analysis}},
volume = {21},
year = {2020}
}

Issues

Please post an issue to report a bug or request new features.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].