All Projects → kundajelab → Dragonn

kundajelab / Dragonn

Licence: mit
A toolkit to learn how to model and interpret regulatory sequence data using deep learning.

Projects that are alternatives of or similar to Dragonn

Janggu
Deep learning infrastructure for bioinformatics
Stars: ✭ 174 (-21.62%)
Mutual labels:  jupyter-notebook, genomics
Gcp For Bioinformatics
GCP Essentials for Bioinformatics Researchers
Stars: ✭ 95 (-57.21%)
Mutual labels:  jupyter-notebook, genomics
Sklearn pycon2014
Repository containing files for my PyCon 2014 scikit-learn tutorial.
Stars: ✭ 221 (-0.45%)
Mutual labels:  jupyter-notebook
Ai Platform Samples
Official Repo for Google Cloud AI Platform
Stars: ✭ 222 (+0%)
Mutual labels:  jupyter-notebook
Ownphotos
Self hosted alternative to Google Photos
Stars: ✭ 2,587 (+1065.32%)
Mutual labels:  jupyter-notebook
Team Learning Rs
主要存储Datawhale组队学习中“推荐系统”方向的资料。
Stars: ✭ 216 (-2.7%)
Mutual labels:  jupyter-notebook
Deep Vector Quantization
VQVAEs, GumbelSoftmaxes and friends
Stars: ✭ 222 (+0%)
Mutual labels:  jupyter-notebook
Scikit Geometry
Scientific Python Geometric Algorithms Library
Stars: ✭ 220 (-0.9%)
Mutual labels:  jupyter-notebook
Machine Learning Notebooks
Machine Learning notebooks for refreshing concepts.
Stars: ✭ 222 (+0%)
Mutual labels:  jupyter-notebook
Vqa demo
Visual Question Answering Demo on pretrained model
Stars: ✭ 222 (+0%)
Mutual labels:  jupyter-notebook
Covid 19
Ciência de Dados aplicada à pandemia do novo coronavírus.
Stars: ✭ 223 (+0.45%)
Mutual labels:  jupyter-notebook
Sec
Seed, Expand, Constrain: Three Principles for Weakly-Supervised Image Segmentation
Stars: ✭ 221 (-0.45%)
Mutual labels:  jupyter-notebook
Melusine
Melusine is a high-level library for emails classification and feature extraction "dédiée aux courriels français".
Stars: ✭ 222 (+0%)
Mutual labels:  jupyter-notebook
Navigan
Navigating the GAN Parameter Space for Semantic Image Editing
Stars: ✭ 221 (-0.45%)
Mutual labels:  jupyter-notebook
Bowtie
An ultrafast memory-efficient short read aligner
Stars: ✭ 221 (-0.45%)
Mutual labels:  genomics
Lfortran
Official mirror of https://gitlab.com/lfortran/lfortran. Please submit pull requests (PR) there. Any PR sent here will be closed automatically.
Stars: ✭ 220 (-0.9%)
Mutual labels:  jupyter-notebook
Htmresearch
Experimental algorithms. Unsupported.
Stars: ✭ 221 (-0.45%)
Mutual labels:  jupyter-notebook
Data Augmentation For Wearable Sensor Data
A sample code of data augmentation methods for wearable sensor data (time-series data)
Stars: ✭ 222 (+0%)
Mutual labels:  jupyter-notebook
Ipython Notebooks
A collection of IPython notebooks covering various topics.
Stars: ✭ 2,543 (+1045.5%)
Mutual labels:  jupyter-notebook
Skylift
Wi-Fi Geolocation Spoofing with the ESP8266
Stars: ✭ 223 (+0.45%)
Mutual labels:  jupyter-notebook

DragoNN

Build Status license

The dragonn package implements Deep RegulAtory GenOmic Neural Networks (DragoNNs) for predictive modeling of regulatory genomics, nucleotide-resolution feature discovery, and simulations for systematic development and benchmarking.

demo

Installation

To install the latest released version of DragoNN, install the Anaconda python distribution. Then, run:

conda install dragonn -c kundajelab

DragoNN is compatible with Python2 and Python3. Specific optional features such as DeepLIFT and MOE are compatible with Python2 only.

15 seconds to your first DragoNN model

The dragonn package provides a simple command line interface to train DragoNN models, test them, and predict on sequence data. Train an example model by running:

dragonn train --pos-sequences examples/example_pos_sequences.fa --neg-sequences examples/example_neg_sequences.fa --prefix training_example

This will store a model file, training_example.model.json, with the model architecture and a weights file, training_example.weights.h5, with the parameters of the trained model. Test the model by running:

dragonn test --pos-sequences examples/example_pos_sequences.fa --neg-sequences examples/example_neg_sequences.fa --arch-file training_example.arch.json --weights-file training_example.weights.h5

This will print the model's test performance metrics. Model predictions on sequence data can be obtained by running:

dragonn predict --sequences examples/example_pos_sequences.fa --arch-file training_example.arch.json --weights-file training_example.weights.h5 --output-file example_predictions.txt

This will store the model predictions for sequences in example_pos_sequences.fa in the output file example_predictions.txt. Interpret sequence data with a dragonn model by running:

dragonn interpret --sequences examples/example_pos_sequences.fa --arch-file training_example.arch.json --weights-file training_example.weights.h5 --prefix example_interpretation

This will write the most important subsequence in each input sequence along with its location in the input sequence in the file example_interpretation.task_0.important_sequences.txt. Note: by default, only examples with predicted positive class probability >0.5 are interpreted. Examples below this threshold yield important subsequence of Ns with location -1. This default can be changed with the flag --pos-threshold.

We encourage DragoNN users to share models in the Kipoi Genomics Model Zoo. Enjoy!

DragoNN paper supplement

We provide trained models, data, and code in the paper supplement to reproduce results in the DragoNN manuscript.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].