1. ScihubSource code and data analyses for the Sci-Hub Coverage Study
2. TybaltTraining and evaluating a variational autoencoder for pan-cancer gene expression data
3. Covid19 ReviewA collaborative review of the emerging COVID-19 literature. Join the chat here:
4. Sprint ganPrivacy-preserving generative deep neural networks support clinical data sharing
5. PancancerBuilding classifiers using cancer transcriptomes across 33 different cancer-types
6. Continuous analysisComputational reproducibility using Continuous Integration to produce verifiable end-to-end runs of scientific analysis.
7. Deep ReviewA collaboratively written review paper on deep learning, genomics, and precision medicine
8. DapsDenoising Autoencoders for Phenotype Stratification
9. Meta ReviewManuscript describing open collaborative writing with Manubot
10. TdmR package for normalizing RNA-seq data to make them comparable to microarray data.
11. Multi PlierAn unsupervised transfer learning approach for rare disease transcriptomics
12. Hgsc subtypesTwo or three subtypes of high grade serous ovarian cancer subtypes fit data from different populations better than four
14. Gcb535challengeWe play a prediction game in our GCB 535 class. The class aims to teach students, primarily biologists, about machine learning methods and their use. This repository hosts the challenge for individuals outside of our lab.
15. RNAseq titration resultsCross-platform normalization enables machine learning model training on microarray and RNA-seq data simultaneously
16. snorkelingExtracting biomedical relationships from literature with Snorkel 🏊
17. adageData and code related to the paper "ADAGE-Based Integration of Publicly Available Pseudomonas aeruginosa..." Jie Tan, et al · mSystems · 2016
18. miQCFlexible, probablistic metrics for quality control of scRNA-seq data