bumblebeeπ A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: β 120 (+344.44%)
openscienceEmpirical Software Engineering journal (EMSE) open science and reproducible research initiative
Stars: β 28 (+3.7%)
Drake ExamplesExample workflows for the drake R package
Stars: β 57 (+111.11%)
SarekDetect germline or somatic variants from normal or tumour/normal whole-genome or targeted sequencing
Stars: β 124 (+359.26%)
DrakeAn R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: β 1,301 (+4718.52%)
bacannotGeneric but comprehensive pipeline for prokaryotic genome annotation and interrogation with interactive reports and shiny app.
Stars: β 51 (+88.89%)
Steppy ToolkitCurated set of transformers that make your work with steppy faster and more effective π
Stars: β 21 (-22.22%)
TargetsFunction-oriented Make-like declarative workflows for R
Stars: β 293 (+985.19%)
ngs-preprocessA pipeline for preprocessing NGS data from Illumina, Nanopore and PacBio technologies
Stars: β 22 (-18.52%)
skippaSciKIt-learn Pipeline in PAndas
Stars: β 33 (+22.22%)
targets-minimalA minimal example data analysis project with the targets R package
Stars: β 50 (+85.19%)
NextflowA DSL for data-driven computational pipelines
Stars: β 1,337 (+4851.85%)
NeuraxleA Sklearn-like Framework for Hyperparameter Tuning and AutoML in Deep Learning projects. Finally have the right abstractions and design patterns to properly do AutoML. Let your pipeline steps have hyperparameter spaces. Enable checkpoints to cut duplicate calculations. Go from research to production environment easily.
Stars: β 377 (+1296.3%)
SteppyLightweight, Python library for fast and reproducible experimentation π¬
Stars: β 119 (+340.74%)
bactmapA mapping-based pipeline for creating a phylogeny from bacterial whole genome sequences
Stars: β 36 (+33.33%)
mlhandbookMy textbook for teaching Machine Learning
Stars: β 23 (-14.81%)
nifiDeploy a secured, clustered, auto-scaling NiFi service in AWS.
Stars: β 37 (+37.04%)
showyourworkFully reproducible, open source scientific articles in LaTeX.
Stars: β 361 (+1237.04%)
piper-nfRNA mapping pipeline
Stars: β 18 (-33.33%)
cliPolyaxon Core Client & CLI to streamline MLOps
Stars: β 18 (-33.33%)
rmonadPipelines you can compute on
Stars: β 66 (+144.44%)
scikit-learn-moocMachine learning in Python with scikit-learn MOOC
Stars: β 783 (+2800%)
ukbrestukbREST: efficient and streamlined data access for reproducible research of large biobanks
Stars: β 32 (+18.52%)
CubistA Python package for fitting Quinlan's Cubist regression model
Stars: β 22 (-18.52%)
verbeccComplete Conjugation of any Verb using Machine Learning for French, Spanish, Portuguese, Italian and Romanian
Stars: β 45 (+66.67%)
fretFramework for Reproducible ExperimenTs
Stars: β 20 (-25.93%)
KMeans elbowCode for determining optimal number of clusters for K-means algorithm using the 'elbow criterion'
Stars: β 35 (+29.63%)
ngs pipelineExome/Capture/RNASeq Pipeline Implementation using snakemake
Stars: β 40 (+48.15%)
DUNCode for "Depth Uncertainty in Neural Networks" (https://arxiv.org/abs/2006.08437)
Stars: β 65 (+140.74%)
tpackPack a Go workflow/function as a Unix-style pipeline command
Stars: β 55 (+103.7%)
streamalgExtensible stream pipelines with object algebras.
Stars: β 26 (-3.7%)
naasβοΈ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable β‘οΈ Production environment
Stars: β 219 (+711.11%)
dolphinnextA graphical user interface for distributed data processing of high throughput genomics
Stars: β 92 (+240.74%)
microbiomeHDCross-disease comparison of case-control gut microbiome studies
Stars: β 58 (+114.81%)
dlime experimentsIn this work, we propose a deterministic version of Local Interpretable Model Agnostic Explanations (LIME) and the experimental results on three different medical datasets shows the superiority for Deterministic Local Interpretable Model-Agnostic Explanations (DLIME).
Stars: β 21 (-22.22%)
pipelineSpline is a tool that is capable of running locally as well as part of well known pipelines like Jenkins (Jenkinsfile), Travis CI (.travis.yml) or similar ones.
Stars: β 29 (+7.41%)
ImcSegmentationPipelineA pixel classification based multiplexed image segmentation pipeline
Stars: β 62 (+129.63%)
ML-TrackThis repository is a recommended track, designed to get started with Machine Learning.
Stars: β 19 (-29.63%)
Voice4RuralA complete one stop solution for all the problems of Rural area people. π©π»βπΎ
Stars: β 12 (-55.56%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: β 24 (-11.11%)
machine-learning-data-pipelinePipeline module for parallel real-time data processing for machine learning models development and production purposes.
Stars: β 22 (-18.52%)
AutoTabularAutomatic machine learning for tabular data. β‘π₯β‘
Stars: β 51 (+88.89%)
chartsHelm charts for creating reproducible and maintainable deployments of Polyaxon with Kubernetes.
Stars: β 32 (+18.52%)
human genomics pipelineA Snakemake workflow to process single samples or cohorts of paired-end sequencing data (WGS or WES) using trim galore/bwa/GATK4/parabricks.
Stars: β 19 (-29.63%)
nlp workshop odsc europe20Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and Tβ¦
Stars: β 127 (+370.37%)
sklearn-pmml-modelA library to parse and convert PMML models into Scikit-learn estimators.
Stars: β 71 (+162.96%)
pocoInteractive pipeline filtering in PowerShell (a port of peco).
Stars: β 16 (-40.74%)
rocket-pipesPowerful pipes for TypeScript, that chain Promise and ADT for you π -> β°οΈ -> π -> π -> π
Stars: β 18 (-33.33%)