All Projects → daskperiment → Similar Projects or Alternatives

125 Open source projects that are alternatives of or similar to daskperiment

lazycluster
🎛 Distributed machine learning made simple.
Stars: ✭ 43 (+72%)
Mutual labels:  dask
Reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
Stars: ✭ 231 (+824%)
Mutual labels:  reproducibility
Dask
Parallel computing with task scheduling
Stars: ✭ 9,309 (+37136%)
Mutual labels:  dask
dvc dask use case
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
Stars: ✭ 22 (-12%)
Mutual labels:  dask
Dna Seq Gatk Variant Calling
This Snakemake pipeline implements the GATK best-practices workflow
Stars: ✭ 133 (+432%)
Mutual labels:  reproducibility
Mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+9132%)
Mutual labels:  dask
prefect-saturn
Python client for using Prefect Cloud with Saturn Cloud
Stars: ✭ 15 (-40%)
Mutual labels:  dask
hydra-zen
Pythonic functions for creating and enhancing Hydra applications
Stars: ✭ 165 (+560%)
Mutual labels:  reproducibility
Anaconda Project
Tool for encapsulating, running, and reproducing data science projects
Stars: ✭ 153 (+512%)
Mutual labels:  reproducibility
framequery
SQL on dataframes - pandas and dask
Stars: ✭ 63 (+152%)
Mutual labels:  dask
arboreto
A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.
Stars: ✭ 33 (+32%)
Mutual labels:  dask
Make Novice
Automation and Make
Stars: ✭ 122 (+388%)
Mutual labels:  reproducibility
graphchain
⚡️ An efficient cache for the execution of dask graphs.
Stars: ✭ 63 (+152%)
Mutual labels:  dask
mloperator
Machine Learning Operator & Controller for Kubernetes
Stars: ✭ 85 (+240%)
Mutual labels:  dask
reproducibility-guide
⛔ ARCHIVED ⛔
Stars: ✭ 119 (+376%)
Mutual labels:  reproducibility
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+380%)
Mutual labels:  dask
Stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
Stars: ✭ 2,019 (+7976%)
Mutual labels:  dask
Jupyterwith
declarative and reproducible Jupyter environments - powered by Nix
Stars: ✭ 235 (+840%)
Mutual labels:  reproducibility
xarray-beam
Distributed Xarray with Apache Beam
Stars: ✭ 83 (+232%)
Mutual labels:  dask
Popper
Container-native task automation engine.
Stars: ✭ 216 (+764%)
Mutual labels:  reproducibility
knit
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (+112%)
Mutual labels:  dask
Renku
The Renku Project provides a platform and tools for reproducible and collaborative data analysis.
Stars: ✭ 141 (+464%)
Mutual labels:  reproducibility
coiled-resources
Notebooks that support blog posts and tech talks on Dask / Coiled.
Stars: ✭ 33 (+32%)
Mutual labels:  dask
Datapackager
An R package to enable reproducible data processing, packaging and sharing.
Stars: ✭ 125 (+400%)
Mutual labels:  reproducibility
dask-rasterio
Read and write rasters in parallel using Rasterio and Dask
Stars: ✭ 82 (+228%)
Mutual labels:  dask
madpy-dask
MadPy Dask talk materials
Stars: ✭ 33 (+32%)
Mutual labels:  dask
Steppy
Lightweight, Python library for fast and reproducible experimentation 🔬
Stars: ✭ 119 (+376%)
Mutual labels:  reproducibility
targets-tutorial
Short course on the targets R package
Stars: ✭ 87 (+248%)
Mutual labels:  reproducibility
datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+1576%)
Mutual labels:  dask
qhub
🪴 Nebari - your open source data science platform
Stars: ✭ 175 (+600%)
Mutual labels:  dask
esmlab
Earth System Model Lab (esmlab). ⚠️⚠️ ESMLab functionality has been moved into <https://github.com/NCAR/geocat-comp>. ⚠️⚠️
Stars: ✭ 23 (-8%)
Mutual labels:  dask
fertile
creating optimal conditions for reproducibility
Stars: ✭ 52 (+108%)
Mutual labels:  reproducibility
dask-awkward
Native Dask collection for awkward arrays, and the library to use it.
Stars: ✭ 25 (+0%)
Mutual labels:  dask
EasyGitianBuilder
🔨 Gitian Building made simpler on any Windows Debian/Ubuntu MacOS with Vagrant, lxc, and virtualbox
Stars: ✭ 18 (-28%)
Mutual labels:  reproducibility
dask-pytorch-ddp
dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.
Stars: ✭ 50 (+100%)
Mutual labels:  dask
Xarray
N-D labeled arrays and datasets in Python
Stars: ✭ 2,353 (+9312%)
Mutual labels:  dask
mlforecast
Scalable machine 🤖 learning for time series forecasting.
Stars: ✭ 96 (+284%)
Mutual labels:  dask
benchmark VAE
Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)
Stars: ✭ 1,211 (+4744%)
Mutual labels:  reproducibility
codex-africanus
Radio Astronomy Algorithms Library
Stars: ✭ 13 (-48%)
Mutual labels:  dask
Swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Stars: ✭ 1,844 (+7276%)
Mutual labels:  dask
Mach Nix
Create highly reproducible python environments
Stars: ✭ 231 (+824%)
Mutual labels:  reproducibility
dask-sql
Distributed SQL Engine in Python using Dask
Stars: ✭ 271 (+984%)
Mutual labels:  dask
Catalyst
Accelerated deep learning R&D
Stars: ✭ 2,804 (+11116%)
Mutual labels:  reproducibility
dask-ec2
Start a cluster in EC2 for dask.distributed
Stars: ✭ 103 (+312%)
Mutual labels:  dask
Plynx
PLynx is a domain agnostic platform for managing reproducible experiments and data-oriented workflows.
Stars: ✭ 192 (+668%)
Mutual labels:  reproducibility
synthesizing-robust-adversarial-examples
My entry for ICLR 2018 Reproducibility Challenge for paper Synthesizing robust adversarial examples https://openreview.net/pdf?id=BJDH5M-AW
Stars: ✭ 60 (+140%)
Mutual labels:  reproducibility
Nn Template
Generic template to bootstrap your PyTorch project with PyTorch Lightning, Hydra, W&B, and DVC.
Stars: ✭ 145 (+480%)
Mutual labels:  reproducibility
HyperGBM
A full pipeline AutoML tool for tabular data
Stars: ✭ 172 (+588%)
Mutual labels:  dask
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (+448%)
Mutual labels:  reproducibility
binderhub-deploy
Deploy a BinderHub from scratch on Microsoft Azure
Stars: ✭ 27 (+8%)
Mutual labels:  reproducibility
Batchtools
Tools for computation on batch systems
Stars: ✭ 127 (+408%)
Mutual labels:  reproducibility
flox
Fast & furious GroupBy operations for dask.array
Stars: ✭ 42 (+68%)
Mutual labels:  dask
Rl Medical
Deep Reinforcement Learning (DRL) agents applied to medical images
Stars: ✭ 123 (+392%)
Mutual labels:  reproducibility
narps
Code related to Neuroimaging Analysis Replication and Prediction Study
Stars: ✭ 31 (+24%)
Mutual labels:  reproducibility
gaia
Gaia is a geospatial analysis library jointly developed by Kitware and Epidemico.
Stars: ✭ 29 (+16%)
Mutual labels:  dask
analysis-flow
Data Analysis Workflows & Reproducibility Learning Resources
Stars: ✭ 108 (+332%)
Mutual labels:  reproducibility
rna-seq-kallisto-sleuth
A Snakemake workflow for differential expression analysis of RNA-seq data with Kallisto and Sleuth.
Stars: ✭ 56 (+124%)
Mutual labels:  reproducibility
software-dev
Coding Standards for the USC Biostats group
Stars: ✭ 33 (+32%)
Mutual labels:  reproducibility
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+5304%)
Mutual labels:  dask
php-uavt-adreskodu-botu
Php ile uavt adres kodu botu
Stars: ✭ 2 (-92%)
Mutual labels:  dask
1-60 of 125 similar projects