All Projects → daskperiment → Similar Projects or Alternatives

125 Open source projects that are alternatives of or similar to daskperiment

🎛 Distributed machine learning made simple.

Stars: ✭ 43 (+72%)

Mutual labels: dask

ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

Stars: ✭ 231 (+824%)

Mutual labels: reproducibility

Dask

Parallel computing with task scheduling

Stars: ✭ 9,309 (+37136%)

Mutual labels: dask

dvc dask use case

A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.

Stars: ✭ 22 (-12%)

Mutual labels: dask

Dna Seq Gatk Variant Calling

This Snakemake pipeline implements the GATK best-practices workflow

Stars: ✭ 133 (+432%)

Mutual labels: reproducibility

Mars

Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

Stars: ✭ 2,308 (+9132%)

Mutual labels: dask

prefect-saturn

Python client for using Prefect Cloud with Saturn Cloud

Stars: ✭ 15 (-40%)

Mutual labels: dask

hydra-zen

Pythonic functions for creating and enhancing Hydra applications

Stars: ✭ 165 (+560%)

Mutual labels: reproducibility

Anaconda Project

Tool for encapsulating, running, and reproducing data science projects

Stars: ✭ 153 (+512%)

Mutual labels: reproducibility

framequery

SQL on dataframes - pandas and dask

Stars: ✭ 63 (+152%)

Mutual labels: dask

arboreto

A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.

Stars: ✭ 33 (+32%)

Mutual labels: dask

Make Novice

Automation and Make

Stars: ✭ 122 (+388%)

Mutual labels: reproducibility

graphchain

⚡️ An efficient cache for the execution of dask graphs.

Stars: ✭ 63 (+152%)

Mutual labels: dask

mloperator

Machine Learning Operator & Controller for Kubernetes

Stars: ✭ 85 (+240%)

Mutual labels: dask

reproducibility-guide

⛔ ARCHIVED ⛔

Stars: ✭ 119 (+376%)

Mutual labels: reproducibility

bumblebee

🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)

Stars: ✭ 120 (+380%)

Mutual labels: dask

Stumpy

STUMPY is a powerful and scalable Python library for modern time series analysis

Stars: ✭ 2,019 (+7976%)

Mutual labels: dask

Jupyterwith

declarative and reproducible Jupyter environments - powered by Nix

Stars: ✭ 235 (+840%)

Mutual labels: reproducibility

xarray-beam

Distributed Xarray with Apache Beam

Stars: ✭ 83 (+232%)

Mutual labels: dask

Popper

Container-native task automation engine.

Stars: ✭ 216 (+764%)

Mutual labels: reproducibility

knit

Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead

Stars: ✭ 53 (+112%)

Mutual labels: dask

Renku

The Renku Project provides a platform and tools for reproducible and collaborative data analysis.

Stars: ✭ 141 (+464%)

Mutual labels: reproducibility

coiled-resources

Notebooks that support blog posts and tech talks on Dask / Coiled.

Stars: ✭ 33 (+32%)

Mutual labels: dask

Datapackager

An R package to enable reproducible data processing, packaging and sharing.

Stars: ✭ 125 (+400%)

Mutual labels: reproducibility

dask-rasterio

Read and write rasters in parallel using Rasterio and Dask

Stars: ✭ 82 (+228%)

Mutual labels: dask

madpy-dask

MadPy Dask talk materials

Stars: ✭ 33 (+32%)

Mutual labels: dask

Steppy

Lightweight, Python library for fast and reproducible experimentation 🔬

Stars: ✭ 119 (+376%)

Mutual labels: reproducibility

targets-tutorial

Short course on the targets R package

Stars: ✭ 87 (+248%)

Mutual labels: reproducibility

datatile

A library for managing, validating, summarizing, and visualizing data.

Stars: ✭ 419 (+1576%)

Mutual labels: dask

qhub

🪴 Nebari - your open source data science platform

Stars: ✭ 175 (+600%)

Mutual labels: dask

esmlab

Earth System Model Lab (esmlab). ⚠️⚠️ ESMLab functionality has been moved into <https://github.com/NCAR/geocat-comp>. ⚠️⚠️

Stars: ✭ 23 (-8%)

Mutual labels: dask

fertile

creating optimal conditions for reproducibility

Stars: ✭ 52 (+108%)

Mutual labels: reproducibility

dask-awkward

Native Dask collection for awkward arrays, and the library to use it.

Stars: ✭ 25 (+0%)

Mutual labels: dask

EasyGitianBuilder

🔨 Gitian Building made simpler on any Windows Debian/Ubuntu MacOS with Vagrant, lxc, and virtualbox

Stars: ✭ 18 (-28%)

Mutual labels: reproducibility

dask-pytorch-ddp

dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.

Stars: ✭ 50 (+100%)

Mutual labels: dask

Xarray

N-D labeled arrays and datasets in Python

Stars: ✭ 2,353 (+9312%)

Mutual labels: dask

mlforecast

Scalable machine 🤖 learning for time series forecasting.

Stars: ✭ 96 (+284%)

Mutual labels: dask

benchmark VAE

Unifying Variational Autoencoder (VAE) implementations in Pytorch (NeurIPS 2022)

Stars: ✭ 1,211 (+4744%)

Mutual labels: reproducibility

codex-africanus

Radio Astronomy Algorithms Library

Stars: ✭ 13 (-48%)

Mutual labels: dask

Swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

Stars: ✭ 1,844 (+7276%)

Mutual labels: dask

Mach Nix

Create highly reproducible python environments

Stars: ✭ 231 (+824%)

Mutual labels: reproducibility

dask-sql

Distributed SQL Engine in Python using Dask

Stars: ✭ 271 (+984%)

Mutual labels: dask

Catalyst

Accelerated deep learning R&D

Stars: ✭ 2,804 (+11116%)

Mutual labels: reproducibility

dask-ec2

Start a cluster in EC2 for dask.distributed

Stars: ✭ 103 (+312%)

Mutual labels: dask

Plynx

PLynx is a domain agnostic platform for managing reproducible experiments and data-oriented workflows.

Stars: ✭ 192 (+668%)

Mutual labels: reproducibility

synthesizing-robust-adversarial-examples

My entry for ICLR 2018 Reproducibility Challenge for paper Synthesizing robust adversarial examples https://openreview.net/pdf?id=BJDH5M-AW

Stars: ✭ 60 (+140%)

Mutual labels: reproducibility

Nn Template

Generic template to bootstrap your PyTorch project with PyTorch Lightning, Hydra, W&B, and DVC.

Stars: ✭ 145 (+480%)

Mutual labels: reproducibility

HyperGBM

A full pipeline AutoML tool for tabular data

Stars: ✭ 172 (+588%)

Mutual labels: dask

Accelerator

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Stars: ✭ 137 (+448%)

Mutual labels: reproducibility

binderhub-deploy

Deploy a BinderHub from scratch on Microsoft Azure

Stars: ✭ 27 (+8%)

Mutual labels: reproducibility

Batchtools

Tools for computation on batch systems

Stars: ✭ 127 (+408%)

Mutual labels: reproducibility

flox

Fast & furious GroupBy operations for dask.array

Stars: ✭ 42 (+68%)

Mutual labels: dask

Rl Medical

Deep Reinforcement Learning (DRL) agents applied to medical images

Stars: ✭ 123 (+392%)

Mutual labels: reproducibility

narps

Code related to Neuroimaging Analysis Replication and Prediction Study

Stars: ✭ 31 (+24%)

Mutual labels: reproducibility

gaia

Gaia is a geospatial analysis library jointly developed by Kitware and Epidemico.

Stars: ✭ 29 (+16%)

Mutual labels: dask

analysis-flow

Data Analysis Workflows & Reproducibility Learning Resources

Stars: ✭ 108 (+332%)

Mutual labels: reproducibility

rna-seq-kallisto-sleuth

A Snakemake workflow for differential expression analysis of RNA-seq data with Kallisto and Sleuth.

Stars: ✭ 56 (+124%)

Mutual labels: reproducibility

software-dev

Coding Standards for the USC Biostats group

Stars: ✭ 33 (+32%)

Mutual labels: reproducibility

optimus

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

Stars: ✭ 1,351 (+5304%)

Mutual labels: dask

php-uavt-adreskodu-botu

Php ile uavt adres kodu botu

Stars: ✭ 2 (-92%)

Mutual labels: dask

1-60 of 125 similar projects

›