Top 33 dask open source projects

Mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Xarray
N-D labeled arrays and datasets in Python
Swifter
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Dask
Parallel computing with task scheduling
dask-ec2
Start a cluster in EC2 for dask.distributed
knit
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
framequery
SQL on dataframes - pandas and dask
flox
Fast & furious GroupBy operations for dask.array
dask-rasterio
Read and write rasters in parallel using Rasterio and Dask
gaia
Gaia is a geospatial analysis library jointly developed by Kitware and Epidemico.
arboreto
A scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.
dvc dask use case
A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
esmlab
Earth System Model Lab (esmlab). ⚠️⚠️ ESMLab functionality has been moved into <https://github.com/NCAR/geocat-comp>. ⚠️⚠️
dask-awkward
Native Dask collection for awkward arrays, and the library to use it.
dask-pytorch-ddp
dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
mlforecast
Scalable machine 🤖 learning for time series forecasting.
prefect-saturn
Python client for using Prefect Cloud with Saturn Cloud
codex-africanus
Radio Astronomy Algorithms Library
daskperiment
Reproducibility for Humans: A lightweight tool to perform reproducible machine learning experiment.
dask-sql
Distributed SQL Engine in Python using Dask
xarray-beam
Distributed Xarray with Apache Beam
coiled-resources
Notebooks that support blog posts and tech talks on Dask / Coiled.
graphchain
⚡️ An efficient cache for the execution of dask graphs.
1-33 of 33 dask projects