knit: Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (-48.54%)
HyperGBM: A full pipeline AutoML tool for tabular data
Stars: ✭ 172 (+66.99%)
framequery: SQL on dataframes - pandas and dask
Stars: ✭ 63 (-38.83%)
flox: Fast & furious GroupBy operations for dask.array
Stars: ✭ 42 (-59.22%)
dask-rasterio: Read and write rasters in parallel using Rasterio and Dask
Stars: ✭ 82 (-20.39%)
gaia: Gaia is a geospatial analysis library jointly developed by Kitware and Epidemico.
Stars: ✭ 29 (-71.84%)
madpy-dask: MadPy Dask talk materials
Stars: ✭ 33 (-67.96%)
arboreto: A scalable Python-based framework for gene regulatory network inference using tree-based ensemble regressors.
Stars: ✭ 33 (-67.96%)
datatile: A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+306.8%)
dvc dask use case: A use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
Stars: ✭ 22 (-78.64%)
esmlab: Earth System Model Lab (esmlab). ⚠️ ESMLab functionality has been moved into <https://github.com/NCAR/geocat-comp>. ⚠️
Stars: ✭ 23 (-77.67%)
mloperator: Machine Learning Operator & Controller for Kubernetes
Stars: ✭ 85 (-17.48%)
dask-awkward: Native Dask collection for awkward arrays, and the library to use it.
Stars: ✭ 25 (-75.73%)
lazycluster: 🎛 Distributed machine learning made simple.
Stars: ✭ 43 (-58.25%)
dask-pytorch-ddp: dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.
Stars: ✭ 50 (-51.46%)
bumblebee: 🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+16.5%)
mlforecast: Scalable machine 🤖 learning for time series forecasting.
Stars: ✭ 96 (-6.8%)
prefect-saturn: Python client for using Prefect Cloud with Saturn Cloud
Stars: ✭ 15 (-85.44%)
daskperiment: Reproducibility for Humans: A lightweight tool to perform reproducible machine learning experiments.
Stars: ✭ 25 (-75.73%)
dask-sql: Distributed SQL Engine in Python using Dask
Stars: ✭ 271 (+163.11%)
xarray-beam: Distributed Xarray with Apache Beam
Stars: ✭ 83 (-19.42%)
qhub: 🪴 Nebari - your open source data science platform
Stars: ✭ 175 (+69.9%)
coiled-resources: Notebooks that support blog posts and tech talks on Dask / Coiled.
Stars: ✭ 33 (-67.96%)
optimus: 🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+1211.65%)
graphchain: ⚡️ An efficient cache for the execution of dask graphs.
Stars: ✭ 63 (-38.83%)
Mars: Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+2140.78%)
Xarray: N-D labeled arrays and datasets in Python
Stars: ✭ 2,353 (+2184.47%)
Stumpy: STUMPY is a powerful and scalable Python library for modern time series analysis
Stars: ✭ 2,019 (+1860.19%)
Swifter: A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Stars: ✭ 1,844 (+1690.29%)
Dask: Parallel computing with task scheduling
Stars: ✭ 9,309 (+8937.86%)