xarray-beamDistributed Xarray with Apache Beam
Stars: ✭ 83 (+97.62%)
esmlabEarth System Model Lab (esmlab). ⚠️⚠️ ESMLab functionality has been moved into <https://github.com/NCAR/geocat-comp>. ⚠️⚠️
Stars: ✭ 23 (-45.24%)
XarrayN-D labeled arrays and datasets in Python
Stars: ✭ 2,353 (+5502.38%)
xarrayutilsxarrayutils.readthedocs.io/
Stars: ✭ 50 (+19.05%)
aospyPython package for automated analysis and management of gridded climate data
Stars: ✭ 80 (+90.48%)
climate systemNotes and practicals for my "Physics of the Climate System" lecture
Stars: ✭ 13 (-69.05%)
xbatcherBatch generation from xarray datasets
Stars: ✭ 93 (+121.43%)
mloperatorMachine Learning Operator & Controller for Kubernetes
Stars: ✭ 85 (+102.38%)
xmcaMaximum Covariance Analysis in Python
Stars: ✭ 41 (-2.38%)
XArrayAndRasterioExperimental code for loading/saving XArray DataArrays to Geographic Rasters using rasterio
Stars: ✭ 21 (-50%)
xarray-sentinelXarray backend to Copernicus Sentinel-1 satellite data products
Stars: ✭ 189 (+350%)
dvc dask use caseA use case of a reproducible machine learning pipeline using Dask, DVC, and MLflow.
Stars: ✭ 22 (-47.62%)
dask-sqlDistributed SQL Engine in Python using Dask
Stars: ✭ 271 (+545.24%)
spyndexAwesome Spectral Indices in Python.
Stars: ✭ 56 (+33.33%)
PyEarthScienceThe PyEarthScience repository created by DKRZ (German Climate Computing Center) provides Python scripts and Jupyter notebooks in particular for scientific data processing and visualization used in climate science. It contains scripts for visualization, I/O, and analysis using PyNGL, PyNIO, xarray, cfgrib, xesmf, cartopy, and others.
Stars: ✭ 56 (+33.33%)
deafrica-sandbox-notebooksRepository for Digital Earth Africa Sandbox, including: Jupyter notebooks, scripts, tools and workflows for geospatial analysis with Open Data Cube and xarray
Stars: ✭ 108 (+157.14%)
xwrfA lightweight interface for working with the Weather Research and Forecasting (WRF) model output in Xarray.
Stars: ✭ 45 (+7.14%)
gcpyPython toolkit for GEOS-Chem.
Stars: ✭ 34 (-19.05%)
DtaleVisualizer for pandas data structures
Stars: ✭ 2,864 (+6719.05%)
xcastA High-Performance Data Science Toolkit for the Earth Sciences
Stars: ✭ 28 (-33.33%)
arboretoA scalable python-based framework for gene regulatory network inference using tree-based ensemble regressors.
Stars: ✭ 33 (-21.43%)
HerbiePython for downloading model data (HRRR, RAP, GFS, NBM, etc.) from NOMADS, NOAA's Big Data Program partners (Amazon, Google, Microsoft), ECMWF open data, and the University of Utah Pando Archive System.
Stars: ✭ 92 (+119.05%)
lazycluster🎛 Distributed machine learning made simple.
Stars: ✭ 43 (+2.38%)
wxeeA Python interface between Earth Engine and xarray for processing time series data
Stars: ✭ 113 (+169.05%)
GleamFast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.
Stars: ✭ 2,949 (+6921.43%)
xpublishPublish Xarray Datasets via a REST API.
Stars: ✭ 86 (+104.76%)
prefect-saturnPython client for using Prefect Cloud with Saturn Cloud
Stars: ✭ 15 (-64.29%)
daskperimentReproducibility for Humans: A lightweight tool to perform reproducible machine learning experiment.
Stars: ✭ 25 (-40.48%)
clisopsClimate Simulation Operations
Stars: ✭ 17 (-59.52%)
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+185.71%)
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Stars: ✭ 54 (+28.57%)
hypothesis-gufuncExtension to hypothesis for testing numpy general universal functions
Stars: ✭ 32 (-23.81%)
qhub🪴 Nebari - your open source data science platform
Stars: ✭ 175 (+316.67%)
FoldsCUDA.jlData-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
Stars: ✭ 48 (+14.29%)
madpy-daskMadPy Dask talk materials
Stars: ✭ 33 (-21.43%)
coiled-resourcesNotebooks that support blog posts and tech talks on Dask / Coiled.
Stars: ✭ 33 (-21.43%)
optimus🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+3116.67%)
pyparEfficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Stars: ✭ 66 (+57.14%)
graphchain⚡️ An efficient cache for the execution of dask graphs.
Stars: ✭ 63 (+50%)
wax-mlA Python library for machine-learning and feedback loops on streaming data
Stars: ✭ 36 (-14.29%)
MarsMars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+5395.24%)
future.mapreduce[EXPERIMENTAL] R package: future.mapreduce - Utility Functions for Future Map-Reduce API Packages
Stars: ✭ 12 (-71.43%)
xbpchxarray interface for bpch files
Stars: ✭ 17 (-59.52%)
dask-awkwardNative Dask collection for awkward arrays, and the library to use it.
Stars: ✭ 25 (-40.48%)
resteePython package to call processed EE objects via the REST API to local data
Stars: ✭ 26 (-38.1%)
PoseidonA search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (+4169.05%)
dask-pytorch-ddpdask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.
Stars: ✭ 50 (+19.05%)
open-soqlOpen source implementation of the SOQL.
Stars: ✭ 15 (-64.29%)
mlforecastScalable machine 🤖 learning for time series forecasting.
Stars: ✭ 96 (+128.57%)
dask-rasterioRead and write rasters in parallel using Rasterio and Dask
Stars: ✭ 82 (+95.24%)
gaiaGaia is a geospatial analysis library jointly developed by Kitware and Epidemico.
Stars: ✭ 29 (-30.95%)
hack parallelThe core parallel and shared memory library used by Hack, Flow, and Pyre
Stars: ✭ 39 (-7.14%)
datatileA library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+897.62%)
goes2goDownload and process GOES-16 and GOES-17 data from NOAA's archive on AWS using Python.
Stars: ✭ 77 (+83.33%)