All Projects → google → xarray-beam

google / xarray-beam

Licence: Apache-2.0 license
Distributed Xarray with Apache Beam

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to xarray-beam

pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (-22.89%)
Mutual labels:  xarray, zarr
flox
Fast & furious GroupBy operations for dask.array
Stars: ✭ 42 (-49.4%)
Mutual labels:  xarray, dask
esmlab
Earth System Model Lab (esmlab). ⚠️⚠️ ESMLab functionality has been moved into <https://github.com/NCAR/geocat-comp>. ⚠️⚠️
Stars: ✭ 23 (-72.29%)
Mutual labels:  xarray, dask
Xarray
N-D labeled arrays and datasets in Python
Stars: ✭ 2,353 (+2734.94%)
Mutual labels:  xarray, dask
jet
Jet is a simple OOP, dynamically typed, functional language that runs on the Erlang virtual machine (BEAM). Jet's syntax is Ruby-like syntax.
Stars: ✭ 22 (-73.49%)
Mutual labels:  beam
Scio
A Scala API for Apache Beam and Google Cloud Dataflow.
Stars: ✭ 2,247 (+2607.23%)
Mutual labels:  beam
Mad
⚡ MAD: Manage Dependencies
Stars: ✭ 175 (+110.84%)
Mutual labels:  beam
Prom ex
An Elixir Prometheus metrics collection library built on top of Telemetry with accompanying Grafana dashboards
Stars: ✭ 149 (+79.52%)
Mutual labels:  beam
qhub
🪴 Nebari - your open source data science platform
Stars: ✭ 175 (+110.84%)
Mutual labels:  dask
deafrica-sandbox-notebooks
Repository for Digital Earth Africa Sandbox, including: Jupyter notebooks, scripts, tools and workflows for geospatial analysis with Open Data Cube and xarray
Stars: ✭ 108 (+30.12%)
Mutual labels:  xarray
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+1527.71%)
Mutual labels:  dask
Pytorch Beam Search Decoding
PyTorch implementation of beam search decoding for seq2seq models
Stars: ✭ 204 (+145.78%)
Mutual labels:  beam
coiled-resources
Notebooks that support blog posts and tech talks on Dask / Coiled.
Stars: ✭ 33 (-60.24%)
Mutual labels:  dask
Lam
🚀 a lightweight, universal actor-model vm for writing scalable and reliable applications that run natively and on WebAssembly
Stars: ✭ 176 (+112.05%)
Mutual labels:  beam
iridium
A register-based VM in Rust
Stars: ✭ 60 (-27.71%)
Mutual labels:  beam
Codec Beam
Generate Erlang VM byte code from Haskell
Stars: ✭ 152 (+83.13%)
Mutual labels:  beam
xmca
Maximum Covariance Analysis in Python
Stars: ✭ 41 (-50.6%)
Mutual labels:  xarray
xbatcher
Batch generation from xarray datasets
Stars: ✭ 93 (+12.05%)
Mutual labels:  xarray
graphchain
⚡️ An efficient cache for the execution of dask graphs.
Stars: ✭ 63 (-24.1%)
Mutual labels:  dask
Beam
Beam: the open source Reddit client for iOS
Stars: ✭ 217 (+161.45%)
Mutual labels:  beam

Xarray-Beam

Xarray-Beam is a Python library for building Apache Beam pipelines with Xarray datasets.

The project aims to facilitate data transformations and analysis on large-scale multi-dimensional labeled arrays, such as:

  • Ad-hoc computation on Xarray data, by dividing a xarray.Dataset into many smaller pieces ("chunks").
  • Adjusting array chunks, using the Rechunker algorithm.
  • Ingesting large, multi-dimensional array datasets into an analysis-ready, cloud-optimized format, namely Zarr (see also Pangeo Forge).
  • Calculating statistics (e.g., "climatology") across distributed datasets with arbitrary groups.

For more about our approach and how to get started, read the documentation!

Warning: Xarray-Beam is a sharp tool 🔪

Xarray-Beam is relatively new, and focused on expert users:

  • We use it extensively at Google for processing large-scale weather datasets, but there is not yet a vibrant external community.
  • It provides low-level abstractions that facilitate writing very large scale data pipelines (e.g., 100+ TB), but by design it requires explicitly thinking about how every operation is parallelized.

Installation

Xarray-Beam requires recent versions of immutabledict, Xarray, Dask, Rechunker, Zarr, and Apache Beam. For best performance when writing Zarr files, use Xarray 0.19.0 or later.

Disclaimer

Xarray-Beam is an experiment that we are sharing with the outside world in the hope that it will be useful. It is not a supported Google product. We welcome feedback, bug reports and code contributions, but cannot guarantee they will be addressed.

See the "Contribution guidelines" for more.

Credits

Contributors:

  • Stephan Hoyer
  • Jason Hickey
  • Cenk Gazen
  • Alex Merose
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].