526 open source projects that are alternatives to or similar to Data Engineering Howto

AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-99.03%)
Mutual labels:  data-engineering, data-pipeline
practical-data-engineering
Real estate dagster pipeline
Stars: ✭ 110 (-94.65%)
Mutual labels:  data-engineering, data-pipeline
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-98.78%)
Mutual labels:  data-engineering, data-pipeline
Rd Blender Docker
A collection of Docker containers for running Blender headless or distributed ✨
Stars: ✭ 111 (-94.6%)
Mutual labels:  distributed-systems
Circuitbreaker.net
Circuit Breaker pattern for .NET
Stars: ✭ 116 (-94.36%)
Mutual labels:  distributed-systems
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-93.87%)
Mutual labels:  data-engineering
Go Grpc
A simpler grpc framework
Stars: ✭ 133 (-93.53%)
Mutual labels:  distributed-systems
Superset
Apache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+1973.64%)
Mutual labels:  data-engineering
Pyro5
Pyro 5 - Python remote objects for modern python versions
Stars: ✭ 123 (-94.02%)
Mutual labels:  distributed-systems
Door Slam
Distributed, Online, and Outlier Resilient SLAM for Robotic Teams
Stars: ✭ 107 (-94.8%)
Mutual labels:  distributed-systems
Adaptdl
Resource-adaptive cluster scheduler for deep learning training.
Stars: ✭ 100 (-95.14%)
Mutual labels:  distributed-systems
Playground
A new kind of virtual event platform 🐧
Stars: ✭ 120 (-94.16%)
Mutual labels:  distributed-systems
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+25.29%)
Mutual labels:  data-engineering
Xaynet
Xaynet represents an agnostic Federated Machine Learning framework to build privacy-preserving AI applications.
Stars: ✭ 111 (-94.6%)
Mutual labels:  distributed-systems
Saltie
🚗 Rocket League Distributed Deep Reinforcement Learning Bot
Stars: ✭ 134 (-93.48%)
Mutual labels:  distributed-systems
Nginx Lua Redis Rate Measuring
A Lua library that provides distributed rate measurement using nginx + Redis; you can use it to build a throttling system across many nodes.
Stars: ✭ 109 (-94.7%)
Mutual labels:  distributed-systems
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+16%)
Mutual labels:  data-engineering
Micro
Micro is a distributed cloud operating system
Stars: ✭ 10,778 (+424.22%)
Mutual labels:  distributed-systems
Airflow Autoscaling Ecs
Airflow Deployment on AWS ECS Fargate Using Cloudformation
Stars: ✭ 136 (-93.39%)
Mutual labels:  data-engineering
Hermes
Hermes: a fault-tolerant replication protocol, implemented over RDMA, guaranteeing linearizability and achieving low latency and high throughput.
Stars: ✭ 105 (-94.89%)
Mutual labels:  distributed-systems
Lottor
Distributed transaction service based on reliable messages; an implementation of flexible distributed transactions built on reliable messaging.
Stars: ✭ 122 (-94.07%)
Mutual labels:  distributed-systems
Rucio
Rucio - Scientific Data Management
Stars: ✭ 131 (-93.63%)
Mutual labels:  distributed-systems
Foundatio
Pluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (-33.61%)
Mutual labels:  distributed-systems
Orbit
Orbit - Virtual actor framework for building distributed systems
Stars: ✭ 1,585 (-22.91%)
Mutual labels:  distributed-systems
Short Url
A simple implementation of a distributed short-URL service
Stars: ✭ 100 (-95.14%)
Mutual labels:  distributed-systems
Zookeeper
Apache ZooKeeper
Stars: ✭ 10,061 (+389.35%)
Mutual labels:  distributed-systems
Scalecube Cluster
ScaleCube Cluster is a lightweight Java VM implementation of SWIM: Scalable Weakly-consistent Infection-style Process Group Membership Protocol. It features cluster membership, failure detection, and a gossip protocol library.
Stars: ✭ 119 (-94.21%)
Mutual labels:  distributed-systems
Panic Server
Testing for collaborative apps and tools
Stars: ✭ 128 (-93.77%)
Mutual labels:  distributed-systems
D6t Python
Accelerate data science
Stars: ✭ 118 (-94.26%)
Mutual labels:  data-engineering
Temporal
Temporal service
Stars: ✭ 3,212 (+56.23%)
Mutual labels:  distributed-systems
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (-26.51%)
Mutual labels:  data-engineering
Pipelinex
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-93.82%)
Mutual labels:  data-engineering
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (-24.9%)
Mutual labels:  distributed-systems
Mit6.824 distributedsystem
MIT 6.824 Distributed Systems (Fall 2018)
Stars: ✭ 135 (-93.43%)
Mutual labels:  distributed-systems
Pegasus
Pegasus Workflow Management System - Automate, recover, and debug scientific computations.
Stars: ✭ 110 (-94.65%)
Mutual labels:  distributed-systems
Faust
Python Stream Processing. A Faust fork
Stars: ✭ 124 (-93.97%)
Mutual labels:  distributed-systems
Dotnet Istanbul Microservices Demo
This is the demo application that I created for my talk 'Microservice Architecture & Implementation with Asp.Net Core' at Dotnet İstanbul Meetup Group.
Stars: ✭ 109 (-94.7%)
Mutual labels:  distributed-systems
Vertx In Action
Examples for the Manning "Vert.x in Action" book
Stars: ✭ 134 (-93.48%)
Mutual labels:  distributed-systems
Etcd
Distributed reliable key-value store for the most critical data of a distributed system
Stars: ✭ 38,238 (+1759.82%)
Mutual labels:  distributed-systems
Mangle
Git Repository for the Mangle tool
Stars: ✭ 125 (-93.92%)
Mutual labels:  distributed-systems
Awesome Distributed Systems
Awesome list of distributed systems resources
Stars: ✭ 1,466 (-28.7%)
Mutual labels:  distributed-systems
Examples
DC/OS examples
Stars: ✭ 139 (-93.24%)
Mutual labels:  distributed-systems
Parapet
A purely functional library to build distributed and event-driven systems
Stars: ✭ 106 (-94.84%)
Mutual labels:  distributed-systems
Dtcraft
A High-performance Cluster Computing Engine
Stars: ✭ 122 (-94.07%)
Mutual labels:  distributed-systems
Rapid
Rapid is a scalable distributed membership service
Stars: ✭ 103 (-94.99%)
Mutual labels:  distributed-systems
Go Archaius
A dynamic configuration framework used in distributed systems
Stars: ✭ 133 (-93.53%)
Mutual labels:  distributed-systems
Jupiter
Jupiter is a lightweight distributed service framework with very good performance
Stars: ✭ 1,372 (-33.27%)
Mutual labels:  distributed-systems
Spark Alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-94.07%)
Mutual labels:  data-engineering
Gossip Python
Implementation of the gossip protocol
Stars: ✭ 100 (-95.14%)
Mutual labels:  distributed-systems
Swim Js
JavaScript implementation of SWIM membership protocol
Stars: ✭ 135 (-93.43%)
Mutual labels:  distributed-systems
Library
Collection of papers in the field of distributed systems, game theory, cryptography, cryptoeconomics, zero knowledge
Stars: ✭ 100 (-95.14%)
Mutual labels:  distributed-systems
Sandglass
Sandglass is a distributed, horizontally scalable, persistent, time sorted message queue.
Stars: ✭ 1,531 (-25.54%)
Mutual labels:  distributed-systems
Kronos
Distributed Time Synchronization Service
Stars: ✭ 131 (-93.63%)
Mutual labels:  distributed-systems
Specs
COALA IP is a blockchain-ready, community-driven protocol for intellectual property licensing.
Stars: ✭ 98 (-95.23%)
Mutual labels:  distributed-systems
Zatt
Python implementation of the Raft algorithm for distributed consensus
Stars: ✭ 119 (-94.21%)
Mutual labels:  distributed-systems
Zookeeper Cpp
A ZooKeeper client for C++.
Stars: ✭ 98 (-95.23%)
Mutual labels:  distributed-systems
Xraft
xnnyygn's raft implementation
Stars: ✭ 99 (-95.18%)
Mutual labels:  distributed-systems
Bifrost
Pure rust building block for distributed systems
Stars: ✭ 118 (-94.26%)
Mutual labels:  distributed-systems
Orleans.clustering.kubernetes
Orleans Membership provider for Kubernetes
Stars: ✭ 140 (-93.19%)
Mutual labels:  distributed-systems
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-93.34%)
Mutual labels:  data-engineering