526 open source projects that are alternatives to or similar to Data Engineering Howto

Prefect
The easiest way to automate your data
Stars: ✭ 7,956 (+286.96%)
Mutual labels:  data-engineering
yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (-49.51%)
Mutual labels:  data-engineering
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (-24.9%)
Mutual labels:  distributed-systems
hlclock
Hybrid Logical Clocks for Elixir
Stars: ✭ 46 (-97.76%)
Mutual labels:  distributed-systems
Xingo
High-performance Golang networking library and game development scaffold
Stars: ✭ 727 (-64.64%)
Mutual labels:  distributed-systems
auklet
Auklet is a high performance storage engine based on Openstack Swift
Stars: ✭ 86 (-95.82%)
Mutual labels:  distributed-systems
Go2p
Simple to use but fully configurable p2p framework
Stars: ✭ 80 (-96.11%)
Mutual labels:  distributed-systems
mpc-DL-controller
Deep Neural Network architecture as a predictive optimal controller for {HVAC+Solar cell + battery} disturbance afflicted system vs classic Model Predictive Control
Stars: ✭ 37 (-98.2%)
Mutual labels:  data-engineering
Talent Plan
Open source training courses about distributed databases and distributed systems
Stars: ✭ 6,965 (+238.76%)
Mutual labels:  distributed-systems
Gauntlet
🔖 Guides, Articles, Podcasts, Videos and Notes to Build Reliable Large-Scale Distributed Systems.
Stars: ✭ 336 (-83.66%)
Mutual labels:  distributed-systems
Mit6.824 distributedsystem
MIT 6.824 Distributed Systems (Fall 2018)
Stars: ✭ 135 (-93.43%)
Mutual labels:  distributed-systems
reacted
Actor based reactive java framework for microservices in local and distributed environment
Stars: ✭ 17 (-99.17%)
Mutual labels:  distributed-systems
Node
Mysterium Network Node - official implementation of distributed VPN network (dVPN) protocol
Stars: ✭ 681 (-66.88%)
Mutual labels:  distributed-systems
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (-97.71%)
Mutual labels:  data-engineering
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-96.16%)
Mutual labels:  data-engineering
pixie
Instant Kubernetes-Native Application Observability
Stars: ✭ 3,238 (+57.49%)
Mutual labels:  distributed-systems
Snowplow
The enterprise-grade behavioral data engine (web, mobile, server-side, webhooks), running cloud-natively on AWS and GCP
Stars: ✭ 5,935 (+188.67%)
Mutual labels:  data-pipeline
file management sys
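file_management_sys is a file-sharing system consisting of a front-end file display system and a back-end administration system, built on SpringBoot + MyBatis. The front-end file display system includes modules for file categorization and display, file search, and file upload. The back-end administration system includes modules for file management, permission management, and more.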
Stars: ✭ 60 (-97.08%)
Mutual labels:  distributed-systems
Pegasus
Pegasus Workflow Management System - Automate, recover, and debug scientific computations.
Stars: ✭ 110 (-94.65%)
Mutual labels:  distributed-systems
Systemizer
A system design tool that allows you to simulate data flow of distributed systems.
Stars: ✭ 1,219 (-40.71%)
Mutual labels:  distributed-systems
Hivemind
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
Stars: ✭ 661 (-67.85%)
Mutual labels:  distributed-systems
research
distributed system; blockchain; filecoin/ipfs,...
Stars: ✭ 39 (-98.1%)
Mutual labels:  distributed-systems
Kbfs
Keybase Filesystem (KBFS)
Stars: ✭ 1,218 (-40.76%)
Mutual labels:  distributed-systems
ml-in-production
The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Stars: ✭ 29 (-98.59%)
Mutual labels:  data-engineering
Partisan
High-performance, high-scalability distributed computing with Erlang and Elixir.
Stars: ✭ 652 (-68.29%)
Mutual labels:  distributed-systems
MIT6.824-2017-Chinese
A Chinese version of MIT 6.824 (Distributed System)
Stars: ✭ 67 (-96.74%)
Mutual labels:  distributed-systems
Faust
Python Stream Processing. A Faust fork
Stars: ✭ 124 (-93.97%)
Mutual labels:  distributed-systems
augraphy
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Stars: ✭ 49 (-97.62%)
Mutual labels:  data-pipeline
Lightctr
Lightweight and scalable framework that combines mainstream Click-Through-Rate prediction algorithms based on a computational DAG, the Parameter Server philosophy, and Ring-AllReduce collective communication.
Stars: ✭ 644 (-68.68%)
Mutual labels:  distributed-systems
nkn-shell-daemon
NKN shell daemon
Stars: ✭ 29 (-98.59%)
Mutual labels:  distributed-systems
Rsf
Now a subproject of Hasor; migrated to: http://git.oschina.net/zycgit/hasor
Stars: ✭ 77 (-96.25%)
Mutual labels:  distributed-systems
richflow
A Node.js and JavaScript synchronous data pipeline processing, data sharing and stream processing library. Actionable & Transformable Pipeline data processing.
Stars: ✭ 17 (-99.17%)
Mutual labels:  data-pipeline
Ray
This project is no longer updated; new project: https://github.com/RayTale/Vertex
Stars: ✭ 635 (-69.11%)
Mutual labels:  distributed-systems
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (-91.88%)
Mutual labels:  data-engineering
Dotnet Istanbul Microservices Demo
This is the demo application that I created for my talk 'Microservice Architecture & Implementation with Asp.Net Core' at Dotnet İstanbul Meetup Group.
Stars: ✭ 109 (-94.7%)
Mutual labels:  distributed-systems
Faust
Python Stream Processing
Stars: ✭ 5,899 (+186.92%)
Mutual labels:  distributed-systems
Go Queue
Multi backend queues for Golang
Stars: ✭ 15 (-99.27%)
Mutual labels:  distributed-systems
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+80.2%)
Mutual labels:  data-engineering
Anubis
Distributed LMS for automating Computing Science Courses From NYU
Stars: ✭ 184 (-91.05%)
Mutual labels:  distributed-systems
Trustgraph
Decentralized trust ratings using signed claims
Stars: ✭ 75 (-96.35%)
Mutual labels:  distributed-systems
protoactor-go
Proto Actor - Ultra fast distributed actors for Go, C# and Java/Kotlin
Stars: ✭ 4,138 (+101.26%)
Mutual labels:  distributed-systems
Elasticdl
Kubernetes-native Deep Learning Framework
Stars: ✭ 604 (-70.62%)
Mutual labels:  distributed-systems
awesome-list-of-awesomes
A curated list of all the Awesome --Topic Name-- lists I've found till date relevant to Data lifecycle, ML and DL.
Stars: ✭ 259 (-87.4%)
Mutual labels:  distributed-systems
Vertx In Action
Examples for the Manning "Vert.x in Action" book
Stars: ✭ 134 (-93.48%)
Mutual labels:  distributed-systems
nact
nact ⇒ node.js + actors ⇒ your services have never been so µ
Stars: ✭ 1,003 (-51.22%)
Mutual labels:  distributed-systems
Pixie
Instant Kubernetes-Native Application Observability
Stars: ✭ 589 (-71.35%)
Mutual labels:  distributed-systems
teracache
Scalable, fault-tolerant, highly-available cache
Stars: ✭ 15 (-99.27%)
Mutual labels:  distributed-systems
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+378.06%)
Mutual labels:  data-engineering
zimfarm
Farm operated by bots to grow and harvest new zim files
Stars: ✭ 58 (-97.18%)
Mutual labels:  distributed-systems
Golimit
Golimit is Uber ringpop based distributed and decentralized rate limiter
Stars: ✭ 581 (-71.74%)
Mutual labels:  distributed-systems
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (-94.65%)
Mutual labels:  data-engineering
Etcd
Distributed reliable key-value store for the most critical data of a distributed system
Stars: ✭ 38,238 (+1759.82%)
Mutual labels:  distributed-systems
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+122.81%)
Mutual labels:  distributed-systems
Faang
Facebook, Amazon, Apple, Netflix and Google (FAANG) Job preparation.
Stars: ✭ 557 (-72.91%)
Mutual labels:  distributed-systems
Orleans.clustering.kubernetes
Orleans Membership provider for Kubernetes
Stars: ✭ 140 (-93.19%)
Mutual labels:  distributed-systems
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-93.34%)
Mutual labels:  data-engineering
Wukong
A graph-based distributed in-memory store that leverages efficient graph exploration to provide highly concurrent and low-latency queries over big linked data
Stars: ✭ 134 (-93.48%)
Mutual labels:  distributed-systems
Nano
Lightweight, easy-to-use, high-performance Golang-based game server framework
Stars: ✭ 1,888 (-8.17%)
Mutual labels:  distributed-systems
Raft Rs
Raft distributed consensus algorithm implemented in Rust.
Stars: ✭ 1,859 (-9.58%)
Mutual labels:  distributed-systems
Rails Disco
Distributed Rails with commands, events and projections.
Stars: ✭ 95 (-95.38%)
Mutual labels:  distributed-systems