103 open source projects that are alternatives to or similar to neon-workshop

Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (+315.79%)
Mutual labels:  data-engineering
Ansible Playbook
Ansible playbook to deploy distributed technologies
Stars: ✭ 61 (+221.05%)
Mutual labels:  data-engineering
Waimak
Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.
Stars: ✭ 60 (+215.79%)
Mutual labels:  data-engineering
Quilt
Quilt is a self-organizing data hub for S3
Stars: ✭ 1,007 (+5200%)
Mutual labels:  data-engineering
Dbt Sqlserver
dbt adapter for SQL Server and Azure SQL
Stars: ✭ 41 (+115.79%)
Mutual labels:  data-engineering
Data Science On Gcp
Source code accompanying the book Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+4447.37%)
Mutual labels:  data-engineering
Lakefs
Git-like capabilities for your object storage
Stars: ✭ 847 (+4357.89%)
Mutual labels:  data-engineering
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+4073.68%)
Mutual labels:  data-engineering
Prefect
The easiest way to automate your data
Stars: ✭ 7,956 (+41773.68%)
Mutual labels:  data-engineering
Pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+3305.26%)
Mutual labels:  data-engineering
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+3231.58%)
Mutual labels:  data-engineering
Pointblank
Data validation and organization of metadata for data frames and database tables
Stars: ✭ 480 (+2426.32%)
Mutual labels:  data-engineering
Data Engineering Book
Accumulated knowledge and experience in the field of Data Engineering
Stars: ✭ 471 (+2378.95%)
Mutual labels:  data-engineering
Udacity Data Engineering Projects
A few projects related to Data Engineering, including Data Modeling, Infrastructure setup on cloud, Data Warehousing, and Data Lake development.
Stars: ✭ 458 (+2310.53%)
Mutual labels:  data-engineering
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+30468.42%)
Mutual labels:  data-engineering
Active workflow
Turn complex requirements into workflows without leaving the comfort of your technology stack.
Stars: ✭ 413 (+2073.68%)
Mutual labels:  data-engineering
Awesome Opensource Data Engineering
An Awesome List of Open-Source Data Engineering Projects
Stars: ✭ 381 (+1905.26%)
Mutual labels:  data-engineering
Learn Something Every Day
📝 A compilation of everything that I learn: Computer Science, Software Development, Engineering, Math, and Coding in General.
Stars: ✭ 362 (+1805.26%)
Mutual labels:  data-engineering
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+1700%)
Mutual labels:  data-engineering
Egeria
Open Metadata and Governance
Stars: ✭ 328 (+1626.32%)
Mutual labels:  data-engineering
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+19400%)
Mutual labels:  data-engineering
Around Dataengineering
A Data Engineering & Machine Learning Knowledge Hub
Stars: ✭ 257 (+1252.63%)
Mutual labels:  data-engineering
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+25789.47%)
Mutual labels:  data-engineering
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+13457.89%)
Mutual labels:  data-engineering
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+51631.58%)
Mutual labels:  data-engineering
etl manager
A Python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-26.32%)
Mutual labels:  data-engineering
ClassifyBot
Automate building ML classification pipelines in .NET
Stars: ✭ 16 (-15.79%)
Mutual labels:  data-engineering
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+12226.32%)
Mutual labels:  data-engineering
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (+15.79%)
Mutual labels:  data-engineering
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (+236.84%)
Mutual labels:  data-engineering
yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
Stars: ✭ 1,038 (+5363.16%)
Mutual labels:  data-engineering
Kaggle-project-list
Summary of my projects on kaggle
Stars: ✭ 20 (+5.26%)
Mutual labels:  data-engineering
mpc-DL-controller
Deep Neural Network architecture as a predictive optimal controller for a disturbance-afflicted {HVAC + solar cell + battery} system vs. classic Model Predictive Control
Stars: ✭ 37 (+94.74%)
Mutual labels:  data-engineering
DataEngineering
This repo contains commands that data engineers use in day to day work.
Stars: ✭ 47 (+147.37%)
Mutual labels:  data-engineering
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (+778.95%)
Mutual labels:  data-engineering
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (+31.58%)
Mutual labels:  data-engineering
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+131.58%)
Mutual labels:  data-engineering
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (+478.95%)
Mutual labels:  data-engineering
Dagster
An orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+21473.68%)
Mutual labels:  data-pipelines
Hub
Dataset format for AI. Build, manage, & visualize datasets for deep learning. Stream data real-time to PyTorch/TensorFlow & version-control it. https://activeloop.ai
Stars: ✭ 4,003 (+20968.42%)
Mutual labels:  data-pipelines
arakat
ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform
Stars: ✭ 23 (+21.05%)
Mutual labels:  data-pipelines
spark-transformers
Spark-Transformers: Library for exporting Apache Spark MLlib models to use them in any Java application with no other dependencies.
Stars: ✭ 39 (+105.26%)
Mutual labels:  data-pipelines
rivery cli
Rivery CLI
Stars: ✭ 16 (-15.79%)
Mutual labels:  data-pipelines