All Projects → Data-Engineering-Projects → Similar Projects or Alternatives

993 Open source projects that are alternatives of or similar to Data-Engineering-Projects

Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+174.25%)
jobAnalytics and search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
Stars: ✭ 25 (-85.03%)
Data Engineering Nanodegree
Projects done in the Data Engineering Nanodegree by Udacity.com
Stars: ✭ 151 (-9.58%)
Mutual labels:  postgres, cassandra, data-engineering
Soda Sql
Metric collection, data testing and monitoring for SQL accessible data
Stars: ✭ 173 (+3.59%)
Mutual labels:  airflow, data-engineering
Data Science Stack Cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
Stars: ✭ 153 (-8.38%)
Mutual labels:  postgres, airflow
pipeline
PipelineAI Kubeflow Distribution
Stars: ✭ 4,154 (+2387.43%)
Mutual labels:  airflow, cassandra
Airflow Autoscaling Ecs
Airflow Deployment on AWS ECS Fargate Using Cloudformation
Stars: ✭ 136 (-18.56%)
Mutual labels:  airflow, data-engineering
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (-61.08%)
Mutual labels:  data-warehouse, data-engineering
viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
Stars: ✭ 110 (-34.13%)
Mutual labels:  airflow, data-engineering
Quill
Compile-time Language Integrated Queries for Scala
Stars: ✭ 1,998 (+1096.41%)
Mutual labels:  postgres, cassandra
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (-50.9%)
Mutual labels:  airflow, data-engineering
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-68.26%)
Mutual labels:  airflow, data-engineering
Sqlpad
Web-based SQL editor run in your own private cloud. Supports MySQL, Postgres, SQL Server, Vertica, Crate, ClickHouse, Trino, Presto, SAP HANA, Cassandra, Snowflake, BigQuery, SQLite, and more with ODBC
Stars: ✭ 4,113 (+2362.87%)
Mutual labels:  postgres, cassandra
Migrate
Database migrations. CLI and Golang library.
Stars: ✭ 7,712 (+4517.96%)
Mutual labels:  postgres, cassandra
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (-13.77%)
Mutual labels:  data-warehouse, data-engineering
airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (-33.53%)
Mutual labels:  airflow, data-engineering
Goodreads etl pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+374.85%)
Mutual labels:  airflow, data-engineering
Migrate
Database migrations. CLI and Golang library.
Stars: ✭ 2,315 (+1286.23%)
Mutual labels:  postgres, cassandra
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-88.02%)
Mutual labels:  airflow, data-engineering
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-85.63%)
Mutual labels:  airflow, data-engineering
Around Dataengineering
A Data Engineering & Machine Learning Knowledge Hub
Stars: ✭ 257 (+53.89%)
Mutual labels:  airflow, data-engineering
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (-52.69%)
Mutual labels:  postgres, airflow
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-46.71%)
Mutual labels:  airflow, cassandra
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-19.16%)
Mutual labels:  postgres, airflow
Azure-Certification-DP-200
Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution
Stars: ✭ 54 (-67.66%)
Mutual labels:  data-engineering, data-lake
Cassandra-Data-Modeling
Basic Rules of Cassandra Data Modeling
Stars: ✭ 29 (-82.63%)
Mutual labels:  cassandra, data-modeling
agent
Job tracker & performance platform
Stars: ✭ 26 (-84.43%)
Mutual labels:  postgres
skyline-query
Simple implementation of spatial skyline query algorithms
Stars: ✭ 17 (-89.82%)
Mutual labels:  data-modeling
factory
Docker microservice & Crawler by scrapy
Stars: ✭ 56 (-66.47%)
Mutual labels:  scrapy
FsCassy
Functional F# API for Cassandra
Stars: ✭ 20 (-88.02%)
Mutual labels:  cassandra
cassandra-migration
Apache Cassandra / DataStax Enterprise database migration (schema evolution) library
Stars: ✭ 51 (-69.46%)
Mutual labels:  cassandra
wait-for-pg
Check if PostgreSQL database is ready
Stars: ✭ 22 (-86.83%)
Mutual labels:  postgres
general-angular
Realtime Angular Admin/CRUD Front End App
Stars: ✭ 24 (-85.63%)
Mutual labels:  postgres
docker-elassandra
Docker Image packaging for Elassandra
Stars: ✭ 25 (-85.03%)
Mutual labels:  cassandra
api.pokedextracker.com
API for pokedextracker.com
Stars: ✭ 38 (-77.25%)
Mutual labels:  postgres
scrapy.dart
Scrapy, a fast high-level web crawling & scraping framework for dart and Flutter
Stars: ✭ 50 (-70.06%)
Mutual labels:  scrapy
scraping-ebay
Scraping Ebay's products using Scrapy Web Crawling Framework
Stars: ✭ 79 (-52.69%)
Mutual labels:  scrapy
scrapy-distributed
A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (-77.25%)
Mutual labels:  scrapy
erdiagram
Entity-Relationship diagram code generator library
Stars: ✭ 28 (-83.23%)
Mutual labels:  postgres
create-fastify-app
An utility that help you to generate or add plugin to your Fastify project
Stars: ✭ 53 (-68.26%)
Mutual labels:  postgres
pg migrate
Manage postgres schema, triggers, procedures, and views
Stars: ✭ 25 (-85.03%)
Mutual labels:  postgres
kubernetes-examples
A bunch of examples of how to deploy things on kubernetes
Stars: ✭ 34 (-79.64%)
Mutual labels:  cassandra
cassandra-client
Cassandra 3 GUI client
Stars: ✭ 49 (-70.66%)
Mutual labels:  cassandra
libpq.framework
An XCode project to compile your own libpq.framework for iOS 11.x
Stars: ✭ 27 (-83.83%)
Mutual labels:  postgres
docker-postgresql-pro-1c
Dockerfile для сборки PostgreSQL под 1С:Предприятие 8
Stars: ✭ 27 (-83.83%)
Mutual labels:  postgres
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-91.02%)
Mutual labels:  scrapy
nim-gatabase
Connection-Pooling Compile-Time ORM for Nim
Stars: ✭ 103 (-38.32%)
Mutual labels:  postgres
terraform-aws-druid
Terraform module to deploy Apache Druid in Kubernetes
Stars: ✭ 16 (-90.42%)
Mutual labels:  postgres
cassandra.realtime
Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-85.03%)
Mutual labels:  cassandra
proxi
Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.
Stars: ✭ 32 (-80.84%)
Mutual labels:  scrapy
Commando
[DEPRECATED] ⚫ Commando Discord bot built on discord.js-commando.
Stars: ✭ 78 (-53.29%)
Mutual labels:  postgres
pg-error-enum
TypeScript Enum for Postgres Errors with no runtime dependencies. Also compatible with plain JavaScript.
Stars: ✭ 18 (-89.22%)
Mutual labels:  postgres
hiveberg
Demonstration of a Hive Input Format for Iceberg
Stars: ✭ 22 (-86.83%)
Mutual labels:  data-lake
Scrapy IPProxyPool
免费 IP 代理池。Scrapy 爬虫框架插件
Stars: ✭ 100 (-40.12%)
Mutual labels:  scrapy
metamapper
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Stars: ✭ 60 (-64.07%)
Mutual labels:  data-warehouse
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-73.65%)
Mutual labels:  data-engineering
postgresql-resilient
Automatic re-connection support for PostgreSQL.
Stars: ✭ 16 (-90.42%)
Mutual labels:  postgres
elves
🎊 Design and implement of lightweight crawler framework.
Stars: ✭ 322 (+92.81%)
Mutual labels:  scrapy
logparser
A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
Stars: ✭ 70 (-58.08%)
Mutual labels:  scrapy
EFCore.Cassandra
Entity Framework Core provider for Cassandra
Stars: ✭ 23 (-86.23%)
Mutual labels:  cassandra
1-60 of 993 similar projects