632 open source projects that are alternatives to or similar to versatile-data-kit

beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (-54.86%)
rivery cli
Rivery CLI
Stars: ✭ 16 (-88.89%)
Mutual labels:  etl, dataops, elt, data-pipelines
contessa
Easy way to define, execute and store quality rules for your data.
Stars: ✭ 17 (-88.19%)
dbd
dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (-79.17%)
Mutual labels:  etl, snowflake, elt
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (-45.14%)
Mutual labels:  etl, snowflake, elt
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+3315.97%)
Mutual labels:  etl, data-engineering, elt
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-84.72%)
Mutual labels:  etl, data-engineering, elt
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-83.33%)
Mutual labels:  etl, data-engineering, data-pipelines
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-89.58%)
Mutual labels:  etl, data-warehouse
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+6725.69%)
Mutual labels:  data-engineering, data-engineer
Applied Ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+12277.78%)
Mutual labels:  data-engineering, data-quality
hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (-74.31%)
Mutual labels:  etl, data-engineering
Dagster
An orchestration platform for the development, production, and observation of data assets.
Stars: ✭ 4,099 (+2746.53%)
Mutual labels:  etl, data-pipelines
ml-in-production
The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
Stars: ✭ 29 (-79.86%)
Mutual labels:  data-engineering, data-pipelines
Azure-Certification-DP-200
Road to Azure Data Engineer Part-I: DP-200 - Implementing an Azure Data Solution
Stars: ✭ 54 (-62.5%)
Mutual labels:  data-engineering, data-engineer
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+1526.39%)
Mutual labels:  snowflake, data-engineering
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-69.44%)
Mutual labels:  etl, data-engineering
Addax
Addax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+327.08%)
Mutual labels:  etl, trino
Data-Engineering-Projects
Personal Data Engineering Projects
Stars: ✭ 167 (+15.97%)
Mutual labels:  data-warehouse, data-engineering
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (-55.56%)
Mutual labels:  etl, data-engineering
etl manager
A Python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-90.28%)
Mutual labels:  etl, data-engineering
neon-workshop
A Pachyderm deep learning tutorial for conference workshops
Stars: ✭ 19 (-86.81%)
Mutual labels:  data-engineering, data-pipelines
Locopy
locopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-49.31%)
Mutual labels:  etl, snowflake
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-45.14%)
Mutual labels:  etl, data-engineering
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1556.25%)
Mutual labels:  etl, data-engineering
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-86.11%)
Mutual labels:  etl, data-engineering
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-12.5%)
Mutual labels:  etl, data-engineering
NBi
NBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (-29.17%)
Mutual labels:  etl, data-quality
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+93.75%)
Mutual labels:  etl, data-engineering
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (+24.31%)
Mutual labels:  etl, data-engineering
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+137.5%)
Mutual labels:  etl, data-engineering
datatile
A library for managing, validating, summarizing, and visualizing data.
Stars: ✭ 419 (+190.97%)
Mutual labels:  dataops, data-quality
soda-spark
Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (-59.72%)
Mutual labels:  data-engineering, data-quality
Great expectations
Always know what to expect from your data.
Stars: ✭ 5,808 (+3933.33%)
Mutual labels:  data-engineering, data-quality
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-45.14%)
Mutual labels:  etl, data-engineering
Yuniql
Free and open source schema versioning and database migration made natively with .NET Core.
Stars: ✭ 156 (+8.33%)
Mutual labels:  snowflake, data-engineering
Sqlpad
Web-based SQL editor run in your own private cloud. Supports MySQL, Postgres, SQL Server, Vertica, Crate, ClickHouse, Trino, Presto, SAP HANA, Cassandra, Snowflake, BigQuery, SQLite, and more with ODBC
Stars: ✭ 4,113 (+2756.25%)
Mutual labels:  snowflake, trino
deordie-meetups
DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.
Stars: ✭ 48 (-66.67%)
Mutual labels:  data-engineering, data-engineer
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-46.53%)
Mutual labels:  etl, data-engineering
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-63.19%)
Mutual labels:  etl, data-engineering
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+339.58%)
Mutual labels:  etl, data-engineering
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-60.42%)
Mutual labels:  etl, data-engineering
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+325%)
Mutual labels:  etl, data-engineering
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+2472.92%)
Mutual labels:  etl, data-engineering
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-77.08%)
Mutual labels:  etl, elt
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-88.89%)
Mutual labels:  etl, snowflake
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-67.36%)
Mutual labels:  etl, data-engineering
awesome-bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 11,093 (+7603.47%)
Mutual labels:  data-warehouse
metamapper
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Stars: ✭ 60 (-58.33%)
Mutual labels:  data-warehouse
choria
Finally, an MMORPG that's all about grinding and doing chores.
Stars: ✭ 19 (-86.81%)
Mutual labels:  sqlite3
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-88.19%)
Mutual labels:  etl
nim-gatabase
Connection-Pooling Compile-Time ORM for Nim
Stars: ✭ 103 (-28.47%)
Mutual labels:  sqlite3
cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (-24.31%)
Mutual labels:  etl
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (-72.22%)
Mutual labels:  etl
exqlite
An SQLite3 driver for Elixir
Stars: ✭ 128 (-11.11%)
Mutual labels:  sqlite3
sqlite-spellfix
Loadable spellfix1 extension for sqlite as python package
Stars: ✭ 13 (-90.97%)
Mutual labels:  sqlite3
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (-77.08%)
Mutual labels:  etl
sqlite-gui
Lightweight SQLite editor for Windows
Stars: ✭ 151 (+4.86%)
Mutual labels:  sqlite3
docker-sqlite3
Sqlite3 command line in a docker container
Stars: ✭ 28 (-80.56%)
Mutual labels:  sqlite3
PDAP-Scrapers
Code relating to scraping public police data.
Stars: ✭ 72 (-50%)
Mutual labels:  etl