All Projects → hamilton → Similar Projects or Alternatives

1326 Open source projects that are alternatives of or similar to hamilton

Mars
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
Stars: ✭ 2,308 (+277.12%)
Mutual labels:  numpy, pandas, dataframe
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (-79.41%)
Mutual labels:  etl, data-engineering, etl-framework
Ditching Excel For Python
Functionalities in Excel translated to Python
Stars: ✭ 172 (-71.9%)
Mutual labels:  numpy, pandas, dataframe
Panthera
Data-frames & arrays on Clojure
Stars: ✭ 168 (-72.55%)
Mutual labels:  numpy, pandas, dataframe
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-96.73%)
Mutual labels:  etl, data-engineering, etl-pipeline
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-93.79%)
Mutual labels:  etl, etl-framework, etl-pipeline
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-89.05%)
Mutual labels:  etl, etl-framework, etl-pipeline
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+289.71%)
Mutual labels:  etl, pandas, data-engineering
pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 970 (+58.5%)
Mutual labels:  pandas, data-engineering, dataframe
saddle
SADDLE: Scala Data Library
Stars: ✭ 23 (-96.24%)
Mutual labels:  numpy, pandas, dataframe
vixtract
www.vixtract.ru
Stars: ✭ 40 (-93.46%)
Mutual labels:  etl, etl-framework, etl-pipeline
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-93.63%)
Mutual labels:  etl, etl-framework, etl-pipeline
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-61.6%)
Mutual labels:  etl, pandas, dataframe
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-97.71%)
Mutual labels:  etl, numpy, pandas
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (-92.81%)
Baby Names Analysis
Data ETL & Analysis on the dataset 'Baby Names from Social Security Card Applications - National Data'.
Stars: ✭ 557 (-8.99%)
Mutual labels:  etl, numpy, pandas
DIRECT
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-96.73%)
Mutual labels:  etl, etl-framework, etl-pipeline
Pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 647 (+5.72%)
Mutual labels:  pandas, data-engineering, dataframe
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-96.08%)
Mutual labels:  etl, etl-framework, etl-pipeline
redis-connect-dist
Real-Time Event Streaming & Change Data Capture
Stars: ✭ 21 (-96.57%)
Mutual labels:  etl, etl-framework, etl-pipeline
Data-Wrangling-with-Python
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (-85.29%)
Mutual labels:  numpy, pandas
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-96.08%)
Mutual labels:  etl, data-engineering
etl manager
A python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-97.71%)
Mutual labels:  etl, data-engineering
Choetl
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (-39.22%)
Mutual labels:  etl, etl-framework
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (-89.54%)
Mutual labels:  etl, data-engineering
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+505.39%)
Mutual labels:  etl, data-engineering
Etlalchemy
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (-24.84%)
Mutual labels:  etl, etl-framework
Datscan
DatScan is an initiative to build an open-source CMS that will have the capability to solve any problem using data Analysis just with the help of various modules and a vast standardized module library
Stars: ✭ 13 (-97.88%)
Mutual labels:  numpy, pandas
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (-89.38%)
Mutual labels:  etl, data-engineering
qwery
A SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-95.42%)
Mutual labels:  etl, etl-framework
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-96.41%)
Mutual labels:  etl, data-engineering
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-41.01%)
Mutual labels:  etl, etl-framework
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (-44.12%)
Mutual labels:  etl, data-engineering
Locopy
locopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-88.07%)
Mutual labels:  etl, pandas
Stetl
Stetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-89.54%)
Mutual labels:  etl, etl-framework
Pyetl
python ETL framework
Stars: ✭ 33 (-94.61%)
Mutual labels:  etl, etl-framework
Getting Started
This repository is a getting started guide to Singer.
Stars: ✭ 734 (+19.93%)
Mutual labels:  etl, etl-framework
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-87.09%)
Mutual labels:  etl, data-engineering
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-87.09%)
Mutual labels:  etl, data-engineering
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-94.93%)
Mutual labels:  numpy, pandas
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-82.52%)
Mutual labels:  etl, dag
Openkettlewebui
一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 125 (-79.58%)
Mutual labels:  etl, etl-framework
carry
Python ETL(Extract-Transform-Load) tool / Data migration tool
Stars: ✭ 115 (-81.21%)
Mutual labels:  etl, pandas
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+3.43%)
Mutual labels:  etl, data-engineering
Hale
(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-86.27%)
Mutual labels:  etl, etl-framework
Transformalize
Configurable Extract, Transform, and Load
Stars: ✭ 125 (-79.58%)
Mutual labels:  etl, etl-framework
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (-76.47%)
Mutual labels:  etl, etl-framework
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (-70.75%)
Mutual labels:  etl, data-engineering
Etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (-66.83%)
Mutual labels:  etl, etl-framework
Bender
Bender - Serverless ETL Framework
Stars: ✭ 171 (-72.06%)
Mutual labels:  etl, etl-framework
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (-60.29%)
Mutual labels:  etl, dag
hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (-93.95%)
Mutual labels:  etl, data-engineering
Engezny
Engezny is a python package that quickly generates all possible charts from your dataframe and saves them for you, and engezny is only supporting now uni-parameter visualization using the pie, bar and barh visualizations.
Stars: ✭ 25 (-95.92%)
Mutual labels:  numpy, pandas
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+703.76%)
Mutual labels:  etl, data-engineering
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-94.77%)
Mutual labels:  etl, etl-framework
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (-54.41%)
Mutual labels:  etl, data-engineering
datascienv
datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (-91.34%)
Mutual labels:  numpy, pandas
seatunnel-example
seatunnel plugin developing examples.
Stars: ✭ 27 (-95.59%)
Mutual labels:  etl-framework, etl-pipeline
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-87.42%)
Mutual labels:  etl, data-engineering
BETL-old
BETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-97.22%)
Mutual labels:  etl, etl-framework
1-60 of 1326 similar projects