279 open source projects that are alternatives to or similar to maxwell-sink

kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+68.75%)
Mutual labels:  etl, kafka-connect
mydataharbor
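🇨🇳 MyDataHarbor is distributed, highly scalable, high-performance, transaction-level data synchronization middleware dedicated to syncing data from any data source to any other. It helps users reliably, quickly, and stably run near-real-time incremental synchronization or scheduled full synchronization of massive data volumes. It is aimed primarily at real-time transaction systems and can also be used for big-data synchronization (the ETL domain).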
Stars: ✭ 28 (+75%)
Mutual labels:  etl, data-sync
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (+6.25%)
Mutual labels:  etl, kafka-connect
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (+0%)
Mutual labels:  etl, data-sync
Kafka Connect
equivalent to kafka-connect 🔧 for nodejs ✨🐢🚀✨
Stars: ✭ 102 (+537.5%)
Mutual labels:  etl, kafka-connect
kafka-connect-jenkins
Kafka Connect Connector for Jenkins Open Source Continuous Integration Tool
Stars: ✭ 29 (+81.25%)
Mutual labels:  kafka-connect
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (+31.25%)
Mutual labels:  etl
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (+56.25%)
Mutual labels:  etl
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+193.75%)
Mutual labels:  etl
dflib
In-memory Java DataFrame library
Stars: ✭ 50 (+212.5%)
Mutual labels:  etl
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+137.5%)
Mutual labels:  etl
sync-engine-example
Synchronization Algorithm Exploration: Techniques to synchronize a SQL database with external destinations.
Stars: ✭ 17 (+6.25%)
Mutual labels:  etl
nasdaq-symbols
ETL for the NASDAQ symbol file
Stars: ✭ 13 (-18.75%)
Mutual labels:  etl
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (+0%)
Mutual labels:  etl
OpenKettleWebUI
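A web-based scheduling and control platform for data processing, built on Kettle. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.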
Stars: ✭ 138 (+762.5%)
Mutual labels:  etl
web-click-flow
Offline log analysis of website clickstreams
Stars: ✭ 14 (-12.5%)
Mutual labels:  etl
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (+75%)
Mutual labels:  etl
wrangle
A data transformation package for deep learning with Autonomio, Keras and TensorFlow.
Stars: ✭ 15 (-6.25%)
Mutual labels:  etl
flock
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+1350%)
Mutual labels:  etl
dswarm
an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+256.25%)
Mutual labels:  etl
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (+0%)
Mutual labels:  etl
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+231.25%)
Mutual labels:  etl
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+143.75%)
Mutual labels:  etl
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+3993.75%)
Mutual labels:  etl
kafka-connect-examples
Kafka Connect Examples
Stars: ✭ 36 (+125%)
Mutual labels:  kafka-connect
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+106.25%)
Mutual labels:  etl
zdh web
Big data collection and extraction platform
Stars: ✭ 292 (+1725%)
Mutual labels:  etl
kafkacli
CLI and Go Clients to manage Kafka components (Kafka Connect & SchemaRegistry)
Stars: ✭ 28 (+75%)
Mutual labels:  kafka-connect
fdtd3d
fdtd3d is an open source 1D, 2D, 3D FDTD electromagnetics solver with MPI, OpenMP and CUDA support for x86, arm, arm64 architectures
Stars: ✭ 77 (+381.25%)
Mutual labels:  maxwell
connor
A commandline tool for resetting Kafka Connect source connector offsets.
Stars: ✭ 17 (+6.25%)
Mutual labels:  kafka-connect
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-12.5%)
Mutual labels:  etl
cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+581.25%)
Mutual labels:  etl
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+318.75%)
Mutual labels:  etl
persistity
A persistence framework for game developers
Stars: ✭ 34 (+112.5%)
Mutual labels:  etl
MongoDb-Sink-Connector
Kafka MongoDb sink connector
Stars: ✭ 19 (+18.75%)
Mutual labels:  kafka-connect
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (+50%)
Mutual labels:  etl
kafka-connect-http
Kafka Connect connector that enables Change Data Capture from JSON/HTTP APIs into Kafka.
Stars: ✭ 81 (+406.25%)
Mutual labels:  kafka-connect
oic-options-chains
ETL for OIC Options Chains
Stars: ✭ 22 (+37.5%)
Mutual labels:  etl
django-calaccess-raw-data
A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+281.25%)
Mutual labels:  etl
mik
The Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservation systems).
Stars: ✭ 32 (+100%)
Mutual labels:  etl
kafka-connect-cosmosdb
Kafka Connect connectors for Azure Cosmos DB
Stars: ✭ 28 (+75%)
Mutual labels:  kafka-connect
redis-connect-dist
Real-Time Event Streaming & Change Data Capture
Stars: ✭ 21 (+31.25%)
Mutual labels:  etl
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+256.25%)
Mutual labels:  etl
kafka-connect-arangodb
🥑 Kafka connect sink connector for ArangoDB
Stars: ✭ 22 (+37.5%)
Mutual labels:  kafka-connect
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+800%)
Mutual labels:  etl
django-data-migration
Data migration framework for Django that migrates legacy data into your new django app
Stars: ✭ 18 (+12.5%)
Mutual labels:  etl
proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (+56.25%)
Mutual labels:  etl
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (+150%)
Mutual labels:  etl
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+175%)
Mutual labels:  etl
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (+12.5%)
Mutual labels:  etl
naas
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+1268.75%)
Mutual labels:  etl
csv-cruncher
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+100%)
Mutual labels:  etl
dogETL
A library to convert data between JDBC, CSV, and JSON.
Stars: ✭ 15 (-6.25%)
Mutual labels:  etl
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+381.25%)
Mutual labels:  etl
PDAP-Scrapers
Code relating to scraping public police data.
Stars: ✭ 72 (+350%)
Mutual labels:  etl
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-6.25%)
Mutual labels:  etl
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+3725%)
Mutual labels:  etl
scylla-cdc-source-connector
A Kafka source connector capturing Scylla CDC changes
Stars: ✭ 19 (+18.75%)
Mutual labels:  kafka-connect
ruby-for-pentaho-kettle
Ruby scripting for pentaho-kettle
Stars: ✭ 42 (+162.5%)
Mutual labels:  etl
kafka-junit
Enables you to start and stop a fully-fledged embedded Kafka cluster from within JUnit and provides a rich set of convenient accessors and fault injectors through a lean API. Supports working against external clusters as well.
Stars: ✭ 38 (+137.5%)
Mutual labels:  kafka-connect
1-60 of 279 similar projects