TrubkaA CLI tool for Kafka
Stars: ✭ 296 (+957.14%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-42.86%)
Kattlo CliKattlo CLI Project
Stars: ✭ 58 (+107.14%)
RafkaKafka proxy with a simple API, speaking the Redis protocol
Stars: ✭ 49 (+75%)
beekeeperService for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (+53.57%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (+2521.43%)
EtlalchemyExtract, Transform, Load: Any SQL Database in 4 lines of Code.
Stars: ✭ 460 (+1542.86%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (+200%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+1189.29%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (+350%)
TransformalizeConfigurable Extract, Transform, and Load
Stars: ✭ 125 (+346.43%)
Metlmito ETL tool
Stars: ✭ 153 (+446.43%)
InsulatorA client UI to inspect Kafka topics, consume, produce and much more
Stars: ✭ 53 (+89.29%)
Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (+157.14%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+1228.57%)
DataxDataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+314.29%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+2096.43%)
Aws Data WranglerPandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+8417.86%)
dswarman open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+103.57%)
ETL-Starter-Kit📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Stars: ✭ 21 (-25%)
Omniparseromniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (+428.57%)
Etl.netMass processing data with a complete ETL for .net developers
Stars: ✭ 129 (+360.71%)
Csv2dbThe CSV to database command line loader
Stars: ✭ 102 (+264.29%)
Sqswiss-army knife for data
Stars: ✭ 275 (+882.14%)
FlatfilesReads and writes CSV, fixed-length and other flat file formats with a focus on schema definition, configuration and speed.
Stars: ✭ 275 (+882.14%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+2085.71%)
Structured Text ToolsA list of command line tools for manipulating structured text data
Stars: ✭ 6,180 (+21971.43%)
SqlitebiterA CLI tool to convert CSV / Excel / HTML / JSON / Jupyter Notebook / LDJSON / LTSV / Markdown / SQLite / SSV / TSV / Google-Sheets to a SQLite database file.
Stars: ✭ 601 (+2046.43%)
Kafka-quickstartKafka Examples focusing on Producer, Consumer, KStreams, KTable, Global KTable using Spring, Kafka Cluster Setup & Monitoring. Implementing Event Sourcing and CQRS Design Pattern using Kafka
Stars: ✭ 31 (+10.71%)
CsvtkA cross-platform, efficient and practical CSV/TSV toolkit in Golang
Stars: ✭ 566 (+1921.43%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (+246.43%)
Tsv UtilseBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (+4239.29%)
Intellij Csv ValidatorCSV validator, highlighter and formatter plugin for JetBrains Intellij IDEA, PyCharm, WebStorm, ...
Stars: ✭ 198 (+607.14%)
SwiftcsvCSV parser for Swift
Stars: ✭ 511 (+1725%)
Topos🌀 .NET Event Processing library
Stars: ✭ 22 (-21.43%)
vixtractwww.vixtract.ru
Stars: ✭ 40 (+42.86%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (-28.57%)
MillerMiller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Stars: ✭ 4,633 (+16446.43%)
athenadriverA fully-featured AWS Athena database driver (+ athenareader https://github.com/uber/athenadriver/tree/master/athenareader)
Stars: ✭ 116 (+314.29%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-39.29%)
athena-sqliteA SQLite driver for S3 and Amazon Athena 😳
Stars: ✭ 82 (+192.86%)
VroomFast reading of delimited files
Stars: ✭ 462 (+1550%)
dogETLA lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-46.43%)
Pytablewriterpytablewriter is a Python library to write a table in various formats: CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.
Stars: ✭ 422 (+1407.14%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+139.29%)
TILToday I Learned
Stars: ✭ 43 (+53.57%)
astroAstro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+182.14%)
workbooksimple framework for containing spreadsheet like data
Stars: ✭ 13 (-53.57%)
WebServerCloudBackupsAutomatic backups your web projects bases and files to the clouds via WebDAV.
Stars: ✭ 20 (-28.57%)
AvroConvertApache Avro serializer for .NET
Stars: ✭ 44 (+57.14%)
go-csv-tagRead csv file from go using tags
Stars: ✭ 94 (+235.71%)
avro-schema-generatorLibrary for generating avro schema files (.avsc) based on DB tables structure
Stars: ✭ 38 (+35.71%)
s3-syncMigrating S3 Buckets Across AWS Accounts
Stars: ✭ 55 (+96.43%)
s3-serverGeneric S3 server implementation
Stars: ✭ 27 (-3.57%)