Luigi WarehouseA luigi powered analytics / warehouse stack
Stars: ✭ 72 (-68.42%)
BenderBender - Serverless ETL Framework
Stars: ✭ 171 (-25%)
TransporterSync data between persistence engines, like ETL only not stodgy
Stars: ✭ 1,175 (+415.35%)
Etl.netMass processing data with a complete ETL for .net developers
Stars: ✭ 129 (-43.42%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (-14.04%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-71.93%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (-44.74%)
Kiba PlusKiba enhancement for Ruby ETL.
Stars: ✭ 47 (-79.39%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+2057.46%)
Ether sqlA python library to push ethereum blockchain data into an sql database.
Stars: ✭ 41 (-82.02%)
Openkettlewebui一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 125 (-45.18%)
ConfigsPublic, free to use, repository with diggers configs for scraping / extracting data from various e-commerce websites and online stores
Stars: ✭ 37 (-83.77%)
Pyetlpython ETL framework
Stars: ✭ 33 (-85.53%)
RikoA Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (+589.04%)
Crafter🔬 An R package to work with PCAPs
Stars: ✭ 27 (-88.16%)
Open Semantic EtlPython based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (-27.63%)
Yunmai Data ExtractExtract your data from the Yunmai weighing scales cloud API so you can use it elsewhere
Stars: ✭ 21 (-90.79%)
Tcpdumpthe TCPdump network dissector
Stars: ✭ 1,731 (+659.21%)
PantherDetect threats with log data and improve cloud security posture
Stars: ✭ 885 (+288.16%)
ExtractA cross-platform command line tool for parallelised content extraction and analysis.
Stars: ✭ 188 (-17.54%)
Tuna🐟 A streaming ETL for fish
Stars: ✭ 11 (-95.18%)
DataxDataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (-49.12%)
Mara Example Project 2An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (-32.46%)
DataformDataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+50%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-91.67%)
Aws Ecs AirflowRun Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (-53.07%)
WindowsspyblockerWindowsSpyBlocker 🛡️ is an application written in Go and delivered as
a single executable to block spying and
tracking on Windows systems.
Stars: ✭ 2,913 (+1177.63%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (+221.93%)
Qqwry2mmdb为 Wireshark 能使用纯真网络 IP 数据库(QQwry)而提供的格式转换工具
Stars: ✭ 105 (-53.95%)
React CsvReact components to build CSV files on the fly basing on Array/literal object of data
Stars: ✭ 732 (+221.05%)
Metlmito ETL tool
Stars: ✭ 153 (-32.89%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+177.63%)
Csv2dbThe CSV to database command line loader
Stars: ✭ 102 (-55.26%)
Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (+169.74%)
MetlMetl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (-18.86%)
Ananas DesktopA hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
Stars: ✭ 551 (+141.67%)
OdČeská otevřená data
Stars: ✭ 99 (-56.58%)
OstinatoOstinato - Packet/Traffic Generator and Analyzer
Stars: ✭ 513 (+125%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-36.84%)
Koop🔮 Transform, query, and download geospatial data on the web.
Stars: ✭ 505 (+121.49%)
Open Data Etl Utility KitUse Pentaho's open source data integration tool (Kettle) to create Extract-Transform-Load (ETL) processes to update a Socrata open data portal. Documentation is available at http://open-data-etl-utility-kit.readthedocs.io/en/stable
Stars: ✭ 93 (-59.21%)
SmartcodeSmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!!
Stars: ✭ 464 (+103.51%)
EtlboxA lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (-10.96%)
PglogicalLogical Replication extension for PostgreSQL 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
Stars: ✭ 455 (+99.56%)
DaggyDaggy - Data Aggregation Utility. Open source, free, cross-platform, server-less, useful utility for remote or local data aggregation and streaming
Stars: ✭ 91 (-60.09%)
DatacleanerThe premier open source Data Quality solution
Stars: ✭ 391 (+71.49%)
SnoopyA highly configurable multi-threaded packet sniffer and parser build in rust-lang.
Stars: ✭ 138 (-39.47%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+63.16%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+707.46%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-65.35%)
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+63.16%)
GrafterLinked Data & RDF Manufacturing Tools in Clojure
Stars: ✭ 174 (-23.68%)
SaynData processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-65.35%)
Hale(Spatial) data harmonisation with hale studio (formerly HUMBOLDT Alignment Editor)
Stars: ✭ 84 (-63.16%)
ElasticR client for the Elasticsearch HTTP API
Stars: ✭ 227 (-0.44%)
Bulk WriterProvides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.
Stars: ✭ 210 (-7.89%)
Sniff ProbesPlug-and-play bash script for sniffing 802.11 probes requests 👃
Stars: ✭ 200 (-12.28%)
Bitcoin EtlETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (-23.68%)