openrefine-batchShell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+375%)
fdtd3dfdtd3d is an open source 1D, 2D, 3D FDTD electromagnetics solver with MPI, OpenMP and CUDA support for x86, arm, arm64 architectures
Stars: ✭ 77 (+381.25%)
awesome-integrationA curated list of awesome system integration software and resources.
Stars: ✭ 117 (+631.25%)
connorA commandline tool for resetting Kafka Connect source connector offsets.
Stars: ✭ 17 (+6.25%)
neo4j-jdbcJDBC driver for Neo4j
Stars: ✭ 110 (+587.5%)
covid-19Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-12.5%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+100%)
cobrixA COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+581.25%)
kafka-jdbc-connectorSimple way to copy data from relational databases into kafka.
Stars: ✭ 19 (+18.75%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+318.75%)
etl[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+1643.75%)
persistityA persistence framework for game developers
Stars: ✭ 34 (+112.5%)
chronicle-etl📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (+387.5%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (+50%)
DIRECTDIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics framework that can be used to monitor, log, audit and control data integration / ETL processes.
Stars: ✭ 20 (+25%)
kafka-connect-httpKafka Connect connector that enables Change Data Capture from JSON/HTTP APIs into Kafka.
Stars: ✭ 81 (+406.25%)
NBiNBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (+537.5%)
django-calaccess-raw-dataA Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+281.25%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (+243.75%)
mikThe Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (+100%)
thainThain is a distributed flow schedule platform.
Stars: ✭ 81 (+406.25%)
Aws Etl OrchestratorA serverless architecture for orchestrating ETL jobs in arbitrarily-complex workflows using AWS Step Functions and AWS Lambda.
Stars: ✭ 245 (+1431.25%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+256.25%)
ElandPython Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (+1368.75%)
Etl2pcapngUtility that converts an .etl file containing a Windows network packet capture into .pcapng format.
Stars: ✭ 228 (+1325%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+231.25%)
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+800%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (+800%)
CqlCategorical Query Language IDE
Stars: ✭ 196 (+1125%)
proc-thatproc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (+56.25%)
Mongo EsA MongoDB to Elasticsearch connector
Stars: ✭ 185 (+1056.25%)
gallia-coreA schema-aware Scala library for data transformation
Stars: ✭ 44 (+175%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+3725%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (+275%)
Linq2dbLinq to database provider.
Stars: ✭ 2,211 (+13718.75%)
csv-cruncherTreats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+100%)
Usaspending ApiServer application to serve U.S. federal spending data via a RESTful API
Stars: ✭ 166 (+937.5%)
dogETLA lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-6.25%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+381.25%)
Metlmito ETL tool
Stars: ✭ 153 (+856.25%)
PDAP-ScrapersCode relating to scraping public police data.
Stars: ✭ 72 (+350%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (+6.25%)
google-sheets-etlLive import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-6.25%)
kafka-junitEnables you to start and stop a fully-fledged embedded Kafka cluster from within JUnit and provides a rich set of convenient accessors and fault injectors through a lean API. Supports working against external clusters as well.
Stars: ✭ 38 (+137.5%)
cassandra.realtimeDifferent ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (+56.25%)