watchman-processorFolder synchronization tool with a simple dashboard
Stars: ✭ 38 (+137.5%)
Rudder ServerPrivacy and Security focused Segment-alternative, in Golang and React
Stars: ✭ 2,874 (+17862.5%)
HudiUpserts, Deletes And Incremental Processing on Big Data.
Stars: ✭ 2,586 (+16062.5%)
Awesome Single CellCommunity-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
Stars: ✭ 1,937 (+12006.25%)
scarchesReference mapping for single-cell genomics
Stars: ✭ 175 (+993.75%)
data-product-batchTemplate to deploy a Data Product for Batch data processing into a Data Landing Zone of the Data Management & Analytics Scenario (former Enterprise-Scale Analytics). The Data Product template can be used by cross-functional teams to ingest, provide and create new data assets within the platform.
Stars: ✭ 27 (+68.75%)
thymeflowInstaller for Thymeflow, a personal knowledge management system.
Stars: ✭ 27 (+68.75%)
kuwalaKuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+2862.5%)
SDM-RDFizerAn Efficient RML-Compliant Engine for Knowledge Graph Construction
Stars: ✭ 68 (+325%)
RonRusty Object Notation
Stars: ✭ 1,834 (+11362.5%)
envfileParse and write environment files with Node.js
Stars: ✭ 42 (+162.5%)
specJust Data. Save up to 85% network bandwidth and storage.
Stars: ✭ 86 (+437.5%)
LogstashLogstash - transport and process your logs, events, or other data
Stars: ✭ 12,543 (+78293.75%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+11500%)
Dig Etl EngineDownload DIG to run on your laptop or server.
Stars: ✭ 81 (+406.25%)
KgtkKnowledge Graph Toolkit
Stars: ✭ 81 (+406.25%)
GlobalbioticinteractionsGlobal Biotic Interactions provides access to existing species interaction datasets
Stars: ✭ 71 (+343.75%)
AlchemistA realtime ETL engine
Stars: ✭ 40 (+150%)
Goodreads etl pipelineAn end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Stars: ✭ 793 (+4856.25%)
NofloFlow-based programming for JavaScript
Stars: ✭ 3,202 (+19912.5%)
hotsubCommand line tool to run batch jobs concurrently with ETL framework on AWS or other cloud computing resources
Stars: ✭ 29 (+81.25%)
ETL-Starter-Kit📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Stars: ✭ 21 (+31.25%)
FlinkxBased on Apache Flink. support data synchronization/integration and streaming SQL computation.
Stars: ✭ 2,651 (+16468.75%)
bigbrother-specsResearch and specification for Big Brother protocol
Stars: ✭ 13 (-18.75%)
laravel-data-syncLaravel utility to keep records synced between enviroments through source control
Stars: ✭ 33 (+106.25%)
mytosis🔀 A peer-to-peer data sync framework
Stars: ✭ 19 (+18.75%)