proc-thatproc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (-77.06%)
coaxTools for connecting to real IBM 3270 type terminals
Stars: ✭ 29 (-73.39%)
python mozetlETL jobs for Firefox Telemetry
Stars: ✭ 25 (-77.06%)
FlowMasterETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (-82.57%)
covid-19Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-87.16%)
csv-cruncherTreats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (-70.64%)
flowgraphFlowgraph package for scalable asynchronous system development
Stars: ✭ 51 (-53.21%)
hyperionThe SoftDevLabs (SDL) version of the Hercules 4.x Hyperion System/370, ESA/390, and z/Architecture Emulator
Stars: ✭ 149 (+36.7%)
django-calaccess-raw-dataA Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (-44.04%)
blockchain-etl-streamingStreaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (-47.71%)
neo4j-jdbcJDBC driver for Neo4j
Stars: ✭ 110 (+0.92%)
DQCS数据质量控制系统
Stars: ✭ 34 (-68.81%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-64.22%)
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+500.92%)
OpenKettleWebUI一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (+26.61%)
morph-kgcPowerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (-29.36%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-65.14%)
YaEtlYet Another ETL in PHP
Stars: ✭ 60 (-44.95%)
CVparserCVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (-74.31%)
zdh server数据采集平台zdh,etl 处理服务
Stars: ✭ 53 (-51.38%)
echo-serverEcho Server is a Docker-ready, multi-scalable Node.js application used to host your own Socket.IO server for Laravel Broadcasting.
Stars: ✭ 32 (-70.64%)
awesome-integrationA curated list of awesome system integration software and resources.
Stars: ✭ 117 (+7.34%)
jsberryJSBerry is open source modular simple architecture for building Node.js applications.
Stars: ✭ 85 (-22.02%)
sync-engine-exampleSynchronization Algorithm Exploration: Techniques to synchronize a SQL database with external destinations.
Stars: ✭ 17 (-84.4%)
link-moveA model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (-70.64%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+461.47%)
starlakeStarlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-85.32%)
polygon-etlETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-51.38%)
harikaOffline-, mobile-first graph note-taking app focused on performance with the knowledgebase of any scale
Stars: ✭ 111 (+1.83%)
mikThe Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (-70.64%)
XH5ForXDMF parallel partitioned mesh I/O on top of HDF5
Stars: ✭ 23 (-78.9%)
assimilation-officialThis is the official main repository for the Assimilation project
Stars: ✭ 47 (-56.88%)
zdh web大数据采集,抽取平台
Stars: ✭ 292 (+167.89%)
singer-runnerA CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (-69.72%)
django-data-migrationData migration framework for Django that migrates legacy data into your new django app
Stars: ✭ 18 (-83.49%)
csvpluscsvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (-38.53%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+100.92%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-77.98%)
google-sheets-etlLive import all your Google Sheets to your data warehouse
Stars: ✭ 15 (-86.24%)
serverless-scaleway-functionsPlugin for Serverless Framework to allow users to deploy their serverless applications on Scaleway Functions
Stars: ✭ 58 (-46.79%)
BETL-oldBETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (-84.4%)
dogETLA lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-86.24%)
dropA LÖVE visualizer and music player
Stars: ✭ 17 (-84.4%)
uptasticsearchAn Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-56.88%)
openrefine-batchShell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (-30.28%)
minirocketMINIROCKET: A Very Fast (Almost) Deterministic Transform for Time Series Classification
Stars: ✭ 166 (+52.29%)
iex-stocksETL for the IEX Stocks API
Stars: ✭ 19 (-82.57%)
flockFlock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+112.84%)
scrSCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability for MPI codes.
Stars: ✭ 84 (-22.94%)
tomodachi💻 Microservice library / framework using Python's asyncio event loop with full support for HTTP + WebSockets, AWS SNS+SQS, RabbitMQ / AMQP, middleware, etc. Extendable for GraphQL, protobuf, gRPC, among other technologies.
Stars: ✭ 170 (+55.96%)
oecIBM 3270 terminal controller - a replacement for the IBM 3174
Stars: ✭ 29 (-73.39%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-80.73%)
wrangleA data transformation package for deep learning with Autonomio, Keras and TensorFlow.
Stars: ✭ 15 (-86.24%)
sql-to-redis🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (-83.49%)
gordoAn API-first distributed deployment system of deep learning models using timeseries data to predict the behaviour of systems
Stars: ✭ 25 (-77.06%)