sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-39.29%)
dogETLA lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-46.43%)
DataBridge.NETConfigurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-42.86%)
Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (+2096.43%)
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-10.71%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (+128.57%)
neo4j-jdbcJDBC driver for Neo4j
Stars: ✭ 110 (+292.86%)
Metlmito ETL tool
Stars: ✭ 153 (+446.43%)
etlM-Lab ingestion pipeline
Stars: ✭ 15 (-46.43%)
SetlA simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+182.14%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+17467.86%)
maxwell-sinkconsume maxwell generated message from kafka,export it to another mysql.
Stars: ✭ 16 (-42.86%)
lineageGenerate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-42.86%)
Mara PipelinesA lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+6475%)
DatavecETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (+871.43%)
Bulk WriterProvides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entites to table columns.
Stars: ✭ 210 (+650%)
naas⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+682.14%)
itstack-naive-chat-server💞 《服务端》| 服务端同样使用Netty4.x作为socket的通信框架,同时在服务端使用Layui作为管理后台的页面,并且我们的服务端采用偏向于DDD领域驱动设计的方式与Netty集合,以此来达到我们的框架结构整洁干净易于扩展。同时我们的通信协议也是在服务端进行定义的,并对外提供可引入的Jar包,这样来保证客户端与服务端共同协议下进行通信。
Stars: ✭ 21 (-25%)
httpitA rapid http(s) benchmark tool written in Go
Stars: ✭ 156 (+457.14%)
html-pipelineHTML processing filters and utilities in Go version
Stars: ✭ 18 (-35.71%)
TDAstatsR pipeline for computing persistent homology in topological data analysis. See https://doi.org/10.21105/joss.00860 for more details.
Stars: ✭ 26 (-7.14%)
etlflowEtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+35.71%)
sqlite-jnaJava wrapper and Jdbc driver for SQLite using JNA or Bridj or JNR or JNI or JavaCPP.
Stars: ✭ 20 (-28.57%)
howtheydevopsA curated collection of publicly available resources on how companies around the world practice DevOps
Stars: ✭ 318 (+1035.71%)
linesA pure bash clojureish CI pipeline
Stars: ✭ 72 (+157.14%)
SeqToolsA python library to manipulate and transform indexable data (lists, arrays, ...)
Stars: ✭ 42 (+50%)
LabPypeFramework for Creating Pipeline Software
Stars: ✭ 18 (-35.71%)
go-bqloaderbqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-42.86%)
tekniqA framework designed around Kotlin providing Restful HTTP Client, JDBC DSL, Loading Cache, Configurations, Validations, and more
Stars: ✭ 31 (+10.71%)
cobrixA COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+289.29%)
MIPS-pipeline-processorA pipelined implementation of the MIPS processor featuring hazard detection as well as forwarding
Stars: ✭ 92 (+228.57%)
nanoflow🔬 De novo assembly of nanopore reads using nextflow
Stars: ✭ 20 (-28.57%)
Apos.ContentContent builder library for MonoGame.
Stars: ✭ 14 (-50%)
singer-runnerA CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+17.86%)
oracle-jdbc-testerA simple command line Java application to test JDBC connection to Oracle database
Stars: ✭ 37 (+32.14%)
cubetlCubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-25%)
komapperKotlin SQL Mapper
Stars: ✭ 28 (+0%)
hyperdriveExtensible streaming ingestion pipeline on top of Apache Spark
Stars: ✭ 31 (+10.71%)
RamsesThe Rx Asset Management System for motion picture production
Stars: ✭ 48 (+71.43%)
smagShow Me A Graph - Command Line Graphing
Stars: ✭ 78 (+178.57%)
PDAP-ScrapersCode relating to scraping public police data.
Stars: ✭ 72 (+157.14%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-14.29%)
hive-jdbc-driverAn alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (+10.71%)
implyrSQL backend to dplyr for Impala
Stars: ✭ 74 (+164.29%)
lightflowA lightweight, distributed workflow system
Stars: ✭ 67 (+139.29%)
pipenpipen - A pipeline framework for python
Stars: ✭ 82 (+192.86%)
TOGGLEToolbox for generic NGS analyses - A framework to quickly build pipelines and to perform large-scale NGS analysis
Stars: ✭ 18 (-35.71%)