720 Open source projects that are alternatives of or similar to mydataharbor

sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-39.29%)
Mutual labels:  pipeline, etl
dogETL
A lib to transform data between JDBC, CSV, and JSON.
Stars: ✭ 15 (-46.43%)
Mutual labels:  etl, jdbc
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-42.86%)
Mutual labels:  etl, data-sync
Go Streams
A lightweight stream processing library for Go
Stars: ✭ 615 (+2096.43%)
Mutual labels:  pipeline, etl
basin
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-10.71%)
Mutual labels:  pipeline, etl
Stetl
Stetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (+128.57%)
Mutual labels:  pipeline, etl
Phila Airflow
Stars: ✭ 16 (-42.86%)
Mutual labels:  pipeline, etl
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+292.86%)
Mutual labels:  etl, jdbc
Metl
mito ETL tool
Stars: ✭ 153 (+446.43%)
Mutual labels:  pipeline, etl
etl
M-Lab ingestion pipeline
Stars: ✭ 15 (-46.43%)
Mutual labels:  pipeline, etl
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+182.14%)
Mutual labels:  pipeline, etl
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+17467.86%)
Mutual labels:  pipeline, etl
maxwell-sink
Consumes Maxwell-generated messages from Kafka and exports them to another MySQL instance.
Stars: ✭ 16 (-42.86%)
Mutual labels:  etl, data-sync
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-42.86%)
Mutual labels:  pipeline, etl
Mara Pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+6475%)
Mutual labels:  pipeline, etl
Datavec
ETL Library for Machine Learning - data pipelines, data munging and wrangling
Stars: ✭ 272 (+871.43%)
Mutual labels:  pipeline, etl
Bulk Writer
Provides guidance for fast ETL jobs, an IDataReader implementation for SqlBulkCopy (or the MySql or Oracle equivalents) that wraps an IEnumerable, and libraries for mapping entities to table columns.
Stars: ✭ 210 (+650%)
Mutual labels:  pipeline, etl
naas
⚙️ Schedule notebooks, run them like APIs, securely expose your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+682.14%)
Mutual labels:  pipeline, etl
itstack-naive-chat-server
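💞 Server side | The server also uses Netty 4.x as its socket communication framework and Layui for the admin console pages. The server combines a DDD (domain-driven design) oriented approach with Netty to keep the framework structure clean and easy to extend. The communication protocol is also defined on the server side and published as an importable JAR, so that client and server communicate under a shared protocol.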
Stars: ✭ 21 (-25%)
Mutual labels:  jdbc
httpit
A rapid http(s) benchmark tool written in Go
Stars: ✭ 156 (+457.14%)
Mutual labels:  pipeline
html-pipeline
HTML processing filters and utilities in Go version
Stars: ✭ 18 (-35.71%)
Mutual labels:  pipeline
jenkins-terraform-pipeline
Create a Jenkins pipeline that uses Terraform to manage AWS resources
Stars: ✭ 17 (-39.29%)
Mutual labels:  pipeline
classification
Catalyst.Classification
Stars: ✭ 35 (+25%)
Mutual labels:  pipeline
TDAstats
R pipeline for computing persistent homology in topological data analysis. See https://doi.org/10.21105/joss.00860 for more details.
Stars: ✭ 26 (-7.14%)
Mutual labels:  pipeline
database-metadata-bind
A library for binding information from java.sql.DatabaseMetadata
Stars: ✭ 17 (-39.29%)
Mutual labels:  jdbc
rails-docker-parallel-example
An example of how to run Rails CI and test steps in parallel with Docker and Buildkite
Stars: ✭ 19 (-32.14%)
Mutual labels:  pipeline
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+35.71%)
Mutual labels:  etl
sqlite-jna
Java wrapper and Jdbc driver for SQLite using JNA or Bridj or JNR or JNI or JavaCPP.
Stars: ✭ 20 (-28.57%)
Mutual labels:  jdbc
howtheydevops
A curated collection of publicly available resources on how companies around the world practice DevOps
Stars: ✭ 318 (+1035.71%)
Mutual labels:  pipeline
lines
A pure bash clojureish CI pipeline
Stars: ✭ 72 (+157.14%)
Mutual labels:  pipeline
SeqTools
A python library to manipulate and transform indexable data (lists, arrays, ...)
Stars: ✭ 42 (+50%)
Mutual labels:  pipeline
LabPype
Framework for Creating Pipeline Software
Stars: ✭ 18 (-35.71%)
Mutual labels:  pipeline
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-42.86%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-39.29%)
Mutual labels:  etl
tekniq
A framework designed around Kotlin providing Restful HTTP Client, JDBC DSL, Loading Cache, Configurations, Validations, and more
Stars: ✭ 31 (+10.71%)
Mutual labels:  jdbc
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (+42.86%)
Mutual labels:  etl
cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+289.29%)
Mutual labels:  etl
MIPS-pipeline-processor
A pipelined implementation of the MIPS processor featuring hazard detection as well as forwarding
Stars: ✭ 92 (+228.57%)
Mutual labels:  pipeline
flow-platform-x
Continuous Integration Platform
Stars: ✭ 21 (-25%)
Mutual labels:  pipeline
HiveJdbcStorageHandler
No description or website provided.
Stars: ✭ 21 (-25%)
Mutual labels:  jdbc
nanoflow
🔬 De novo assembly of nanopore reads using nextflow
Stars: ✭ 20 (-28.57%)
Mutual labels:  pipeline
Apos.Content
Content builder library for MonoGame.
Stars: ✭ 14 (-50%)
Mutual labels:  pipeline
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+17.86%)
Mutual labels:  etl
oracle-jdbc-tester
A simple command line Java application to test JDBC connection to Oracle database
Stars: ✭ 37 (+32.14%)
Mutual labels:  jdbc
EF-Migrations-Script-Generator-Task
No description or website provided.
Stars: ✭ 20 (-28.57%)
Mutual labels:  pipeline
predict-fraud-using-auto-ai
Use AutoAI to detect fraud
Stars: ✭ 27 (-3.57%)
Mutual labels:  pipeline
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-25%)
Mutual labels:  etl
komapper
Kotlin SQL Mapper
Stars: ✭ 28 (+0%)
Mutual labels:  jdbc
hyperdrive
Extensible streaming ingestion pipeline on top of Apache Spark
Stars: ✭ 31 (+10.71%)
Mutual labels:  pipeline
Ramses
The Rx Asset Management System for motion picture production
Stars: ✭ 48 (+71.43%)
Mutual labels:  pipeline
bitbucket-push-and-pull-request-plugin
Plugin for Jenkins v2.138.2 or later, that triggers job builds on Bitbucket's push and pull request events.
Stars: ✭ 47 (+67.86%)
Mutual labels:  pipeline
smag
Show Me A Graph - Command Line Graphing
Stars: ✭ 78 (+178.57%)
Mutual labels:  pipeline
PDAP-Scrapers
Code relating to scraping public police data.
Stars: ✭ 72 (+157.14%)
Mutual labels:  etl
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-14.29%)
Mutual labels:  etl
hive-jdbc-driver
An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (+10.71%)
Mutual labels:  jdbc
implyr
SQL backend to dplyr for Impala
Stars: ✭ 74 (+164.29%)
Mutual labels:  jdbc
lightflow
A lightweight, distributed workflow system
Stars: ✭ 67 (+139.29%)
Mutual labels:  pipeline
pipen
pipen - A pipeline framework for python
Stars: ✭ 82 (+192.86%)
Mutual labels:  pipeline
JDBCManager
A small tool for working with databases
Stars: ✭ 13 (-53.57%)
Mutual labels:  jdbc
TOGGLE
Toolbox for generic NGS analyses - A framework to quickly build pipelines and to perform large-scale NGS analysis
Stars: ✭ 18 (-35.71%)
Mutual labels:  pipeline