All Projects → wrangle → Similar Projects or Alternatives

233 Open source projects that are alternatives of or similar to wrangle

csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+346.67%)
Mutual labels:  etl
NBi
NBi is a testing framework (add-on to NUnit) for Business Intelligence and Data Access. The main goal of this framework is to let users create tests with a declarative approach based on an Xml syntax. By the means of NBi, you don't need to develop C# or Java code to specify your tests! Either, you don't need Visual Studio or Eclipse to compile y…
Stars: ✭ 102 (+580%)
Mutual labels:  etl
proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (+66.67%)
Mutual labels:  etl
rtsp-video-server
RTSP video streaming server implementation based on Live555 and FFmpeg
Stars: ✭ 36 (+140%)
Mutual labels:  resampling
dogETL
A lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (+0%)
Mutual labels:  etl
id3c
Data logistics system enabling real-time pathogen surveillance. Built for the Seattle Flu Study.
Stars: ✭ 21 (+40%)
Mutual labels:  etl
gintonic
A declarative transformation language for GraphQL 🍸
Stars: ✭ 27 (+80%)
Mutual labels:  transformation
thain
Thain is a distributed flow schedule platform.
Stars: ✭ 81 (+440%)
Mutual labels:  etl
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (+86.67%)
Mutual labels:  etl
Data Making Guidelines
📘 Making Data, the DataMade Way
Stars: ✭ 248 (+1553.33%)
Mutual labels:  etl
csv-cruncher
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+113.33%)
Mutual labels:  etl
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+1520%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (+13.33%)
Mutual labels:  etl
Storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+1446.67%)
Mutual labels:  etl
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+413.33%)
Mutual labels:  etl
Elastic
R client for the Elasticsearch HTTP API
Stars: ✭ 227 (+1413.33%)
Mutual labels:  etl
django-calaccess-raw-data
A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+306.67%)
Mutual labels:  etl
Etlbox
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Stars: ✭ 203 (+1253.33%)
Mutual labels:  etl
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (+0%)
Mutual labels:  etl
Extract
A cross-platform command line tool for parallelised content extraction and analysis.
Stars: ✭ 188 (+1153.33%)
Mutual labels:  etl
PNG-Upscale
AI Super - Resolution
Stars: ✭ 116 (+673.33%)
Mutual labels:  resampling
Metl
Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (+1133.33%)
Mutual labels:  etl
dsp
DSP and filtering library
Stars: ✭ 36 (+140%)
Mutual labels:  resampling
Grafter
Linked Data & RDF Manufacturing Tools in Clojure
Stars: ✭ 174 (+1060%)
Mutual labels:  etl
multi-imbalance
Python package for tackling multi-class imbalance problems. http://www.cs.put.poznan.pl/mlango/publications/multiimbalance/
Stars: ✭ 66 (+340%)
Mutual labels:  resampling
Bender
Bender - Serverless ETL Framework
Stars: ✭ 171 (+1040%)
Mutual labels:  etl
dot
distributed data sync with operational transformation/transforms
Stars: ✭ 73 (+386.67%)
Mutual labels:  transformation
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+32693.33%)
Mutual labels:  etl
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+153.33%)
Mutual labels:  etl
Open Semantic Etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (+1000%)
Mutual labels:  etl
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+406.67%)
Mutual labels:  etl
Mara Example Project 2
An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+926.67%)
Mutual labels:  etl
cygrid
Cygrid is a cython-powered convolution-based gridding module for astronomy
Stars: ✭ 32 (+113.33%)
Mutual labels:  resampling
Omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (+886.67%)
Mutual labels:  etl
iex-stocks
ETL for the IEX Stocks API
Stars: ✭ 19 (+26.67%)
Mutual labels:  etl
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+833.33%)
Mutual labels:  etl
DQCS
数据质量控制系统
Stars: ✭ 34 (+126.67%)
Mutual labels:  etl
Kettle Web
基于spring boot通过java代码调用kette
Stars: ✭ 128 (+753.33%)
Mutual labels:  etl
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+633.33%)
Mutual labels:  etl
Etl.net
Mass processing data with a complete ETL for .net developers
Stars: ✭ 129 (+760%)
Mutual labels:  etl
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+280%)
Mutual labels:  etl
Transformalize
Configurable Extract, Transform, and Load
Stars: ✭ 125 (+733.33%)
Mutual labels:  etl
stargan2
StarGAN2 for practice
Stars: ✭ 89 (+493.33%)
Mutual labels:  transformation
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+15800%)
Mutual labels:  etl
butterfly
Application transformation tool
Stars: ✭ 35 (+133.33%)
Mutual labels:  transformation
Kiba
Data processing & ETL framework for Ruby
Stars: ✭ 1,618 (+10686.67%)
Mutual labels:  etl
datawizard
Magic potions to clean and transform your data 🧙
Stars: ✭ 149 (+893.33%)
Mutual labels:  wrangling
Datax
DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+673.33%)
Mutual labels:  etl
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (+6.67%)
Mutual labels:  etl
Aws Ecs Airflow
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks
Stars: ✭ 107 (+613.33%)
Mutual labels:  etl
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (+120%)
Mutual labels:  etl
Csv2db
The CSV to database command line loader
Stars: ✭ 102 (+580%)
Mutual labels:  etl
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-6.67%)
Mutual labels:  etl
DataX-src
DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase、OTS、ODPS 等各种异构数据源之间高效的数据同步功能。
Stars: ✭ 21 (+40%)
Mutual labels:  etl
mik
The Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (+113.33%)
Mutual labels:  etl
modeltime.resample
Resampling Tools for Time Series Forecasting with Modeltime
Stars: ✭ 12 (-20%)
Mutual labels:  resampling
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+120%)
Mutual labels:  etl
OpenKettleWebUI
一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (+820%)
Mutual labels:  etl
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+253.33%)
Mutual labels:  etl
dbt-databricks
A dbt adapter for Databricks.
Stars: ✭ 115 (+666.67%)
Mutual labels:  etl
61-120 of 233 similar projects