233 open source projects that are alternatives to or similar to wrangle

django-data-migration
Data migration framework for Django that migrates legacy data into your new django app
Stars: ✭ 18 (+20%)
Mutual labels:  etl
zdh server
zdh data collection platform, ETL processing service
Stars: ✭ 53 (+253.33%)
Mutual labels:  etl
A-Hierarchical-Transformation-Discriminating-Generative-Model-for-Few-Shot-Anomaly-Detection
Official pytorch implementation of the paper: "A Hierarchical Transformation-Discriminating Generative Model for Few Shot Anomaly Detection"
Stars: ✭ 42 (+180%)
Mutual labels:  transformation
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+4266.67%)
Mutual labels:  etl
link-move
A model-driven dynamically-configurable framework to acquire data from external sources and save it to your database.
Stars: ✭ 32 (+113.33%)
Mutual labels:  etl
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+213.33%)
Mutual labels:  etl
YaEtl
Yet Another ETL in PHP
Stars: ✭ 60 (+300%)
Mutual labels:  etl
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+3980%)
Mutual labels:  etl
awesome-integration
A curated list of awesome system integration software and resources.
Stars: ✭ 117 (+680%)
Mutual labels:  etl
mlr3spatiotempcv
Spatiotemporal resampling methods for mlr3
Stars: ✭ 43 (+186.67%)
Mutual labels:  resampling
anovos
Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (+413.33%)
Mutual labels:  transformation
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+1760%)
Mutual labels:  etl
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (+66.67%)
Mutual labels:  etl
zdh web
Big data collection and extraction platform
Stars: ✭ 292 (+1846.67%)
Mutual labels:  etl
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (+20%)
Mutual labels:  etl
naas
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+1360%)
Mutual labels:  etl
flock
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+1446.67%)
Mutual labels:  etl
BETL-old
BETL. Meta data driven ETL generation using T-SQL
Stars: ✭ 17 (+13.33%)
Mutual labels:  etl
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (+166.67%)
Mutual labels:  etl
FlowMaster
ETL flow framework based on Yaml configs in Python
Stars: ✭ 19 (+26.67%)
Mutual labels:  etl
sync-engine-example
Synchronization Algorithm Exploration: Techniques to synchronize a SQL database with external destinations.
Stars: ✭ 17 (+13.33%)
Mutual labels:  etl
mmand
Mathematical Morphology in Any Number of Dimensions
Stars: ✭ 32 (+113.33%)
Mutual labels:  resampling
nasdaq-symbols
ETL for the NASDAQ symbol file
Stars: ✭ 13 (-13.33%)
Mutual labels:  etl
dtd2mysql
MySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (+66.67%)
Mutual labels:  etl
refinery
Refinery is a tool to extract and transform semi-structured data from Excel spreadsheets of different layouts in a declarative way.
Stars: ✭ 30 (+100%)
Mutual labels:  wrangling
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+160%)
Mutual labels:  etl
DataX-src
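DataX is a widely used offline data synchronization tool/platform for heterogeneous data, providing efficient data synchronization between heterogeneous data sources including MySQL, Oracle, SqlServer, Postgre, HDFS, Hive, ADS, HBase, OTS, and ODPS.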
Stars: ✭ 21 (+40%)
Mutual labels:  etl
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+346.67%)
Mutual labels:  etl
proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (+66.67%)
Mutual labels:  etl
dogETL
A lib to transform data from JDBC, CSV, and JSON to each other.
Stars: ✭ 15 (+0%)
Mutual labels:  etl
gintonic
A declarative transformation language for GraphQL 🍸
Stars: ✭ 27 (+80%)
Mutual labels:  transformation
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (+86.67%)
Mutual labels:  etl
csv-cruncher
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (+113.33%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (+13.33%)
Mutual labels:  etl
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+413.33%)
Mutual labels:  etl
django-calaccess-raw-data
A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+306.67%)
Mutual labels:  etl
google-sheets-etl
Live import all your Google Sheets to your data warehouse
Stars: ✭ 15 (+0%)
Mutual labels:  etl
PNG-Upscale
AI Super-Resolution
Stars: ✭ 116 (+673.33%)
Mutual labels:  resampling
dsp
DSP and filtering library
Stars: ✭ 36 (+140%)
Mutual labels:  resampling
multi-imbalance
Python package for tackling multi-class imbalance problems. http://www.cs.put.poznan.pl/mlango/publications/multiimbalance/
Stars: ✭ 66 (+340%)
Mutual labels:  resampling
dot
distributed data sync with operational transformation/transforms
Stars: ✭ 73 (+386.67%)
Mutual labels:  transformation
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (+153.33%)
Mutual labels:  etl
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+406.67%)
Mutual labels:  etl
cygrid
Cygrid is a cython-powered convolution-based gridding module for astronomy
Stars: ✭ 32 (+113.33%)
Mutual labels:  resampling
iex-stocks
ETL for the IEX Stocks API
Stars: ✭ 19 (+26.67%)
Mutual labels:  etl
DQCS
Data quality control system
Stars: ✭ 34 (+126.67%)
Mutual labels:  etl
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+633.33%)
Mutual labels:  etl
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+280%)
Mutual labels:  etl
stargan2
StarGAN2 for practice
Stars: ✭ 89 (+493.33%)
Mutual labels:  transformation
butterfly
Application transformation tool
Stars: ✭ 35 (+133.33%)
Mutual labels:  transformation
datawizard
Magic potions to clean and transform your data 🧙
Stars: ✭ 149 (+893.33%)
Mutual labels:  wrangling
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (+6.67%)
Mutual labels:  etl
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (+120%)
Mutual labels:  etl
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-6.67%)
Mutual labels:  etl
spydrnet
A flexible framework for analyzing and transforming FPGA netlists. Official repository.
Stars: ✭ 49 (+226.67%)
Mutual labels:  transformation
mik
The Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservation systems).
Stars: ✭ 32 (+113.33%)
Mutual labels:  etl
modeltime.resample
Resampling Tools for Time Series Forecasting with Modeltime
Stars: ✭ 12 (-20%)
Mutual labels:  resampling
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (+120%)
Mutual labels:  etl
OpenKettleWebUI
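A web-based scheduling and control platform for data processing, built on Kettle. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.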
Stars: ✭ 138 (+820%)
Mutual labels:  etl
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+253.33%)
Mutual labels:  etl