All Projects → csvplus → Similar Projects or Alternatives

371 Open source projects that are alternatives of or similar to csvplus

hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (-44.78%)
Mutual labels:  etl
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (-20.9%)
Mutual labels:  etl
ip2location-csv-converter
This PHP script converts IP2Location CSV database into IP range or CIDR format.
Stars: ✭ 26 (-61.19%)
Mutual labels:  csv-format
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (+167.16%)
Mutual labels:  etl
openPDC
Open Source Phasor Data Concentrator
Stars: ✭ 109 (+62.69%)
Mutual labels:  stream-processing
Bitcoin Etl
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Stars: ✭ 174 (+159.7%)
Mutual labels:  etl
xlstream
Turns XLSX into a readable stream.
Stars: ✭ 148 (+120.9%)
Mutual labels:  stream-processing
open-stream-processing-benchmark
This repository contains the code base for the Open Stream Processing Benchmark.
Stars: ✭ 37 (-44.78%)
Mutual labels:  stream-processing
proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (-62.69%)
Mutual labels:  etl
Data Making Guidelines
📘 Making Data, the DataMade Way
Stars: ✭ 248 (+270.15%)
Mutual labels:  etl
distogram
A library to compute histograms on distributed environments, on streaming data
Stars: ✭ 19 (-71.64%)
Mutual labels:  stream-processing
Example Airflow Dags
Example DAGs using hooks and operators from Airflow Plugins
Stars: ✭ 243 (+262.69%)
Mutual labels:  etl
VBA-CSV
CSV Parser and Writer as VBA functions
Stars: ✭ 26 (-61.19%)
Mutual labels:  csv-format
Storagetapper
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+246.27%)
Mutual labels:  etl
vector
A high-performance observability data pipeline.
Stars: ✭ 12,138 (+18016.42%)
Mutual labels:  stream-processing
Elastic
R client for the Elasticsearch HTTP API
Stars: ✭ 227 (+238.81%)
Mutual labels:  etl
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+877.61%)
Mutual labels:  etl
Linq2db
Linq to database provider.
Stars: ✭ 2,211 (+3200%)
Mutual labels:  etl
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+13.43%)
Mutual labels:  etl
Extract
A cross-platform command line tool for parallelised content extraction and analysis.
Stars: ✭ 188 (+180.6%)
Mutual labels:  etl
go-rivers
Collection of stream processing / multiplexing / networking libs in Go
Stars: ✭ 35 (-47.76%)
Mutual labels:  stream-processing
Metl
Metl is a simple, web-based integration platform that allows for several different styles of data integration including messaging, file based Extract/Transform/Load (ETL), and remote procedure invocation via Web Services. Read more at www.jumpmind.com/products/metl/overview
Stars: ✭ 185 (+176.12%)
Mutual labels:  etl
SwiftBuilder
SwiftBuilder is a fast way to assign new value to the property of the object.
Stars: ✭ 26 (-61.19%)
Mutual labels:  fluent-interface
Grafter
Linked Data & RDF Manufacturing Tools in Clojure
Stars: ✭ 174 (+159.7%)
Mutual labels:  etl
makinage
Stream Processing Made Easy
Stars: ✭ 31 (-53.73%)
Mutual labels:  stream-processing
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (-58.21%)
Mutual labels:  etl
sync-engine-example
Synchronization Algorithm Exploration: Techniques to synchronize a SQL database with external destinations.
Stars: ✭ 17 (-74.63%)
Mutual labels:  etl
csv-cruncher
Treats CSV and JSON files as SQL tables, and exports SQL SELECTs back to CSV or JSON.
Stars: ✭ 32 (-52.24%)
Mutual labels:  etl
awesome-integration
A curated list of awesome system integration software and resources.
Stars: ✭ 117 (+74.63%)
Mutual labels:  etl
Usaspending Api
Server application to serve U.S. federal spending data via a RESTful API
Stars: ✭ 166 (+147.76%)
Mutual labels:  etl
iex-stocks
ETL for the IEX Stocks API
Stars: ✭ 19 (-71.64%)
Mutual labels:  etl
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+7241.79%)
Mutual labels:  etl
fluentcheck
Fluent assertions for Python
Stars: ✭ 79 (+17.91%)
Mutual labels:  fluent-interface
Open Semantic Etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
Stars: ✭ 165 (+146.27%)
Mutual labels:  etl
neo4j-jdbc
JDBC driver for Neo4j
Stars: ✭ 110 (+64.18%)
Mutual labels:  etl
Etl unicorn
数据可视化, 数据挖掘, 数据处理 ETL
Stars: ✭ 156 (+132.84%)
Mutual labels:  etl
django-data-migration
Data migration framework for Django that migrates legacy data into your new django app
Stars: ✭ 18 (-73.13%)
Mutual labels:  etl
streamsx.kafka
Repository for integration with Apache Kafka
Stars: ✭ 13 (-80.6%)
Mutual labels:  stream-processing
Mara Example Project 2
An example mini data warehouse for python project stats, template for new projects
Stars: ✭ 154 (+129.85%)
Mutual labels:  etl
beepbeep-3
An event stream processor anyone can use
Stars: ✭ 20 (-70.15%)
Mutual labels:  stream-processing
Omniparser
omniparser: a native Golang ETL streaming parser and transform library for CSV, JSON, XML, EDI, text, etc.
Stars: ✭ 148 (+120.9%)
Mutual labels:  etl
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+108.96%)
Mutual labels:  etl
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+14.93%)
Mutual labels:  etl
football-events
Event-Driven microservices with Kafka Streams
Stars: ✭ 57 (-14.93%)
Mutual labels:  stream-processing
Mara Pipelines
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Stars: ✭ 1,841 (+2647.76%)
Mutual labels:  etl
Kettle Web
基于spring boot通过java代码调用kette
Stars: ✭ 128 (+91.04%)
Mutual labels:  etl
Reddit Detective
Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (+92.54%)
Mutual labels:  etl
filterCSV
Tools to manipulate CSV files in a format suitable for importing into various mindmapping programs - such as iThoughts, Freemind, and MindNode.
Stars: ✭ 29 (-56.72%)
Mutual labels:  csv-format
dtd2mysql
MySQL / MariaDB import for DTD feeds (fares, timetable and routeing)
Stars: ✭ 25 (-62.69%)
Mutual labels:  etl
Etl.net
Mass processing data with a complete ETL for .net developers
Stars: ✭ 129 (+92.54%)
Mutual labels:  etl
FluentInterfaceCreator
Tool to create fluent interface files
Stars: ✭ 58 (-13.43%)
Mutual labels:  fluent-interface
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (-29.85%)
Mutual labels:  etl
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-76.12%)
Mutual labels:  etl
naas
⚙️ Schedule notebooks, run them like APIs, expose securely your assets: Jupyter as a viable ⚡️ Production environment
Stars: ✭ 219 (+226.87%)
Mutual labels:  etl
sp
Stream Processors on Kafka in Golang
Stars: ✭ 29 (-56.72%)
Mutual labels:  stream-processing
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+3459.7%)
Mutual labels:  etl
Kiba
Data processing & ETL framework for Ruby
Stars: ✭ 1,618 (+2314.93%)
Mutual labels:  etl
wikirepo
Python based Wikidata framework for easy dataframe extraction
Stars: ✭ 33 (-50.75%)
Mutual labels:  etl
Sentinel Crawler
Xenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure 💫 多语言执行器,分布式爬虫
Stars: ✭ 118 (+76.12%)
Mutual labels:  etl
Datax
DataX is an open source universal ETL tool that support Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto(Trino), PostgreSQL, SQL Server
Stars: ✭ 116 (+73.13%)
Mutual labels:  etl
61-120 of 371 similar projects