
datamill-co / singer-runner

License: Apache-2.0
A CLI and library to run Singer Taps and Targets

Programming Languages

python

Projects that are alternatives of or similar to singer-runner

polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+60.61%)
Mutual labels:  etl
python mozetl
ETL jobs for Firefox Telemetry
Stars: ✭ 25 (-24.24%)
Mutual labels:  etl
tap-rest-api
Singer.io tap for generic Rest API
Stars: ✭ 18 (-45.45%)
Mutual labels:  singer
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+72.73%)
Mutual labels:  etl
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+42.42%)
Mutual labels:  etl
OpenKettleWebUI
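A Kettle-based web platform for scheduling and controlling data processing. It supports file repositories and database repositories, controls Kettle data transformations through the web platform, and can be integrated into existing systems as middleware.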
Stars: ✭ 138 (+318.18%)
Mutual labels:  etl
proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
Stars: ✭ 25 (-24.24%)
Mutual labels:  etl
sql-to-redis
🔄 Simple tool for ETL. From SQL to Redis.
Stars: ✭ 18 (-45.45%)
Mutual labels:  etl
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (-15.15%)
Mutual labels:  etl
nasdaq-symbols
ETL for the NASDAQ symbol file
Stars: ✭ 13 (-60.61%)
Mutual labels:  etl
sync-engine-example
Synchronization Algorithm Exploration: Techniques to synchronize a SQL database with external destinations.
Stars: ✭ 17 (-48.48%)
Mutual labels:  etl
django-calaccess-raw-data
A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+84.85%)
Mutual labels:  etl
covid-19
Data ETL & Analysis on the global and Mexican datasets of the COVID-19 pandemic.
Stars: ✭ 14 (-57.58%)
Mutual labels:  etl
starlake
Starlake is a Spark Based On Premise and Cloud ELT/ETL Framework for Batch & Stream Processing
Stars: ✭ 16 (-51.52%)
Mutual labels:  etl
DQCS
Data quality control system
Stars: ✭ 34 (+3.03%)
Mutual labels:  etl
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+18.18%)
Mutual labels:  etl
csvplus
csvplus extends the standard Go encoding/csv package with fluent interface, lazy stream operations, indices and joins.
Stars: ✭ 67 (+103.03%)
Mutual labels:  etl
dogETL
A library to transform data between JDBC, CSV, and JSON.
Stars: ✭ 15 (-54.55%)
Mutual labels:  etl
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1754.55%)
Mutual labels:  etl
awesome-singer-official-portal
A curated list of awesome singer official portals. Live Website ➡️
Stars: ✭ 45 (+36.36%)
Mutual labels:  singer

singer-runner

Singer Runner manages tap and target processes, as well as metrics, state, and configuration.

Features

  • Run a tap or target.
  • Pass run options via CLI parameters or a JSON/YAML config file.
  • Use local file system or S3 for piping the singer stream, storing state, and storing metrics.
  • Metric storage, piping, and state storage can be extended / customized by inheriting from the base classes.
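
As an illustration of the extension point in the last bullet, the sketch below defines a base class for state storage and a subclass that keeps state in a local JSON file. The names `BaseStateStorage`, `LocalFileStateStorage`, `load`, and `save` are assumptions made for this example. The real base classes in singer_runner may be named and shaped differently, so treat this as a pattern and not as the library's API.

```python
import json
import os
import tempfile
from abc import ABC, abstractmethod


class BaseStateStorage(ABC):
    """Hypothetical base class; not singer_runner's actual API."""

    @abstractmethod
    def load(self):
        """Return the last saved state as a dict, or {} if none exists."""

    @abstractmethod
    def save(self, state):
        """Persist the given state dict."""


class LocalFileStateStorage(BaseStateStorage):
    """Stores state as a JSON file on the local file system."""

    def __init__(self, path):
        self.path = path

    def load(self):
        if not os.path.exists(self.path):
            return {}
        with open(self.path) as f:
            return json.load(f)

    def save(self, state):
        with open(self.path, "w") as f:
            json.dump(state, f)


def demo_state_roundtrip():
    # Save a state dict to a temporary file and read it back.
    with tempfile.TemporaryDirectory() as tmp:
        storage = LocalFileStateStorage(os.path.join(tmp, "state.json"))
        before = storage.load()
        storage.save({"bookmarks": {"users": {"updated_at": "2020-01-01"}}})
        return before, storage.load()
```

An S3-backed variant would follow the same shape, replacing the file reads and writes with object reads and writes.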

Usage

Requires Python 3; tested with Python 3.7.

Install

pip install singer-runner

Run

$ singer-runner 
Usage: singer-runner [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  run-tap
  run-target

Concepts

  • Pipes
    • Pipes move a stream of Singer messages from tap to target. A pipe can be as simple as a local file, or it can be a file in S3 or Kafka.
  • State Storage
  • Metrics Storage
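
To make the pipe concept concrete, here is a minimal sketch of a local-file pipe: the tap side appends one JSON-encoded Singer message per line, and the target side reads them back in order. `LocalFilePipe` and its `put`/`get` methods are hypothetical names for this example, not classes from singer_runner.pipes. The message types (SCHEMA, RECORD, STATE) follow the Singer specification.

```python
import json
import os
import tempfile


class LocalFilePipe:
    """Hypothetical pipe backed by a newline-delimited JSON file."""

    def __init__(self, path):
        self.path = path

    def put(self, message):
        # Tap side: append one Singer message as a single JSON line.
        with open(self.path, "a") as f:
            f.write(json.dumps(message) + "\n")

    def get(self):
        # Target side: yield messages in the order they were written.
        with open(self.path) as f:
            for line in f:
                if line.strip():
                    yield json.loads(line)


def demo_pipe():
    messages = [
        {"type": "SCHEMA", "stream": "users", "key_properties": ["id"],
         "schema": {"properties": {"id": {"type": "integer"}}}},
        {"type": "RECORD", "stream": "users", "record": {"id": 1}},
        {"type": "STATE", "value": {"users": 1}},
    ]
    with tempfile.TemporaryDirectory() as tmp:
        pipe = LocalFilePipe(os.path.join(tmp, "stream.jsonl"))
        for message in messages:
            pipe.put(message)
        # Consume the generator while the temporary file still exists.
        return [m["type"] for m in pipe.get()]
```

Because the stream is stored and not held in memory, the tap and target do not need to run at the same time, which is presumably why a file or S3 object can serve as a pipe.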

Programmatic Usage

Singer Runner can be used within any Python application. The primary functions are in singer_runner.runner, including:

  • run_tap runs a tap
  • run_target runs a target

Classes in the singer_runner.metrics, singer_runner.pipes, and singer_runner.state modules can be passed as arguments, along with catalog/config.
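
The signatures of run_tap and run_target are not given here, so the sketch below does not call them. It only illustrates, under that caveat, what managing a tap process involves: start the tap as a subprocess, read Singer messages from its stdout, and keep the most recent STATE message. The helper `run_fake_tap` and the inline stand-in tap are hypothetical and exist only for this example.

```python
import json
import subprocess
import sys

# A stand-in "tap": a tiny Python program that prints Singer messages.
FAKE_TAP = r'''
import json
for m in [
    {"type": "RECORD", "stream": "users", "record": {"id": 1}},
    {"type": "STATE", "value": {"users": 1}},
    {"type": "RECORD", "stream": "users", "record": {"id": 2}},
    {"type": "STATE", "value": {"users": 2}},
]:
    print(json.dumps(m))
'''


def run_fake_tap():
    # Launch the stand-in tap and capture its stdout as text.
    proc = subprocess.Popen(
        [sys.executable, "-c", FAKE_TAP],
        stdout=subprocess.PIPE,
        text=True,
    )
    records, last_state = [], None
    for line in proc.stdout:
        message = json.loads(line)
        if message["type"] == "RECORD":
            records.append(message["record"])
        elif message["type"] == "STATE":
            # Later STATE messages supersede earlier ones.
            last_state = message["value"]
    proc.wait()
    return proc.returncode, records, last_state
```

In a real run, the records would go to a pipe and the final state to state storage. The classes from the modules listed above presumably fill those roles.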
