All Projects → twineworks → ruby-for-pentaho-kettle

twineworks / ruby-for-pentaho-kettle

Licence: other
Ruby scripting for pentaho-kettle

Programming Languages

java
68154 projects - #9 most used programming language
ruby
36898 projects - #4 most used programming language
HTML
75241 projects

Projects that are alternatives of or similar to ruby-for-pentaho-kettle

OpenKettleWebUI
一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
Stars: ✭ 138 (+228.57%)
Mutual labels:  etl, kettle
docker-biserver-ce
Docker image for Pentaho BI server(community edition). https://hub.docker.com/r/zhicwu/biserver-ce/
Stars: ✭ 35 (-16.67%)
Mutual labels:  pdi, kettle
architect big data solutions with spark
code, labs and lectures for the course
Stars: ✭ 40 (-4.76%)
Mutual labels:  etl
oic-options-chains
ETL for OIC Options Chains
Stars: ✭ 22 (-47.62%)
Mutual labels:  etl
go-bqloader
bqloader is a simple ETL framework to load data from Cloud Storage into BigQuery.
Stars: ✭ 16 (-61.9%)
Mutual labels:  etl
etlflow
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for writing various different tasks, jobs on GCP and AWS.
Stars: ✭ 38 (-9.52%)
Mutual labels:  etl
PDAP-Scrapers
Code relating to scraping public police data.
Stars: ✭ 72 (+71.43%)
Mutual labels:  etl
dogETL
A lib to transform data from jdbc,csv,json to ecah other.
Stars: ✭ 15 (-64.29%)
Mutual labels:  etl
dswarm
an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)
Stars: ✭ 57 (+35.71%)
Mutual labels:  etl
cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
Stars: ✭ 109 (+159.52%)
Mutual labels:  etl
dflib
In-memory Java DataFrame library
Stars: ✭ 50 (+19.05%)
Mutual labels:  etl
cubetl
CubETL - Framework and tool for data ETL (Extract, Transform and Load) in Python (PERSONAL PROJECT / SELDOM MAINTAINED)
Stars: ✭ 21 (-50%)
Mutual labels:  etl
mik
The Move to Islandora Kit is an extensible PHP command-line tool for converting source content and metadata into packages suitable for importing into Islandora (or other digital repository and preservations systems).
Stars: ✭ 32 (-23.81%)
Mutual labels:  etl
mydataharbor
🇨🇳 MyDataHarbor是一个致力于解决任意数据源到任意数据源的分布式、高扩展性、高性能、事务级的数据同步中间件。帮助用户可靠、快速、稳定的对海量数据进行准实时增量同步或者定时全量同步,主要定位是为实时交易系统服务,亦可用于大数据的数据同步(ETL领域)。
Stars: ✭ 28 (-33.33%)
Mutual labels:  etl
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-59.52%)
Mutual labels:  etl
web-click-flow
网站点击流离线日志分析
Stars: ✭ 14 (-66.67%)
Mutual labels:  etl
singer-runner
A CLI and library to run Singer Taps and Targets
Stars: ✭ 33 (-21.43%)
Mutual labels:  etl
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-42.86%)
Mutual labels:  etl
DataBridge.NET
Configurable data bridge for permanent ETL jobs
Stars: ✭ 16 (-61.9%)
Mutual labels:  etl
persistity
A persistence framework for game developers
Stars: ✭ 34 (-19.05%)
Mutual labels:  etl

Ruby scripting for pentaho-kettle

Kettle 5.x Java 7+ License Maven Central Travis Build Status Appveyor Build status

An plugin for Pentaho kettle (PDI) allowing to include ruby scripts as transformation steps.

The elegance of the ruby language is paired with ultimate row processing flexibility. The step allows scripts to read, write, aggregate, consume and redirect rows to accomplish the most complex of data processing tasks in one place.

The implementation is based on JRuby. Thanks to JRuby's great Java interop, the scripting step also enables easy Java scripting in kettle.

Supported versions of pentaho-kettle

The plugin works with Kettle versions 5.x - 8.x.

How to get it?

Grab the latest release from the releases page. You can also get the plugin zip as a maven dependency from maven central.

How to install?

Decompress the release zip to <kettle-dir>/plugins and restart Spoon. The "Ruby Script" step will appear in the "Scripting" section of a transformation.

How do I write ruby scripts in kettle?

The ruby scripting step comes with a lot of samples. You can access them by opening a ruby step dialog and exploring the samples section on the left. Samples

Features at a glance

  • rows are represented as hashes, indexed by field name
  • automatic conversion between all kettle data types and ruby types
  • steps with no input can be used as row generators
  • can redirect rows to an error handling stream
  • can read from specific info steps
  • can write to specific target steps
  • can call your favorite Java libraries
  • can use ruby gems
  • kettle's step $step and transformation $trans objects are available in ruby scope for advanced scripting

Where do I report bugs and issues?

Just open issues on github.

What about ruby gems?

You can use gems, see the Ruby gems article on the project wiki. As long as JRuby likes the gem, which usually means that the gem has no unsupported native bindings, you may use gems as with any other ruby program.

How do I build the project?

mvn clean package

It creates the plugin zip in target/ruby-for-pentaho-kettle-{version}-plugin.zip.

How do I run the test suite?

Create a package, then run the tests. The packaging process unzips the plugin into target/ruby-for-pentaho-kettle, which enables kettle to find the plugin when running integration tests.

mvn clean package
mvn -DskipTests=false test

How can I contribute?

If you'd like to contribute please fork the project, add the feature or bugfix and send a pull request. If your change majorly alters the way the plugin works, we should discuss it via an open issue first.

License

The ruby-for-pentaho-kettle plugin uses the LGPL 2.1 license.

Support

Open source does not mean you're on your own. The ruby-for-pentaho-kettle plugin is developed by Twineworks GmbH. Twineworks offers commercial support and consulting services. Contact us if you'd like us to help with a project.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].