All Projects → opencultureconsulting → openrefine-docker

opencultureconsulting / openrefine-docker

Licence: other
OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds.

Programming Languages

Dockerfile
14818 projects

Projects that are alternatives of or similar to openrefine-docker

openrefine-client
The OpenRefine Python Client from Paul Makepeace provides a library for communicating with an OpenRefine server. This fork extends the command line interface (CLI) and is distributed as a convenient one-file-executable (Windows, Linux, Mac). It is also available via Docker Hub, PyPI and Binder.
Stars: ✭ 67 (+252.63%)
Mutual labels:  etl, openrefine, code4lib
openrefine-batch
Shell script to run OpenRefine in batch mode (import, transform, export). It orchestrates OpenRefine (server) and a python client that communicates with the OpenRefine API.
Stars: ✭ 76 (+300%)
Mutual labels:  etl, openrefine, code4lib
Library-Search-Plugin-Public
The Library Search Plugin plugin allows users (students, researchers, etc.) to search your library's catalogue, Google Scholar, WorldCat, or PubMed, without having to navigate to the respective websites first! It also comes with a neat context menu that allows users to select text, right-click, and search!
Stars: ✭ 17 (-10.53%)
Mutual labels:  code4lib
spdr-etf-holdings
ETL for the SPDR ETF holdings XLS documents
Stars: ✭ 14 (-26.32%)
Mutual labels:  etl
oesophagus
Enterprise Grade Single-Step Streaming Data Infrastructure Setup. (Under Development)
Stars: ✭ 12 (-36.84%)
Mutual labels:  etl
kafka-connect-datagen
A Kafka Connect source connector that generates data for tests
Stars: ✭ 27 (+42.11%)
Mutual labels:  etl
koza
Data transformation framework for LinkML data models
Stars: ✭ 21 (+10.53%)
Mutual labels:  etl
redis-connect-dist
Real-Time Event Streaming & Change Data Capture
Stars: ✭ 21 (+10.53%)
Mutual labels:  etl
cardano-py
Python3 lib and cli for operating a Cardano Passive Node and using the API's. (PRE-ALPHA)
Stars: ✭ 17 (-10.53%)
Mutual labels:  etl
astro
Astro allows rapid and clean development of {Extract, Load, Transform} workflows using Python and SQL, powered by Apache Airflow.
Stars: ✭ 79 (+315.79%)
Mutual labels:  etl
lineage
Generate beautiful documentation for your data pipelines in markdown format
Stars: ✭ 16 (-15.79%)
Mutual labels:  etl
sparklanes
A lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-10.53%)
Mutual labels:  etl
carry
Python ETL(Extract-Transform-Load) tool / Data migration tool
Stars: ✭ 115 (+505.26%)
Mutual labels:  etl
rivery cli
Rivery CLI
Stars: ✭ 16 (-15.79%)
Mutual labels:  etl
mlbgameday
Multi-core processing of 'Gameday' data from Major League Baseball Advanced Media. Additional tools to parallelize large data sets and write them to a database.
Stars: ✭ 37 (+94.74%)
Mutual labels:  etl
maxwell-sink
consume maxwell generated message from kafka,export it to another mysql.
Stars: ✭ 16 (-15.79%)
Mutual labels:  etl
conciliator
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Stars: ✭ 95 (+400%)
Mutual labels:  openrefine
es2postgres
ElasticSearch to PostgreSQL loader
Stars: ✭ 18 (-5.26%)
Mutual labels:  etl
brunnhilde
Siegfried-based characterization tool for directories and disk images
Stars: ✭ 55 (+189.47%)
Mutual labels:  code4lib
etl
M-Lab ingestion pipeline
Stars: ✭ 15 (-21.05%)
Mutual labels:  etl

Docker container for OpenRefine

Codacy Badge

OpenRefine is a free, open source power tool for working with messy data and improving it. These docker images are build from official released versions (3.5.0, 3.4.1, 3.4, 3.3, 3.2, 3.1, 3.0, 2.8, 2.7, 2.7rc2, 2.7rc1, 2.6rc2, 2.6rc1, 2.5, 2.1, 2.0) and from a fork (2017-10-28-with-pr1294).

Dockerbuild files are inspired by vimagick/openrefine and psychemedia/openrefine.

Versions

cf. OpenRefine Releases

OpenRefine 4.0-snapshot (2021-07-12) from openjdk:11-jre-alpine [4.0-snapshot]

OpenRefine 3.5.0 (2021-11-07) from openjdk:8-jre-alpine [3.5.0] & [latest]

OpenRefine 3.4.1 (2020-09-24) from openjdk:8-jre-alpine [3.4.1]

OpenRefine 3.4 (2020-09-06) from openjdk:8-jre-alpine [3.4]

OpenRefine 3.3 (2020-01-31) from openjdk:8-jre-alpine [3.3]

OpenRefine 3.2 (2019-07-16) from adoptopenjdk/openjdk12:alpine-jre [3.2-java12]

OpenRefine 3.2 (2019-07-16) adoptopenjdk/openjdk11:alpine-jre [3.2-java11]

OpenRefine 3.2 (2019-07-16) from openjdk:10-jre-alpine [3.2-java10]

OpenRefine 3.2 (2019-07-16) from adoptopenjdk/openjdk9:alpine-slim [3.2-java9]

OpenRefine 3.2 (2019-07-16) from openjdk:8-jre-alpine [3.2]

OpenRefine 3.1 (2018-11-29) from adoptopenjdk/openjdk9:alpine-slim [3.1-java9]

OpenRefine 3.1 (2018-11-29) from openjdk:8-jre-alpine [3.1]

OpenRefine 3.0 (2018-09-16) from adoptopenjdk/openjdk9:alpine-slim [3.0-java9]

OpenRefine 3.0 (2018-09-16) from openjdk:8-jre-alpine [3.0]

OpenRefine 2.8 (2017-11-19) from adoptopenjdk/openjdk9:alpine-slim [2.8-java9]

OpenRefine 2.8 (2017-11-19) from openjdk:8-jre-alpine [2.8]

OpenRefine 2.8 (2017-11-19) from openjdk:7-jre [2.8-java7]

OpenRefine 2.7 (2017-06-18) from openjdk:8-jre-alpine [2.7]

OpenRefine 2.7 (2017-06-18) from openjdk:7-jre [2.7-java7]

OpenRefine 2.7 Release Candidate 2 (2017-03-03) from openjdk:8-jre-alpine [2.7rc2]

OpenRefine 2.7 Release Candidate 1 (2017-02-10) from openjdk:8-jre-alpine [2.7rc1]

OpenRefine 2.6 Release Candidate 2 (2015-10-14) from openjdk:8-jre-alpine [2.6rc2]

OpenRefine 2.6 Release Candidate 1 (2015-04-30) from openjdk:8-jre-alpine [2.6rc1]

Google Refine 2.5 (2011-12-11) from openjdk:7-jre [2.5-java7]

Google Refine 2.5 (2011-12-11) from openjdk:6-jre [2.5-java6]

Google Refine 2.1 (2011-07-12) from openjdk:6-jre [2.1-java6]

Google Refine 2.0 (2010-11-10) from openjdk:6-jre [2.0-java6]

OpenRefine fork with extended cross (snapshot 2017-10-28 with pull request #1294) from openjdk:8-jre-alpine [2017-10-28-with-pr1294]

Usage

docker run -p 3333:3333 felixlohmeier/openrefine

point your browser on host machine to http://localhost:3333 (or on any machine within your network)

Example for customized run command

docker run --rm -p 80:3333 -v /home/felix/refine:/data:z felixlohmeier/openrefine:3.5.0 -i 0.0.0.0 -d /data -m 4G
  • automatically remove docker container when it exits (--rm)
  • publish internal port 3333 to host port 80 (-p 80:3333)
  • let OpenRefine read and write data in host directory
    • mount host path /home/felix/refine to container path /data (-v /home/felix/refine:/data:z)
    • set OpenRefine workspace to /data (-d /data)
  • pin docker tag 3.5.0 (i.e. OpenRefine version) (:3.5.0)
  • set Openrefine to be accessible from outside the container, i.e. from host (-i 0.0.0.0)
  • increase java heap size to 4G (-m 4g)

See also

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].