OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files for automated builds. https://hub.docker.com/r/felixlohmeier/openrefine/
Go to file
Felix Lohmeier 65863aca33 added version dev-with-pr1294 2017-11-12 12:59:44 +01:00
2.6rc1 new release 2.7rc2 2017-03-14 22:36:19 +01:00
2.6rc2 new release 2.7rc2 2017-03-14 22:36:19 +01:00
2.7 added OpenRefine Release 2.7 (2017-06-18) 2017-06-20 12:49:31 +02:00
2.7rc1 new release 2.7rc2 2017-03-14 22:36:19 +01:00
2.7rc2 new release 2.7rc2 2017-03-14 22:36:19 +01:00
client-py client-py docker image is deprecated - new version available at https://hub.docker.com/r/felixlohmeier/openrefine-client/ 2017-02-02 02:30:19 +01:00
dev-with-pr1294 added version dev-with-pr1294 2017-11-12 12:57:25 +01:00
Dockerfile new release 2.7rc2 2017-03-14 22:36:19 +01:00
README.md added version dev-with-pr1294 2017-11-12 12:59:44 +01:00

README.md

Docker container for OpenRefine Server

OpenRefine is a free, open source power tool for working with messy data and improving it. These docker images are build from released versions (2.7, 2.7rc2, 2.7rc1, 2.6rc2, 2.6rc1) or from official GitHub Repository (latest).

Dockerbuild files are inspired by vimagick/openrefine and psychemedia/openrefine.

versions

cf. OpenRefine Releases

OpenRefine 2.7 (2017-06-18) from java:8-jre-alpine [2.7]

docker pull felixlohmeier/openrefine:2.7

OpenRefine 2.7 Release Candidate 2 (2017-03-03) from java:8-jre-alpine [2.7rc2]

docker pull felixlohmeier/openrefine:2.7rc2

OpenRefine 2.7 Release Candidate 1 (2017-02-10) from java:8-jre-alpine [2.7rc1]

docker pull felixlohmeier/openrefine:2.7rc1

OpenRefine 2.6 Release Candidate 2 (2015-10-14) from java:8-jre-alpine [2.6rc2]

docker pull felixlohmeier/openrefine:2.6rc2

OpenRefine 2.6 Release Candidate 1 (2015-04-30) from java:8-jre-alpine [2.6rc1]

docker pull felixlohmeier/openrefine:2.6rc1

OpenRefine Development Version (automated build) from ubuntu:trusty + jdk [latest]

docker pull felixlohmeier/openrefine

OpenRefine Development Version (automated build) from ubuntu:trusty + jdk based on fork with pull request #1294 from @claussni to extend cross () [dev-with-pr1294]

docker pull felixlohmeier/openrefine:dev-with-pr1294

usage

docker run -p 80:3333 felixlohmeier/openrefine:2.7

point your browser on host machine to http://localhost or point browser on any machine within your network to http://

example for customized run command

docker run --rm -p 80:3333 -v /home/felix/refine:/data:z felixlohmeier/openrefine:2.7 -i 0.0.0.0 -m 4G -d /data

  • automatically remove docker container when it exits
  • publish internal port 3333 to host port 80
  • mount host directory /home/felix/refine as working directory
  • make openrefine available in the network
  • increase java heap size to 4 GB
  • set refine workspace to /data

batch processing with python client

see https://hub.docker.com/r/felixlohmeier/openrefine-client/ for a command line interface (e.g. for usage in shell scripts)