# Docker container for OpenRefine
[![Codacy Badge](https://api.codacy.com/project/badge/Grade/331a584a806e45feba63ed8871329c7a)](https://www.codacy.com/app/felixlohmeier/openrefine-docker?utm_source=github.com&utm_medium=referral&utm_content=opencultureconsulting/openrefine-docker&utm_campaign=Badge_Grade)

[OpenRefine](http://openrefine.org/) is a free, open source power tool for working with messy data and improving it.

These Docker images are built from officially released versions (3.3, 3.2, 3.1, 3.0, 2.8, 2.7, 2.7rc2, 2.7rc1, 2.6rc2, 2.6rc1, 2.5, 2.1, 2.0) and from a fork (2017-10-28-with-pr1294).

* [GitHub repository with Dockerfiles](https://github.com/felixlohmeier/openrefine-docker)
* [Docker Hub with Docker images](https://hub.docker.com/r/felixlohmeier/openrefine/)

The Dockerfiles are inspired by [vimagick/openrefine](https://hub.docker.com/r/vimagick/openrefine/) and [psychemedia/openrefine](https://hub.docker.com/r/psychemedia/openrefine/).

### Versions
cf. [OpenRefine Releases](https://github.com/OpenRefine/OpenRefine/releases)

* OpenRefine 3.3 (2020-01-31) from openjdk:8-jre-alpine **[3.3]** & **[latest]**
* OpenRefine 3.2 (2019-07-16) from adoptopenjdk/openjdk12:alpine-jre **[3.2-java12]**
* OpenRefine 3.2 (2019-07-16) from adoptopenjdk/openjdk11:alpine-jre **[3.2-java11]**
* OpenRefine 3.2 (2019-07-16) from openjdk:10-jre-alpine **[3.2-java10]**
* OpenRefine 3.2 (2019-07-16) from adoptopenjdk/openjdk9:alpine-slim **[3.2-java9]**
* OpenRefine 3.2 (2019-07-16) from openjdk:8-jre-alpine **[3.2]**
* OpenRefine 3.1 (2018-11-29) from adoptopenjdk/openjdk9:alpine-slim **[3.1-java9]**
* OpenRefine 3.1 (2018-11-29) from openjdk:8-jre-alpine **[3.1]**
* OpenRefine 3.0 (2018-09-16) from adoptopenjdk/openjdk9:alpine-slim **[3.0-java9]**
* OpenRefine 3.0 (2018-09-16) from openjdk:8-jre-alpine **[3.0]**
* OpenRefine 2.8 (2017-11-19) from adoptopenjdk/openjdk9:alpine-slim **[2.8-java9]**
* OpenRefine 2.8 (2017-11-19) from openjdk:8-jre-alpine **[2.8]**
* OpenRefine 2.8 (2017-11-19) from openjdk:7-jre **[2.8-java7]**
* OpenRefine 2.7 (2017-06-18) from openjdk:8-jre-alpine **[2.7]**
* OpenRefine 2.7 (2017-06-18) from openjdk:7-jre **[2.7-java7]**
* OpenRefine 2.7 Release Candidate 2 (2017-03-03) from openjdk:8-jre-alpine **[2.7rc2]**
* OpenRefine 2.7 Release Candidate 1 (2017-02-10) from openjdk:8-jre-alpine **[2.7rc1]**
* OpenRefine 2.6 Release Candidate 2 (2015-10-14) from openjdk:8-jre-alpine **[2.6rc2]**
* OpenRefine 2.6 Release Candidate 1 (2015-04-30) from openjdk:8-jre-alpine **[2.6rc1]**
* Google Refine 2.5 (2011-12-11) from openjdk:7-jre **[2.5-java7]**
* Google Refine 2.5 (2011-12-11) from openjdk:6-jre **[2.5-java6]**
* Google Refine 2.1 (2011-07-12) from openjdk:6-jre **[2.1-java6]**
* Google Refine 2.0 (2010-11-10) from openjdk:6-jre **[2.0-java6]**
* OpenRefine [fork](https://github.com/opencultureconsulting/OpenRefine) with an extended `cross` function (snapshot 2017-10-28 with pull request #1294) from openjdk:8-jre-alpine **[2017-10-28-with-pr1294]**

### Usage
```
docker run -p 3333:3333 felixlohmeier/openrefine:3.3
```

Then point your browser on the host machine to http://localhost:3333 (or, from any other machine within your network, to port 3333 of the host).
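
The web interface may need a moment to come up after the container starts. As a rough sketch (not part of this repository), a small POSIX shell helper can poll the URL until it responds. The function name `wait_for_openrefine`, the container name `refine` and the retry defaults are assumptions made for illustration, and `curl` has to be installed on the host.

```shell
# wait_for_openrefine URL [ATTEMPTS] [DELAY_SECONDS]
# Hypothetical helper: polls URL with curl until it answers or attempts run out.
wait_for_openrefine() {
  url="$1"
  attempts="${2:-30}"
  delay="${3:-1}"
  i=0
  while [ "$i" -lt "$attempts" ]; do
    # --fail makes curl return non-zero on HTTP errors (status >= 400)
    if curl --silent --fail --output /dev/null "$url"; then
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}

# Usage (assumes the container was started in the background, e.g. with
# docker run -d --rm --name refine -p 3333:3333 felixlohmeier/openrefine:3.3):
#   wait_for_openrefine http://localhost:3333 && echo "OpenRefine is up"
```

The helper returns 0 as soon as the page answers and 1 if all attempts fail, so it can gate follow-up steps in a script.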
### Example for customized run command
```
docker run --rm -p 80:3333 -v /home/felix/refine:/data:z felixlohmeier/openrefine:3.3 -i 0.0.0.0 -d /data -m 4G
```

* automatically remove the Docker container when it exits (`--rm`)
* publish internal port 3333 to host port 80 (`-p 80:3333`)
* let OpenRefine read and write data in a host directory
  * mount host path /home/felix/refine to container path /data (`-v /home/felix/refine:/data:z`); the `:z` suffix sets a shared SELinux label on the directory
  * set the OpenRefine workspace to /data (`-d /data`)
* pin Docker tag 3.3, i.e. the OpenRefine version (`:3.3`)
* make OpenRefine accessible from outside the container, i.e. from the host (`-i 0.0.0.0`)
* increase the Java heap size to 4G (`-m 4G`)
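
To see how the options listed above fit together, the following sketch wraps the command in a shell function that only prints the resulting `docker run` line instead of executing it. The function name `openrefine_cmd` and its argument order are hypothetical; the defaults mirror the example above.

```shell
# openrefine_cmd WORKSPACE [HOST_PORT] [MEMORY] [TAG]
# Hypothetical helper: prints (does not run) the docker command for the given settings.
openrefine_cmd() {
  workspace="$1"          # host directory to mount as /data
  host_port="${2:-80}"    # host port mapped to container port 3333
  memory="${3:-4G}"       # value passed to OpenRefine's -m option
  tag="${4:-3.3}"         # image tag, i.e. OpenRefine version
  printf 'docker run --rm -p %s:3333 -v %s:/data:z felixlohmeier/openrefine:%s -i 0.0.0.0 -d /data -m %s\n' \
    "$host_port" "$workspace" "$tag" "$memory"
}

# Example: preview the command for a different port, heap size and version.
openrefine_cmd /home/felix/refine 8080 2G 3.2
```

Printing the command first makes it easy to review the port mapping, mount and heap size before copying the line into a terminal.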
### See also
* Command line interface for OpenRefine: [openrefine-client](https://github.com/opencultureconsulting/openrefine-client/#docker)
* Linux Bash script to run OpenRefine in batch mode (import, transform, export): [openrefine-batch-docker.sh](https://github.com/opencultureconsulting/openrefine-batch/#docker)