c79c4d7df6 | ||
---|---|---|
src | ||
.gitignore | ||
.gitpod.yml | ||
LICENSE | ||
README.md | ||
orcli |
README.md
orcli (💎+🤖)
Bash script to control OpenRefine via its HTTP API.
Features
- works with latest OpenRefine version (currently 3.6)
- run batch processes (import, transform, export)
- orcli takes care of starting and stopping OpenRefine with temporary workspaces
- allows execution of arbitrary bash scripts
- interactive mode for playing around and debugging
- your existing OpenRefine data will not be touched
- import CSV,
TSV, line-based TXT, fixed-width TXT, JSON or XML(and specify input options)- supports stdin, multiple files and URLs
- transform data by providing an undo/redo JSON file
- orcli calls specific endpoints for each operation to provide improved error handling and logging
- supports stdin, multiple files and URLs
- export to TSV,
CSV, HTML, XLS, XLSX, ODS templating export to additional formats like JSON or XML
Requirements
- GNU/Linux with Bash 4+
- jq
- curl
- OpenRefine 😉
Install
-
Navigate to the OpenRefine program directory
-
Download bash script there and make it executable
wget https://github.com/opencultureconsulting/orcli/raw/main/orcli
chmod +x orcli
Optional:
-
Create a symlink in your $PATH (e.g. to ~/.local/bin)
ln -s "${PWD}/orcli" ~/.local/bin/
-
Install Bash tab completion
-
temporary
source <(orcli completions)
-
permanently
mkdir -p ~/.bashrc.d orcli completions > ~/.bashrc.d/orcli
-
Getting Started
- Launch an interactive playground
./orcli run --interactive
- Create OpenRefine project
duplicates
from comma-separated-values (CSV) file
orcli import csv "https://git.io/fj5hF" --projectName "duplicates"
- Show OpenRefine project's metadata
orcli info "duplicates"
-
Remove duplicates (coming soon)
-
Export data from OpenRefine project to tab-separated-values (TSV) file
duplicates.tsv
orcli export tsv "duplicates" --output "duplicates.tsv"
- Write out your session history to file
example.sh
(and delete the last line to remove the history command)
history -a "example.sh"
sed -i '$ d' example.sh
- Exit playground
exit
- Run batch process
./orcli run example.sh
- Cleanup example files
rm duplicates.tsv
rm example.sh
Usage
-
Use integrated help screens for available options and examples for each command.
orcli --help
-
If your OpenRefine is running on a server, then use the environment variable OPENREFINE_URL.
OPENREFINE_URL="http://localhost:3333" orcli list
Development
orcli uses bashly for generating the one-file script from files in the src
directory
- Install bashly (requires ruby)
gem install bashly
-
Edit code in src directory
-
Generate script
bashly generate --upgrade