From e7dee1746ad5554ea0d8bfdd977d7be19097d0ae Mon Sep 17 00:00:00 2001 From: Felix Lohmeier Date: Mon, 28 Oct 2019 23:47:22 +0100 Subject: [PATCH] Notebooks for ArchivesSpace and OpenRefine MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit --- 03-archivesspace.ipynb | 132 ++++++++++++++++++++++++++++ 04-lc-openrefine.ipynb | 192 +++++++++++++++++++++++++++++++++++++++++ README.md | 12 ++- 3 files changed, 335 insertions(+), 1 deletion(-) create mode 100644 03-archivesspace.ipynb create mode 100644 04-lc-openrefine.ipynb diff --git a/03-archivesspace.ipynb b/03-archivesspace.ipynb new file mode 100644 index 0000000..2e0959f --- /dev/null +++ b/03-archivesspace.ipynb @@ -0,0 +1,132 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# ArchivesSpace 2.7.0" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Installation\n", + "\n", + "following the instructions at:\n", + "* \n", + "* " + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Change to the home directory" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "cd" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Install Java 8" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "sudo apt-get install -qq -y openjdk-8-jre-headless" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Download and unpack the zip archive" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "wget https://github.com/archivesspace/archivesspace/releases/download/v2.7.0/archivesspace-v2.7.0.zip\n", + "unzip -q archivesspace-v2.7.0.zip" + ] + }, + { +
"cell_type": "markdown", + "metadata": {}, + "source": [ + "### ArchivesSpace starten\n", + "\n", + "JupyterLab verwendet die gleiche Adresse wie ArchivesSpace, daher müssen wir JupyterLab beenden bevor wir ArchivesSpace starten.\n", + "\n", + "Bitte öffnen Sie ein Terminal (**nicht in JupyterLab**, sondern über Start > Systemwerkzeuge > LXTerminal) und geben Sie dort die folgenden Befehle ein:" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": {}, + "outputs": [], + "source": [ + "sudo systemctl stop jupyter\n", + "archivesspace/archivesspace.sh" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Los geht's...\n", + "\n", + "Nach ein paar Minuten sollte ArchivesSpace unter folgenden URLs erreichbar sein:\n", + "* http://localhost:8080/ – the staff interface\n", + "* http://localhost:8081/ – the public interface\n", + "* http://localhost:8082/ – the OAI-PMH server\n", + "* http://localhost:8089/ – the backend\n", + "* http://localhost:8090/ – the Solr admin console\n", + "\n", + "Zugangsdaten für das \"Staff Interface\" sind:\n", + "* Username: `admin`\n", + "* Password: `admin`" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Bash", + "language": "bash", + "name": "bash" + }, + "language_info": { + "codemirror_mode": "shell", + "file_extension": ".sh", + "mimetype": "text/x-sh", + "name": "bash" + } + }, + "nbformat": 4, + "nbformat_minor": 4 +} diff --git a/04-lc-openrefine.ipynb b/04-lc-openrefine.ipynb new file mode 100644 index 0000000..320bd66 --- /dev/null +++ b/04-lc-openrefine.ipynb @@ -0,0 +1,192 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Library Carpentry: OpenRefine" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "## Setup" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Java installieren" + ] + }, + { + 
"cell_type": "code", + "metadata": {}, + "source": [ + "sudo apt-get install -qq -y default-jre" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Daten von Library Carpentry herunterladen" + ] + }, + { + "cell_type": "code", + "execution_count": 2, + "metadata": { + "scrolled": true + }, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "--2019-09-02 21:00:56-- https://github.com/LibraryCarpentry/lc-open-refine/raw/gh-pages/data/doaj-article-sample.csv\n", + "Resolving github.com (github.com)... 140.82.118.4\n", + "Connecting to github.com (github.com)|140.82.118.4|:443... connected.\n", + "HTTP request sent, awaiting response... 302 Found\n", + "Location: https://raw.githubusercontent.com/LibraryCarpentry/lc-open-refine/gh-pages/data/doaj-article-sample.csv [following]\n", + "--2019-09-02 21:00:57-- https://raw.githubusercontent.com/LibraryCarpentry/lc-open-refine/gh-pages/data/doaj-article-sample.csv\n", + "Resolving raw.githubusercontent.com (raw.githubusercontent.com)... 151.101.112.133\n", + "Connecting to raw.githubusercontent.com (raw.githubusercontent.com)|151.101.112.133|:443... connected.\n", + "HTTP request sent, awaiting response... 
200 OK\n", + "Length: 524686 (512K) [text/plain]\n", + "Saving to: ‘doaj-article-sample.csv.1’\n", + "\n", + "doaj-article-sample 100%[===================>] 512,39K 962KB/s in 0,5s \n", + "\n", + "2019-09-02 21:00:58 (962 KB/s) - ‘doaj-article-sample.csv.1’ saved [524686/524686]\n", + "\n" + ] + } + ], + "source": [ + "wget https://github.com/LibraryCarpentry/lc-open-refine/raw/gh-pages/data/doaj-article-sample.csv" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Download OpenRefine and unpack it into the `openrefine` folder" + ] + }, + { + "cell_type": "code", + "execution_count": 4, + "metadata": { + "scrolled": false + }, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "--2019-09-02 21:04:31-- https://github.com/OpenRefine/OpenRefine/releases/download/3.2/openrefine-linux-3.2.tar.gz\n", + "Resolving github.com (github.com)... 140.82.118.4\n", + "Connecting to github.com (github.com)|140.82.118.4|:443... connected.\n", + "HTTP request sent, awaiting response... 
302 Found\n", + "Location: https://github-production-release-asset-2e65be.s3.amazonaws.com/6220644/7dc2a280-afc0-11e9-9a64-d6b401ada2fa?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAIWNJYAX4CSVEH53A%2F20190902%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20190902T190447Z&X-Amz-Expires=300&X-Amz-Signature=d35772d3dd43c57298ab8e2c7d28abfddeb5545257175e3bdc33a6f85b62caff&X-Amz-SignedHeaders=host&actor_id=0&response-content-disposition=attachment%3B%20filename%3Dopenrefine-linux-3.2.tar.gz&response-content-type=application%2Foctet-stream [following]\n", + "--2019-09-02 21:04:32-- https://github-production-release-asset-2e65be.s3.amazonaws.com/6220644/7dc2a280-afc0-11e9-9a64-d6b401ada2fa?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAIWNJYAX4CSVEH53A%2F20190902%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20190902T190447Z&X-Amz-Expires=300&X-Amz-Signature=d35772d3dd43c57298ab8e2c7d28abfddeb5545257175e3bdc33a6f85b62caff&X-Amz-SignedHeaders=host&actor_id=0&response-content-disposition=attachment%3B%20filename%3Dopenrefine-linux-3.2.tar.gz&response-content-type=application%2Foctet-stream\n", + "Resolving github-production-release-asset-2e65be.s3.amazonaws.com (github-production-release-asset-2e65be.s3.amazonaws.com)... 52.216.238.115\n", + "Connecting to github-production-release-asset-2e65be.s3.amazonaws.com (github-production-release-asset-2e65be.s3.amazonaws.com)|52.216.238.115|:443... connected.\n", + "HTTP request sent, awaiting response... 200 OK\n", + "Length: 106046195 (101M) [application/octet-stream]\n", + "Saving to: ‘openrefine-linux-3.2.tar.gz’\n", + "\n", + "openrefine-linux-3. 
100%[===================>] 101,13M 3,82MB/s in 32s \n", + "\n", + "2019-09-02 21:05:06 (3,13 MB/s) - ‘openrefine-linux-3.2.tar.gz’ saved [106046195/106046195]\n", + "\n", + "Total bytes read: 125419520 (120MiB, 137MiB/s)\n" + ] + } + ], + "source": [ + "wget https://github.com/OpenRefine/OpenRefine/releases/download/3.2/openrefine-linux-3.2.tar.gz\n", + "mkdir -p openrefine\n", + "tar -xzf openrefine-linux-3.2.tar.gz -C openrefine --strip 1 --totals\n", + "rm openrefine-linux-3.2.tar.gz" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Start OpenRefine (the address http://127.0.0.1:3333 opens automatically in the browser)" + ] + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "scrolled": false + }, + "outputs": [ + { + "name": "stdout", + "output_type": "stream", + "text": [ + "You have 15961M of free memory.\n", + "Your current configuration is set to use 1400M of memory.\n", + "OpenRefine can run better when given more memory. 
Read our FAQ on how to allocate more memory here:\n", + "https://github.com/OpenRefine/OpenRefine/wiki/FAQ:-Allocate-More-Memory\n", + "Starting OpenRefine at 'http://127.0.0.1:3333/'\n", + "\n", + "21:07:41.809 [ refine_server] Starting Server bound to '127.0.0.1:3333' (0ms)\n", + "21:07:41.810 [ refine_server] refine.memory size: 1400M JVM Max heap: 1407188992 (1ms)\n", + "21:07:41.819 [ refine_server] Initializing context: '/' from '/home/felix/notebooks/openrefine/webapp' (9ms)\n", + "SLF4J: Class path contains multiple SLF4J bindings.\n", + "SLF4J: Found binding in [jar:file:/home/felix/notebooks/openrefine/server/target/lib/slf4j-log4j12-1.7.18.jar!/org/slf4j/impl/StaticLoggerBinder.class]\n", + "SLF4J: Found binding in [jar:file:/home/felix/notebooks/openrefine/webapp/WEB-INF/lib/slf4j-log4j12-1.7.18.jar!/org/slf4j/impl/StaticLoggerBinder.class]\n", + "SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.\n", + "SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]\n", + "21:07:42.268 [ refine] Starting OpenRefine 3.2 [55c921b]... 
(449ms)\n", + "21:07:42.268 [ refine] initializing FileProjectManager with dir (0ms)\n", + "21:07:42.268 [ refine] /home/felix/.local/share/openrefine (0ms)\n", + "21:07:47.059 [ refine] POST /command/core/load-language (4791ms)\n", + "21:07:47.083 [ refine] GET /command/core/get-preference (24ms)\n", + "21:07:47.096 [ refine] POST /command/core/load-language (13ms)\n", + "21:07:47.104 [ refine] POST /command/core/load-language (8ms)\n", + "21:07:47.188 [ refine] POST /command/core/get-importing-configuration (84ms)\n", + "21:07:47.216 [ refine] GET /command/core/get-all-project-tags (28ms)\n", + "21:07:47.229 [ refine] GET /command/core/get-all-project-metadata (13ms)\n", + "21:07:47.348 [ refine] GET /command/core/get-languages (119ms)\n", + "21:07:47.447 [ refine] GET /command/database/saved-connection (99ms)\n", + "21:07:47.489 [ refine] GET /command/core/get-version (42ms)\n" + ] + } + ], + "source": [ + "openrefine/refine" + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Bash", + "language": "bash", + "name": "bash" + }, + "language_info": { + "codemirror_mode": "shell", + "file_extension": ".sh", + "mimetype": "text/x-sh", + "name": "bash" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +} diff --git a/README.md b/README.md index 21c67be..2e0dbe2 100644 --- a/README.md +++ b/README.md @@ -55,8 +55,18 @@ For notes and exchange we use a shared document in a [ - Metadata standards in libraries (here MARC21) - Market overview of library systems - Marshall Breeding: [Library Systems Report 2019](https://americanlibrariesmagazine.org/2019/05/01/library-systems-report-2019/) - - Assignment: blog posts on day 1 and day 2 + - Assignment due 30.10.: blog posts on day 1 and day 2 3. 
30.10.2019: Archival system ArchivesSpace + - Use cases for shell scripts + - Feedback on learning journals + - Metadata standards in archives (here ISAD(G) and EAD) + - Guest contribution on the archives of the ETH Library by Michael Gasser + - Install and use ArchivesSpace + - Installation: [Preview with nbviewer](https://nbviewer.jupyter.org/github/felixlohmeier/bibliotheks-und-archivinformatik/blob/master/03-archivesspace.ipynb) + - Try it out using the [introductory videos](https://www.youtube.com/playlist?list=PL3cxupmXL7WiXaHnpVquPrUUiLiDAMhg0) + - Assignment due 6.11.: get to know OpenRefine + - Installation: [Preview with nbviewer](https://nbviewer.jupyter.org/github/felixlohmeier/bibliotheks-und-archivinformatik/blob/master/04-lc-openrefine.ipynb) + - [Library Carpentry: OpenRefine](https://librarycarpentry.org/lc-openrefine/) 4. 06.11.2019: Transforming metadata with OpenRefine and MarcEdit 5. 13.11.2019: Discovery system VuFind and search engine Solr 6. 27.11.2019: Repository software DSpace-CRIS