/archivist

Create webarchive entries on archive.org from your raindrop.io bookmarks list using waybackpy

Primary LanguageShell

Archivist

Raindrop.ioのブックマークリストからarchive.org上に必要に応じてアーカイブエントリーを作成するスクリプト

Prereq

You need waybackpy CLI. I implemented this script with waybackpy cli, not the waybackpy package to import.

Installation

Check what the code in the script looks like before you install the script to check whether it suits your need and preference.

foobar

Subcommands

archivist init: run first.
          start 90min: start the process with 1.5h timer. only minutes available to set. overwrites history.
          restart 31min: restart the process from the history. the starting point(page) may be inaccurate.

You cannot start/restart without timer. Timer would be ignored if the process gets done earlier.

Env Vars

  • ARCHIVIST_CONFIG: config file location. defaults to ~/.config/archivist/config.
  • RAINDROP_TEST_TOKEN: raindrop test token to allow archivist to have access to your raindrops. access to raindrop.io and get the value. must be defined.
  • ARCHIVIST_DATA_DIR: defaults to ~/.local/share/archivist.
  • ARCHIVIST_IGNORE_URL_FILE: file to store URLs archivist ignores. must include web.archive.org/*. defaults to ~/.config/archivist/ignore.conf.

~/.config/archivist/config example (default config location):

export RAINDROP_TEST_TOKEN="1234567foobar"
# export ARCHIVIST_DATA_DIR="$HOME/.local/share/archivist"
# export ARCHIVIST_IGNORE_URL="$HOME/.config/archivist/ignore.conf"

What you do manually

ckecklist.txt file is created under $ARCHIVIST_DATA_DIR dir after script execution. The URLs inside checklist.txt are what returned 4xx HTTP error codes. That means you need find alternative URLs for those from web.archive.org manually.