This repo holds static copies of notebooks for the Anserini IR toolkit (Java) and Pyserini (Python interface to Anserini). There are two ways to play with the notebooks here, using Colab and Binder.
The notebooks in this repo are sync'ed (by hand) with notebooks in Colab. These online demos provide a low-effort way to try out Anserini and Pyserini features:
- Anserini demo on Robust04: [Colab] [GitHub]
- Pyserini demo on Robust04: [Colab] [GitHub]
- Pyserini demo on MS MARCO passage ranking task: [Colab] [GitHub]
These are older notebooks that aren't being maintained anymore:
- Pyserini Demo on COVID-19 Dataset (Title + Abstract Index): [Colab] [GitHub]
- Pyserini Demo on COVID-19 Dataset (Paragraph Index): [Colab] [GitHub]
- Pyserini+SciBERT Demo on COVID-19 Dataset (Title + Abstract Index): [Colab] [GitHub]
- Related Article Search on COVID-19 Dataset: [Colab] [GitHub]
Click "Open in Playground" and you'll be able to replicate our results!
This entire repo is configured with work with Binder. Click the "launch binder" icon on the top-left corner to initialize an executable environment around these notebooks.
For convenience, we've pre-built a few common indexes, available to download here.