/paperbot

IRC bot for fetching papers/pdfs on IRC using phenny

Primary LanguagePython

paperbot

Paperbot is an IRC bot that fetches academic papers. It monitors all conversation for links to scholarly content, then fetches the content and posts a public link. This seems to help enhance the quality of discussion and make us less ignorant. When a link fails to lead to a pdf with the zotero translators, paperbot will not attempt further downloads of the paper unless paperbot was specifically spoken to.

## deets

All content is scraped using zotero/translators. These are javascript scrapers that work on a large number of academic publisher sites and are actively maintained. Paperbot offloads links to zotero/translation-server, which runs the zotero scrapers headlessly in a gecko and xulrunner environment. The scrapers return metadata and a link to the pdf. Then paperbot fetches that particular pdf. Sometimes in IRC someone drops a link straight to a pdf, which paperbot is also happy to compulsively archive.

Environment variables:

  • SCIHUB_PASSWORD - must be set to SciHub cookie
  • LOGGING - optional: bot spams logs to a channel defined as "#" plus the value of LOGGING
## TODO

It would be nice to use multiple proxies to resolve a pdf request.

## active demo

say hi to paperbot on irc.freenode.net ##hplusroadmap

## license

BSD.