/sotawhat

Returns latest research results by crawling arxiv papers and summarizing abstracts. Helps you stay afloat with so many new papers everyday.

Primary LanguagePython

sotawhat

This script runs using Python 3.

First, install the required packages. This script only requires nltk and PyEnchant.

$ pip3 install -r requirements.txt

If you run the error that the package punkt doesn't exist, download it by going into your Python environment and running:

$ python3

>>> import nltk
>>> nltk.download('punkt')

In MacOS, you can get the SSL error

[nltk_data] Error loading punkt: <urlopen error [SSL:
[nltk_data]     CERTIFICATE_VERIFY_FAILED] certificate verify failed:
[nltk_data]     unable to get local issuer certificate (_ssl.c:1045)>

this will be fixed by reinstalling certificates

$ /Applications/Python\ 3.x/Install\ Certificates.command

To query for a certain keyword, run:

$ python3 sotawhat.py "[keyword]" [number of results]

For example:

$ python3 sotawhat.py "perplexity" 10

If you don't specify the number of results, by default, the script returns 5 results. Each result contains the title of the paper with author and published date, a summary of the abstract, and link to the paper.

We've found that this script works well with keywords that are:

  • a model (e.g. transformer, wavenet, ...)
  • a dataset (e.g. wikitext, imagenet, ...)
  • a task (e.g. 'language model', 'machine translation', 'fuzzing', ...)
  • a metric (e.g. BLEU, perplexity, ...)
  • random stuff