/paperlens

PaperLens AI is a LLM-powered Python tool for summarizing and applying topic modeling on a large number of research paper PDFs completely for free using the Groq API.

Primary LanguagePythonMIT LicenseMIT

PaperLens AI

Description:

PaperLens AI is a LLM-powered Python tool for summarizing and applying topic modeling on a large number of research paper PDFs completely for free using the Groq API. It was developed primarily for doing quick literature reviews during my internship at Stockholm School of Economics, but can also be used for basic Retrieval-Augmented Generation (RAG). Using the default settings it can process ca 4 standard-length papers/minute using the Llama3 8B model.

How to use:

  1. Install the script by running one of below commands in a terminal depending on your OS:

Windows PowerShell:

iwr https://raw.githubusercontent.com/crabfeather/paperlens/main/PaperLensAI.py -O PaperLensAI.py

Linux/MacOS:

curl https://raw.githubusercontent.com/crabfeather/paperlens/main/PaperLensAI.py -o PaperLensAI.py
  1. Verify the latest version of Python3 and PIP are installed by running:
python --version
pip --version
  1. Install the required libraries:
pip install PyPDF2
pip install groq
  1. Edit the script to include a free Groq API key generated from https://console.groq.com/keys
  2. Modify the prompts and AI query parameters in the script to your liking. If needed edit the rate limits and chunk size accordingly.
  3. Run the script with Python for your folder of PDFs:
python PaperLensAI.py <input-folder>

Contact:

Allan Edh
Research Intern, Stockholm School of Economics (SSE)
allan.edh@gmail.com