LangChain_Wrapper_LocalLLM

Wrapper to chat with a local llm, sending custom content: Webpages, PDFs, Youtube video transcripts.

This it just a test using with oobabooga/text-generation-webui api, all running locally. Oobabooga [Text Generation Web Ui] install is not covered here!!! You can replace the API and then use the OpenAI(chat GPT) api instead.

Items added from forked repo, which I used as a starting point to learning how to interact with LLM API with embeddings.

Added multiple file upload.
Updated packages.
Target Text Generation Web UI API endpoints updated.
Option to query Wikipedia or URLs.
Option to query Plain text files.
Option to query Youtube video transcripts.
Added a chat page, with history and context, using ConversationSummaryMemory.
Many UI improvements, pratically a new app.

Works great with WizardLM-2-7B LLM model (tested using Q5_K_M quantized)

To install

clone repository
create a Conda environment (or venv)
install the requirements

To run

Start your oobabooga-api # Oobabooga [Text Generation Web Ui] install is not covered here!!!
Activate your conda environment (or venv)
Run start_linux.sh (or run "streamlit run HomePage.py")

Embeddings model

This solution uses this model for embeddings flax-sentence-embeddings/all_datasets_v4_MiniLM-L6, which will be download on first run to the huggingface cache.

Credits

It started as a fork from https://github.com/sebaxzero/LangChain_PDFChat_Oobabooga

9600-/Local-LLM-LangChain-Wrapper

LangChain_Wrapper_LocalLLM

To install

To run

Embeddings model

Credits