A Simple RAG implementation of Zephyr7b-Beta on local cpu Model link :https://huggingface.co/TheBloke/zephyr-7B-beta-GGUF/tree/main Here i ve used 2 bit Quantised guff model
Jaykumaran/Zephyr7b-Beta-RAG-Local-LangChainGradio
A Simple RAG implementation of Zephyr7b-Beta on local cpu
Python