seonglae/llama2gptq
Chat with LLaMA 2, with responses backed by reference documents retrieved from a vector database. The model runs locally using GPTQ 4-bit quantization.
Python · MIT License
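The description above implies a two-stage pipeline: retrieve reference documents from a vector database, then answer with a locally loaded GPTQ 4-bit LLaMA 2 model. The sketch below shows one way that could look; the Chroma vector store, the `TheBloke/Llama-2-7B-Chat-GPTQ` checkpoint, and the prompt format are illustrative assumptions, not the repository's actual implementation.

```python
# Minimal sketch: retrieve reference documents from a vector database,
# then answer with a locally loaded GPTQ 4-bit LLaMA 2 model.
# Checkpoint name, Chroma usage, and prompt format are assumptions.
import chromadb
from transformers import AutoModelForCausalLM, AutoTokenizer

# Vector database of reference documents (Chroma is assumed here).
client = chromadb.Client()
docs = client.create_collection("docs")
docs.add(
    ids=["1", "2"],
    documents=[
        "GPTQ quantizes model weights to 4 bits after training.",
        "LLaMA 2 is available in 7B, 13B, and 70B parameter sizes.",
    ],
)

# GPTQ 4-bit checkpoint; loading it requires the auto-gptq / optimum backends.
model_id = "TheBloke/Llama-2-7B-Chat-GPTQ"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

def chat(question: str) -> str:
    # Retrieve the most relevant reference documents for the question.
    hits = docs.query(query_texts=[question], n_results=2)
    context = "\n".join(hits["documents"][0])
    prompt = (
        "Answer using the references.\n\n"
        f"References:\n{context}\n\nQuestion: {question}\nAnswer:"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=256)
    # Decode only the newly generated tokens, not the prompt.
    return tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )

print(chat("How does GPTQ quantization work?"))
```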
Issues
- deps: renovate dependency dashboard (#2, opened by renovate)
- feat: text streaming using streamer or criterion (#33, opened by seonglae; see the streaming sketch after this list)
- feat: chat-streamlit 0.1.1 refs like perplexity (#12, opened by seonglae)
- data: url bases new db generation of texonom (#13, opened by seonglae)
- refactor: find optimal model for local chat (#4, opened by seonglae)
- feat: web ui support using fastapi & next.js (#3, opened by seonglae)
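Issue #33 asks for text streaming via a streamer or stopping criterion. In the Hugging Face transformers API this is typically done with `TextStreamer` (prints tokens as they arrive) or `TextIteratorStreamer` (yields tokens for a UI such as Streamlit). The snippet below is a hedged sketch of that approach, reusing the `model` and `tokenizer` objects from the earlier sketch; it is not taken from the repository.

```python
from threading import Thread
from transformers import TextIteratorStreamer

# Stream tokens incrementally instead of waiting for the full completion.
# `model` and `tokenizer` are assumed to be the GPTQ LLaMA 2 objects loaded
# in the previous sketch.
streamer = TextIteratorStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
inputs = tokenizer("Question: What is GPTQ?\nAnswer:", return_tensors="pt").to(model.device)

# generate() blocks, so it runs in a thread while the main loop consumes tokens.
thread = Thread(
    target=model.generate,
    kwargs=dict(**inputs, streamer=streamer, max_new_tokens=128),
)
thread.start()
for token_text in streamer:
    print(token_text, end="", flush=True)
thread.join()
```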