A safetensor format RAG System with WebUI

Using: vLLM + LlamaIndex + Streamlit

Layer: streamlit-rounds call -> restllm call -> llamaindex-rag call -> vLLM

Use:

    pip install streamlit
    streamlit run .\streamlit.py
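
The layer chain above can be pictured as a stack of functions, each calling only the layer directly below it. The following is a minimal sketch under stated assumptions, not this project's actual code: every function name is a hypothetical stand-in, "restllm" is assumed to be a thin request/response wrapper, "streamlit-rounds" is assumed to mean one chat round in the UI, and the real vLLM, LlamaIndex, and Streamlit calls are replaced with plain Python so the flow can be run without any of them installed.

```python
from typing import Dict, List, Tuple


def vllm_generate(prompt: str) -> str:
    """Hypothetical stand-in for the vLLM layer: echoes the prompt instead of running a model."""
    return f"[model answer to: {prompt}]"


def llamaindex_rag(question: str, documents: List[str]) -> str:
    """Hypothetical stand-in for the LlamaIndex RAG layer.

    Retrieval here is naive word overlap (an assumption made for illustration),
    followed by a call down to the generation layer.
    """
    words = set(question.lower().split())
    ranked = sorted(
        documents,
        key=lambda doc: len(words & set(doc.lower().split())),
        reverse=True,
    )
    context = ranked[0] if ranked else ""
    prompt = f"Context: {context}\nQuestion: {question}"
    return vllm_generate(prompt)


def restllm(question: str, documents: List[str]) -> Dict[str, str]:
    """Hypothetical stand-in for the restllm layer: wraps the RAG answer in a response dict."""
    return {"answer": llamaindex_rag(question, documents)}


def streamlit_round(
    history: List[Tuple[str, str]], question: str, documents: List[str]
) -> str:
    """Hypothetical stand-in for one UI round: ask, record both turns, return the reply."""
    reply = restllm(question, documents)["answer"]
    history.append(("user", question))
    history.append(("assistant", reply))
    return reply


if __name__ == "__main__":
    docs = ["vLLM serves models", "Streamlit builds web apps"]
    chat: List[Tuple[str, str]] = []
    print(streamlit_round(chat, "What serves models?", docs))
```

Keeping each layer behind a single function boundary means any one of them can be swapped for the real component without touching the layers above it.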