BentoMilvusLite

Install dependencies

    pip install -U pymilvus bentoml

Run the RAG app

Clone the entire repo.

Deploy an embedding model and a large language model on BentoCloud, then retrieve the BENTO_EMBEDDING_MODEL_END_POINT and BENTO_LLM_END_POINT values.

Run the following:

    python rag_service.py
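
Before running rag_service.py, it can help to confirm that both endpoint values are in place. The sketch below is only an illustration: it assumes the two values are supplied as environment variables holding http(s) URLs, which this README does not state, and the helper name `missing_endpoints` is hypothetical rather than part of the repo. If rag_service.py expects the endpoints somewhere else (for example, edited directly into the script), adapt the check accordingly.

```python
import os
from urllib.parse import urlparse

# Names taken from the README; reading them from the environment is an assumption.
REQUIRED_VARS = ("BENTO_EMBEDDING_MODEL_END_POINT", "BENTO_LLM_END_POINT")


def missing_endpoints(env):
    """Return the required names that are unset or not http(s) URLs.

    `env` is any mapping of variable names to strings, e.g. os.environ.
    This is a hypothetical helper, not something shipped with the repo.
    """
    problems = []
    for name in REQUIRED_VARS:
        value = env.get(name, "").strip()
        parsed = urlparse(value)
        # A usable endpoint needs both a web scheme and a host part.
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            problems.append(name)
    return problems


problems = missing_endpoints(os.environ)
if problems:
    print("Set these to your BentoCloud endpoint URLs:", ", ".join(problems))
else:
    print("Both endpoints look set; next step: python rag_service.py")
```

The check only validates the shape of each value; it does not contact BentoCloud, so a well-formed but wrong URL will still pass.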