This project contains a Gradio application for answering questions about the "Concepts of Biology" textbook. The application runs inside a Docker container.
Before running the application, please download the Concepts of Biology PDF and place it in the same directory as this README.
docker build -t gradio-app:latest .
docker run -d -it --privileged --net=host -e DISPLAY=unix$DISPLAY -v /tmp/.X11-unix:/tmp/.X11-unix --name RAG-app gradio-app:latest
The application is built with Gradio, which provides the link for opening the QA chatbot.
- To improve retrieval, we could have extracted metadata from the text chunks and used it for keyword retrieval (e.g. BM25) alongside dense vector embeddings.
- To reduce inference time, we could have served the LLM with ctransformers or vLLM (PagedAttention).
- Experimenting with newer models such as "OpenChat 3.5" could yield better answers.
- Due to time constraints, we were not able to add APIs.
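
As a rough illustration of the hybrid retrieval idea in the first point, here is a minimal sketch in plain Python. It is not part of this project: the sample chunks, the placeholder dense scores, and all function names are made up for illustration, and reciprocal rank fusion is just one possible way (an assumption here) to combine the BM25 ranking with the embedding-based ranking.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive lowercase whitespace tokenizer; a real pipeline would also strip punctuation.
    return text.lower().split()


def bm25_scores(query, chunks, k1=1.5, b=0.75):
    """Score every chunk against the query with Okapi BM25."""
    docs = [tokenize(c) for c in chunks]
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Document frequency: number of chunks that contain each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def rank(scores):
    # Chunk indices ordered from best to worst score.
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several rankings: each chunk accumulates 1 / (k + position)."""
    fused = Counter()
    for ranking in rankings:
        for position, idx in enumerate(ranking, start=1):
            fused[idx] += 1.0 / (k + position)
    return [idx for idx, _ in sorted(fused.items(), key=lambda kv: kv[1], reverse=True)]


# Hypothetical chunks standing in for passages extracted from the PDF.
chunks = [
    "mitosis divides the nucleus of a cell",
    "photosynthesis converts light energy into chemical energy",
    "enzymes speed up chemical reactions",
]
keyword_scores = bm25_scores("photosynthesis light energy", chunks)
# Placeholder similarities; in practice these would come from the embedding model.
dense_scores = [0.10, 0.80, 0.30]
hybrid = reciprocal_rank_fusion([rank(keyword_scores), rank(dense_scores)])
print(hybrid[0])  # index of the chunk ranked first by the fused ordering
```

Rank-based fusion avoids having to normalize BM25 scores and cosine similarities onto a common scale, which is why it is used in this sketch; a weighted sum of normalized scores would be another option.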