This repository provides very basic flask, Streamlit, and docker examples for the llama_index (FKA gpt_index) package.
If you need to quickly create a POC to impress your boss, start here!
If you are having trouble with dependencies, I dump my entire env into requirements_full.txt
, but otherwise, use the base requirements.txt
.
The basic demo includes the classic "Paul Graham Essay" from the original llama_index repo. Some good starting questions are
- What did the author do growing up?
- Tell me more about interleaf
conda create --name llama_index python=3.11
pip install -r requirements.txt
There are two main example folders
-
flask (runs on localhost:5601/2)
sh launch_app.py
- creates a simple api that loads the text from the documents folder
- the "/query" endpoint accepts requests that contain a "text" parameter, which is used to query the index
- the "/upload" endpoint is a POST endpoint that inserts an attached text file into the index
- the index is managed by a seperate server using locks, since inserting a document is a mutable operation and flask is multithreaded
- I strongly recommend using a tool like Postman to test the api - there are example screenshots using postman in the
postman_examples
folder
-
streamlit (runs on localhost:8501)
streamlit run streamlit_demo.py
- creates a simple UI using streamlit
- loads text from the documents folder (using
st.cache_resource
, so it only loads once) - provides an input text-box and a button to run the query
- the string response is displayed after it finishes
- want to see this example in action? Check it out here
Each example contains a Dockerfile
. You can run docker build -t my_tag_name .
to build a python3.11-slim docker image inside your desired folder. It ends up being about 600MB.
Inside the Dockerfile
, certain ports are exposed based on which ports the examples need.
When running the image, be sure to include the -p option to access the proper ports (8501, or 5601).
I welcome any suggestions or PRs, or more examples!