Your private chat UI for OpenaAI/Azure APIs. Deploy anywhere, fill in API Key etc. and get your own ChatGPT.
- Token counter: get number of tokens in the dialog and in the prompt being typed
- Can be seen at the top right corner, counts dialog length + the number of tokens in the promnpt being typed
- Minimal, responsive UI: adapts to small and large screens, can be added to home-screen on mobile
- Azure and OpenAI: Connect to OpenAI APIs either directly (via API key) or via models deployed in Microsoft Azure
- Use it as chat UI for local models that support OpenAI chat API, you can define
base_url
when adding OpenAI models - Can be deployed as a single container: no dependencies on Postgress, all data is stored in local files and encrypted
docker pull ghcr.io/maxim-saplin/py_chat_ui:latest
- Response srtreaming: see how OpneAI is typing back
- Generation cancelation is supported, LLM replies midway and change the prompt
- Model Customization: Users can add, update, and delete AI models with custom settings in UI OR have a mode defined in ENV variables
- Single and Multi user: Anonymous authentication (single user) with not lock screen OR user authentication against a local user DBin a YAML file (pairs of usernames/password hashes).
- Chat Persistence: Dialogs are saved on the server in flies and can be access acros different devices.
- Data encryption: All user data (chat dialogs and model settings) is encrypted using hashed keys (stored in user's server-only/http-only cookies)
- Loosing user creds means loosing stored user data. No pasword change is supported!
- Protection against leaking sotred files. If the attacker can debug the server OR can steal cookies the encryption keys can be exposed
- Environment Configuration: Easy setup with environment variables or
.env
file. - Run and Debug localy: Run the application locally or via Docker, or debug with VSCode
You can configure the app either via setting environment variables or puttin values into .env
file (see .env.template
for an example).
Environment Variable | Description | Default Value |
---|---|---|
OPENAI_API_KEY |
The API key for OpenAI or Azure | None |
API_TYPE |
The type of API to use (OpenAI, Azure, Fake, Empty) | Fake |
API_VERSION |
The version of the API | Fake |
OPENAI_API_BASE |
The base URL for the OpenAI client (e.g. Azure deployment URL or custom model URL) | Fake |
ALIAS |
The alias for the model | Fake auto-reply model (demonstration) |
MODEL |
The name of the model | Fake |
TEMPERATURE |
The temperature setting for the AI model | 0.7 |
DATA_DIR |
The directory where data will be stored | ".data" |
DISABLE_AUTH |
Whether to disable user authentication | "False" |
DISBLE_USER_REG |
Whether to disable user registration | "False" |
Remarks, notes:
- If
API_TYPE
is set toFake
all message will get a static reply, no call to API will be made. For demonstration purposes. IfEmpty
is provided than no model is expected in the environment variables - Providing Azure/OpenAI model credentials (API key, deployment URL, etc.) via environment variables is not the only option. Users can also add their credentials in the Web UI, those creds will be encrypted and stored in
models.dat
file DATA_DIR
- that's where all conversations and model settings will be saved. Each user will have a$username
subfolder.DISABLE_AUTH
- if set to [True], no login page will be shown, and all data will be stored under$DATA_DIR/default_user
.
users.yaml
stored under $DATA_DIR
keeps usernames and hashes that are used to authenticate and allow users in. Make changes to this file in the prod to make adjustments to who and how can access the app.
When $DISBLE_USER_REG
is set to True
the Login page will have Register a new user
link available. Clicking the link will open a sign-up screen where everyone will be able to enter usernamae and password and create a new user record at users.yaml
allowing to make the login with the provided creds. Password is hashed, local data encryption key is created from login/password pair and is stored in HTTP-only cookie. No password reset is possible, lost pasword - lost access and data.
- Generate Credentials:
- Run generate_pass.py.
python generate_pass.py --username <username> --password <password>
- Input a username and password when prompted.
- Copy the output (username and hashed password).
- Edit users.yaml:
- Open users.yaml.
- Add a new entry with the generated username and hashed password.
- Save and Test:
- Save users.yaml.
- Restart the application.
- Log in with the new credentials to test.
- Set OpenAI API credentials as environment variables or within the
.env
file (refer to.env.template
for an example). - Install requirements with
pip3 install -r requirements.txt
. - Run the application using
streamlit run chat.py
. - Alternatively, debug in VSCode by pressing F5 and selecting the "Debug" configuration.
Start Docker Desktop and run command in terminal, after teh cintainer is started you can access the app on localhost:8501
docker build -t py-chat-ui .
docker run -p 8501:8501 py-chat-ui
You can get the public image of the UI, adjust environment varaibles to have the correct base URL (and model name and API key) and get the UI up-and-running and pointing to the self-deployed model
docker pull ghcr.io/maxim-saplin/py_chat_ui:latest
docker run -p 8501:8501 -e DISABLE_AUTH=True -e API_TYPE=OpenAI -e OPENAI_API_BASE=https://mixtral-api.test.com/v1 -e MODEL=mistralai/Mixtral-8x7B-Instruct-v0.1 -e OPENAI_API_KEY=EMPTY ghcr.io/maxim-saplin/py_chat_ui:latest
Open localhost:8501
in browser when the container is started.
- 0.2.19 - upgraded Streamlit, less intrusive tokeniser (larger delays)
- 0.2.18 - new chat CMD+Enter submits prompt, fixed recalculating message tokens when navigating between dialogs
- 0.2.12/17 - chat dialog refactored (hopefuly more stable with slow network/back-ends)
- 0.2.11 - added FavIcon
- 0.2.10 - fixing inconsistent cancelation of generation
- 0.2.9 - updated dockerfile to use Python 3.12, more swift tokenizer (shorter debounce timer when typing), refactorer chat dialog logic (showing ... as a placeholder when OpenAI request is sent), 1 more autotest
- 0.2.8 - on-the-fly tokenizer form in dialog is fixed (was visible due to a bug)
- 0.2.7 - refactored tokenizer in dialog and new chat, better stratup time (module lazy load)
- 0.2.6 - env var model no longer saved to models file
- 0.2.5 - better handling of empty env vars, building/publishing container images for AMD64 and ARM64 architectures
- 0.2.4 - allow defining
base_url
for OpenAI models (e.g. custom model deployments), better error messages for chat API exceptions, New Chat as default starup screen - 0.2.3 - token counter in New Chat, faster token counter in dialog, longer debounce interval for prompt token counter, UI tweaks
- 0.2.2 - preselecting last model used in New Chat, ability to cancel very first request question after New Chat started
- 0.2.1 - fix for model persistance
- 0.2.0 - lazy loading of chat sessions from files, refactored chat session UI, UI tweaks (removing empty space), OpenAI and Streamlit packages updated