This is a Python application designed to be deployed as a Docker container on RunPod.io.
app.py
- Flask app that handles HTTP requestsrunpod_handler.py
- Handler for RunPod serverless functionsDockerfile
- Container definitionrequirements.txt
- Python dependenciesenvironment.yml
- Conda environment configuration
-
Install dependencies:
pip install -r requirements.txt
-
Test the RunPod handler:
python runpod_handler.py
-
Create and activate the conda environment:
conda env create -f environment.yml conda activate runpod-app
-
Test the RunPod handler:
python runpod_handler.py
# Build Docker image
docker build -t sschat/llm-frontend:main .
# Run container locally
docker run -p 8000:8000 sschat/llm-frontend:main
- Push your Docker image to a container registry (Docker Hub, GitHub Container Registry, etc.)
- Create a new pod on RunPod.io using your container image
- Configure the pod with appropriate resources for your workload
GET /healthcheck
- Health check endpointPOST /api
- Main API endpoint for processing requests