Here is a jump start solution of how to build a basic chatbot API that:
- Leverages GKE or Cloud Run with Cloud SQL, VertexAI, and pgvector
- Demonstrates connectivity to Cloud SQL using Private Service Connect in a VPC
- Codifies all infrastructure including using Terraform
- Uses Python with asyncpg and FastAPI
- (optional) Supports cross-project setups with Cloud SQL and GKE or Cloud Run in separate projects
This solution is an operationalized version of a previously published colab, Building AI-powered data-driven applications using pgvector, LangChain and LLMs.
This app can be deployed with two different flavors:
- Cloud Run
- Google Kubernetes Engine (GKE)
Cloud Run (README)
To deploy the app using Cloud Run head to the run folder.
GKE (README)
To deploy the app using GKE head to the gke folder.