Document proposed architecture
rviscomi opened this issue · 5 comments
Describe how we'd serve data from the API endpoint, extracting from BigQuery and updating at the end of each monthly crawl
httparchive.core_web_vitals.technologies is the source of truth and there will be one script for each of the API endpoints that query the data from the table
Hook into the automated pipeline to update API data on Firestore on a monthly basis: https://github.com/HTTPArchive/data-pipeline/blob/main/data-pipeline.workflows.yaml
Changing the API code itself will require a manual deployment script
API security: as needed implement CORS and/or API keys
Firestore
- global instance?
- lives in the project, serverless service
- not like a VM instance
- data stored in us-east
- possible to put a CDN in front of API gateway
@rviscomi @tunetheweb I changed the region for the project to us-east4, which is the default for the project. Now Firestore database, API gateway, and cloud functions are all in the same region. It should work with the performance.
And looking for information about CDN in front of the API, I found this documentation https://cloud.google.com/api-gateway/docs/multi-region-deployment
I tried to test it but I don't have permission to create all necessary resources.