Investigate CE deployment daily run failures
vatsrahul1001 opened this issue · 1 comment
vatsrahul1001 commented
More context
vatsrahul1001 commented
Tasks were being killed with return code -9, so I checked the resources of our Astro-SDK deployment and found that it was running on an A5 machine (1 CPU, 2 GB). Because we run many DAGs at the same time, that machine was under-resourced and tasks were receiving SIGKILL. All of our other deployments use A10 machines (2 CPUs, 4 GB). I have updated the Astro-SDK deployment to an A10 machine as well, and I no longer see the issue.
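
For anyone reading the return code -9 above: on POSIX systems, Python's `subprocess` reports a negative return code when the child was terminated by a signal, and signal 9 is SIGKILL. The sketch below is only an illustration of that mapping, not code from our deployment; the helper name `describe_return_code` is hypothetical, and the sleeping child process stands in for a task that gets killed from outside.

```python
import signal
import subprocess
import sys


def describe_return_code(code: int) -> str:
    """Translate a subprocess-style return code into a readable description."""
    if code < 0:
        # On POSIX, a negative code means "terminated by signal number -code".
        try:
            return f"killed by {signal.Signals(-code).name}"
        except ValueError:
            return f"killed by unknown signal {-code}"
    return f"exited with status {code}"


# Stand-in for a long-running task: a child process that just sleeps.
child = subprocess.Popen([sys.executable, "-c", "import time; time.sleep(60)"])

# Simulate the task being killed from outside (for example, by the kernel
# when a machine runs out of memory). This is an assumption for illustration.
child.send_signal(signal.SIGKILL)
child.wait()

print(child.returncode, describe_return_code(child.returncode))
```

This requires a POSIX system (Linux or macOS), since SIGKILL is not available on Windows. The return code by itself does not say who sent the signal, so it is still worth checking the deployment's resource usage, as was done here.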