[feature request] Automatically detect infrastructure anomalies and avoid scheduling pods on abnormal nodes
SimonCqk commented
What would you like to be added:
- Collect anomalous pod states and events, and use them to progressively identify abnormal nodes.
- Avoid scheduling pods on abnormal nodes.
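
To make the two points above concrete, here is a minimal sketch of one possible approach, not a proposed implementation. It assumes that pod failures can be attributed to a node name, that a node counts as abnormal once it accumulates a threshold of failures within a sliding time window, and that pods are steered away with a node affinity `NotIn` expression on the `kubernetes.io/hostname` label. `NodeAnomalyTracker`, `anti_affinity_for`, the threshold, and the window length are all hypothetical names and values chosen for illustration.

```python
from collections import defaultdict, deque
from typing import Deque, Dict, List, Optional


class NodeAnomalyTracker:
    """Hypothetical helper: counts recent pod failures per node."""

    def __init__(self, threshold: int = 3, window_seconds: float = 600.0) -> None:
        # Both defaults are illustrative assumptions, not tuned values.
        self.threshold = threshold
        self.window_seconds = window_seconds
        # node name -> timestamps of failures observed on that node
        self._failures: Dict[str, Deque[float]] = defaultdict(deque)

    def record_failure(self, node: str, timestamp: float) -> None:
        # Assumes failures are recorded in non-decreasing timestamp order.
        self._failures[node].append(timestamp)

    def abnormal_nodes(self, now: float) -> List[str]:
        """Return nodes with at least `threshold` failures inside the window."""
        abnormal = []
        for node, stamps in self._failures.items():
            # Drop failures that have aged out of the sliding window.
            while stamps and now - stamps[0] > self.window_seconds:
                stamps.popleft()
            if len(stamps) >= self.threshold:
                abnormal.append(node)
        return sorted(abnormal)


def anti_affinity_for(nodes: List[str]) -> Optional[dict]:
    """Build a pod `affinity` fragment that excludes the given nodes.

    Assumes each node's `kubernetes.io/hostname` label equals its node name,
    which is common but not guaranteed in every cluster.
    """
    if not nodes:
        return None
    return {
        "nodeAffinity": {
            "requiredDuringSchedulingIgnoredDuringExecution": {
                "nodeSelectorTerms": [
                    {
                        "matchExpressions": [
                            {
                                "key": "kubernetes.io/hostname",
                                "operator": "NotIn",
                                "values": list(nodes),
                            }
                        ]
                    }
                ]
            }
        }
    }
```

A controller could call `record_failure` whenever it observes a failed pod or a warning event tied to a node, then merge the result of `anti_affinity_for(tracker.abnormal_nodes(now))` into the pod template of newly created pods. Tainting or cordoning the node would be an alternative way to achieve the same effect cluster-wide.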
Why is this needed:
- Discover infrastructure problems proactively and make jobs run more robustly.