Crane: Cloud Resource Analytics and Economics

Crane: Cloud Resource Analytics and Economics
- Architecture
- Features

Architecture

Crane (FinOps Crane) is an opensource project which manages cloud resource on Kubernetes stack, it is inspired by FinOps concepts. Goal of Crane is to provide an one-stop shop project to help Kubernetes users to save cloud resource usage with a rich set of functionalities:

Resource Metrics Prediction based on monitoring data
Cost visibility including:
- Cost allocation, cost and usage virtualization
- Waste identification
- Idle resource collection and reallocation
Usage & Cost Optimization including:
- Enhanced scheduling which optimized for better resource utilization
- Intelligent Scaling based on prediction result
- Cost Optimization based on better billing rate
QoS Ensurance based on Pod PriorityClass

Features

Time Series Prediction

Knowing the future makes things easier for us.

Many businesses are naturally cyclical in time series, especially for those that directly or indirectly serve "people". This periodicity is determined by the regularity of people’s daily activities. For example, people are accustomed to ordering take-out at noon and in the evenings; there are always traffic peaks in the morning and evening; even for services that don't have such obvious patterns, such as searching, the amount of requests at night is much lower than that during business hours. For applications related to this kind of business, it is a natural idea to infer the next day's metrics from the history data of the past few days, or to infer the coming Monday's access traffic from the data of last Monday. With predicted metrics or traffic patterns in the next 24 hours, we can better manage our application instances, stabilize our system, and meanwhile, reduce the cost.

Crane predictor fetches historical metric data for the monitoring system, such as Prometheus, and identifies the time series that are predictable, for example, system cpu load, memory footprint, application's user traffic, etc. Then it outputs the prediction results, which can be consumed by other crane components, like Effective HorizontalPodAutoscaler and Analysis as a Service. It's also straightforward to apply the prediction results in user's applications.

Please see this document to learn more.

Effective HorizontalPodAutoscaler

EffectiveHorizontalPodAutoscaler helps you manage application scaling in an easy way. It is compatible with native HorizontalPodAutoscaler but extends more features. EffectiveHorizontalPodAutoscaler supports prediction-driven autoscaling that supported by TimeSeriesPrediction. User can forecast the incoming peak flow and scale up their application ahead, also user can know when the peak flow will end and scale down their application gracefully. Besides, EffectiveHorizontalPodAutoscaler also defines several scale strategies to support different scaling scenarios.

Reliability: Guarantee both scalability and availability
Responsiveness: Scale up fast enough to successfully handle the increase in workload
Observability: Support Preview mod and automatic observe replicas

Please see this document to learn more.

Analysis as a Services

Analysis Service give you recommendations about cost optimize. It scans your cluster resources such as Deployment, StatefulSet and provide variety strategies to analysis the resource then recommend how to optimize it. Analysis and Recommendation are CustomResourceDefinition that can integration with your own systems.

Here we provide two Analysis Services:

ResourceRecommend: Recommend container request & limit resources based on historic metrics.
Effective HPARecommend: Recommend which workloads are suitable for autoscaling and provide optimized configurations such as minReplicas, maxReplicas.