Request-based autoscaling in Kubernetes: scaling to zero and back
TL;DR: In this session, you will learn how to monitor the HTTP requests to your apps in Kubernetes and how to define autoscaling rules to increase and decrease replicas for your workloads.
Reducing infrastructure costs boils down to turning apps off when you don’t use them. The challenge is figuring out how to turn them back on automatically when you need them. Take a Kubernetes dev cluster as an example: you might not want to run any services over the weekend. In this webinar, Salman will demonstrate live how to use KEDA and the HTTP scaler to intercept and monitor the requests to your services and scale your pods accordingly. In the process, you will discover how to scale to zero and create pods only when you need them.
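
To make the idea of request-based scaling concrete, here is a minimal Python sketch of the kind of decision an autoscaler makes: no pending requests means zero replicas, and otherwise the replica count grows with the number of in-flight requests. This illustrates the general principle only and is not KEDA's actual implementation; the function name `desired_replicas` and its parameters (`pending_requests`, `target_per_replica`, `max_replicas`) are hypothetical.

```python
import math


def desired_replicas(pending_requests: int, target_per_replica: int, max_replicas: int) -> int:
    """Return how many replicas to run for the current number of in-flight requests.

    Hypothetical helper for illustration; a real autoscaler also applies
    cooldown windows and smoothing before changing the replica count.
    """
    if target_per_replica <= 0:
        raise ValueError("target_per_replica must be positive")
    if pending_requests <= 0:
        # Nothing is waiting: scale to zero so no pods are running.
        return 0
    # Enough replicas that each handles at most target_per_replica requests,
    # capped at the configured maximum.
    return min(max_replicas, math.ceil(pending_requests / target_per_replica))
```

With a target of 100 requests per replica and a cap of 10, an idle weekend yields zero replicas, the first incoming request brings one pod back, and 250 concurrent requests ask for three.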