Autoscaling is a feature in which the cluster is capable of increasing or decreasing the number of pods as the demand for service response need it.
Module
The Horizontal Pod Autoscaler automatically scales the number of pods in a replication controller, deployment or replica set based on observed CPU utilization.
Overview
At the end of this module, you will :
Learn the format of a YAML Autoscale file
Learn how to manage a Autoscale
Learn the composition of a Autoscale
Prerequisites
Create the directory data/autoscaling in your home folder to manage the YAML file needed in this module.
This module needs the metrics-server to be deployed on the cluster to get the monitoring values like CPU and memory. Ensure that the module is up and running before continuing.
Create
Looks up a Deployment, ReplicaSet, or ReplicationController by name and creates an autoscaler that uses the given resource as a reference. An autoscaler can automatically increase or decrease number of pods deployed within the system as needed.
Horizontal Pod Autoscaler automatically scales the number of pods in a deployment or replica set based on observed CPU, Memory or Custom Metrics utilization depending the API version used.
The Kubernetes basic autoscaling architecture can be schematized like this :
The create command can directly ask the API resource to create an HorizontalPodAutoscaler in command line or create an HorizontalPodAutoscaler object based on a yaml file definition.
Deploy metrics-server on Digital Ocean cluster
By default, metrics-server, the tool that will pull the metrics, is not installed on Digital Ocean cluster, to enable autoscaling, we need to install it.
---apiVersion:rbac.authorization.k8s.io/v1kind:ClusterRolemetadata:name:system:aggregated-metrics-readerlabels:rbac.authorization.k8s.io/aggregate-to-view:"true"rbac.authorization.k8s.io/aggregate-to-edit:"true"rbac.authorization.k8s.io/aggregate-to-admin:"true"rules:- apiGroups: ["metrics.k8s.io"]resources: ["pods","nodes"]verbs: ["get","list","watch"]---apiVersion:rbac.authorization.k8s.io/v1kind:ClusterRoleBindingmetadata:name:metrics-server:system:auth-delegatorroleRef:apiGroup:rbac.authorization.k8s.iokind:ClusterRolename:system:auth-delegatorsubjects:- kind:ServiceAccountname:metrics-servernamespace:kube-system---apiVersion:rbac.authorization.k8s.io/v1kind:RoleBindingmetadata:name:metrics-server-auth-readernamespace:kube-systemroleRef:apiGroup:rbac.authorization.k8s.iokind:Rolename:extension-apiserver-authentication-readersubjects:- kind:ServiceAccountname:metrics-servernamespace:kube-system---apiVersion:apiregistration.k8s.io/v1beta1kind:APIServicemetadata:name:v1beta1.metrics.k8s.iospec:service:name:metrics-servernamespace:kube-systemgroup:metrics.k8s.ioversion:v1beta1insecureSkipTLSVerify:truegroupPriorityMinimum:100versionPriority:100---apiVersion:v1kind:ServiceAccountmetadata:name:metrics-servernamespace:kube-system---apiVersion:apps/v1kind:Deploymentmetadata:name:metrics-servernamespace:kube-systemlabels:k8s-app:metrics-serverspec:selector:matchLabels:k8s-app:metrics-servertemplate:metadata:name:metrics-serverlabels:k8s-app:metrics-serverspec:serviceAccountName:metrics-servervolumes:# mount in tmp so we can safely use from-scratch images and/or read-only containers - name:tmp-diremptyDir: {}containers: - name:metrics-serverimage:k8s.gcr.io/metrics-server-amd64:v0.3.6args: - --cert-dir=/tmp - --secure-port=4443 - --kubelet-insecure-tls - --kubelet-preferred-address-types=InternalIP,ExternalIP,Hostnameports: - name:main-portcontainerPort:4443protocol:TCPsecurityContext:readOnlyRootFilesystem:truerunAsNonRoot:truerunAsUser:1000imagePullPolicy:AlwaysvolumeMounts: - name:tmp-dirmountPath:/tmpnodeSelector:beta.kubernetes.io/os:linux---apiVersion:v1kind:Servicemetadata:name:metrics-servernamespace:kube-systemlabels:kubernetes.io/name:"Metrics-server"kubernetes.io/cluster-service:"true"spec:selector:k8s-app:metrics-serverports: - port:443protocol:TCPtargetPort:main-port---apiVersion:rbac.authorization.k8s.io/v1kind:ClusterRolemetadata:name:system:metrics-serverrules:- apiGroups: - ""resources: - pods - nodes - nodes/stats - namespaces - configmapsverbs: - get - list - watch---apiVersion:rbac.authorization.k8s.io/v1kind:ClusterRoleBindingmetadata:name:system:metrics-serverroleRef:apiGroup:rbac.authorization.k8s.iokind:ClusterRolename:system:metrics-serversubjects:- kind:ServiceAccountname:metrics-servernamespace:kube-system
kubectlapply-fdata/autoscaling/01_php-apache.yaml# version kubectl <1.17kubectlrunphp-apache--image=k8s.gcr.io/hpa-example--requests=cpu=200m--limits=cpu=300m--expose--port=80# Create an Horizontal Pod Autoscaler based on the CPU usagekubectlautoscaledeploymentphp-apache--cpu-percent=50--min=3--max=10
Exercise n°2
Run a sample nginx application exposing port 8080
Create an Horizontal Pod Autoscaler to automatically scale the Deployment if the memory is above 80%.
kubectlapply-fdata/autoscaling/02_deployment.yaml# version kubectl <1.17kubectlrunnginx--image=nginx--requests=memory=500m--limits=memory=1G--expose--port=8080
The get command list the object asked. It could be a single object or a list of multiple objects comma separated. This command is useful to get the status of each object. The output can be formatted to only display some information based on some json search or external tools like tr, sort, uniq.
The default output display some useful information about each services :
Name : the name for the newly created object
Reference : the object managed by the autoscaler, like Pod name, a Deployment name ...
Targets : the metrics defined to autoscale the referenced resource
Minpods : the lower limit for the number of pods that can be set by the autoscaler
Maxpods : the upper limit for the number of pods that can be set by the autoscaler
Replicas : the current replicas number
Age : the age of the object from his creation
Exercise n°1
Get the current HorizontalPodAutoscaler resources in the default namespace.
Stress the Pod created in the previous section and check the HorizontalPoMindAutoscaler associated.
# Connect to the Podkubectlrun-itload-generator--image=busybox/bin/sh# Run a loop bash command in the container to stress the CPUwhiletrue; dowget-q-O-http://php-apache.default.svc.cluster.local; done# Check the Horizontal Pod Autoscaler statuskubectlgethpa
Once an object is running, it is inevitably a need to debug problems or check the configuration deployed.
The describe command display a lot of configuration information about the Horizontal Pod Autoscaler (labels, annotations, etc.) and the scale policy (selector, type, number of pods, ...).
This command is really useful to introspect and debug an object deployed in a cluster.
Exercise n°1
Describe one of the existing Autoscaler in the default namespace.
kubectldescribehorizontalpodautoscalerphp-apache
Name:php-apacheNamespace:defaultLabels:<none>Annotations:<none>CreationTimestamp:Wed,06Feb201910:39:55-0500Reference:Deployment/php-apacheMetrics: ( current/target )resourcecpuonpods (as apercentageofrequest): 0% (0) / 50%Minreplicas:3Maxreplicas:10Conditions:TypeStatusReasonMessage----------------------- AbleToScale True ScaleDownStabilized recent recommendations were higher than current one, applying the highest recent recommendation
ScalingActive True ValidMetricFound the HPA was able to successfully calculate a replica count from cpu resource utilization (percentage of request)
ScalingLimitedFalseDesiredWithinRangethedesiredcountiswithintheacceptablerangeEvents:TypeReasonAgeFromMessage------------------------- Normal SuccessfulRescale 36m horizontal-pod-autoscaler New size: 3; reason: Current number of replicas below Spec.MinReplicas
Normal SuccessfulRescale 31m horizontal-pod-autoscaler New size: 4; reason: cpu resource utilization (percentage of request) above target
Normal SuccessfulRescale 28m horizontal-pod-autoscaler New size: 5; reason: cpu resource utilization (percentage of request) above target)
Normal SuccessfulRescale 2m horizontal-pod-autoscaler New size: 4; reason: All metrics below target
Explain
Kubernetes come with a lot of documentation about his objects and the available options in each one. Those information can be fin easily in command line or in the official Kubernetes documentation.
The explain command allows to directly ask the API resource via the command line tools to display information about each Kubernetes objects and their architecture.
Exercise n°1
Get the documentation of a specific field of a resource.
kubectlexplainhpa.spec
KIND:HorizontalPodAutoscalerVERSION:autoscaling/v1RESOURCE:spec<Object>DESCRIPTION:behaviourofautoscaler.Moreinfo:https://git.k8s.io/community/contributors/devel/api-conventions.md#spec-and-status.specificationofahorizontalpodautoscaler.FIELDS:maxReplicas<integer>-required-upperlimitforthenumberofpodsthatcanbesetbytheautoscaler;cannotbesmallerthanMinReplicas.minReplicas<integer>lowerlimitforthenumberofpodsthatcanbesetbytheautoscaler,default1.scaleTargetRef<Object>-required-referencetoscaledresource; horizontalpodautoscalerwilllearnthecurrentresourceconsumptionandwillsetthedesirednumberofpodsbyusingitsScalesubresource.targetCPUUtilizationPercentage<integer>targetaverageCPUutilization (represented asapercentageofrequestedCPU) over all the pods; ifnotspecifiedthedefaultautoscalingpolicywillbeused.
Add the --recursive flag to display all of the fields at once without descriptions.
Delete
The delete command delete resources by filenames, stdin, resources and names, or by resources and label selector.
Be careful on the deletion of an autoscaling object, this can have effects in the availability of the services associated.
Note that the delete command does NOT do resource version checks, so if someone submits an update to a resource right when you submit a delete, their update will be lost along with the rest of the resource.
Exercise n°1
Delete the previous autoscaling group created in command line.
# Delete the HorizontalPodAutoscalerkubectldeletehpaphp-apache# Delete the Podskubectldeletedeploymentphp-apacheload-generator# Delete the Serviceskubectldeleteservicephp-apache
Module exercise
The purpose of this section is to manage each steps of the lifecycle of an application to better understand each concepts of the Kubernetes course.
The main objective in this module is to understand how to dynamically and automatically manage the number of Pods needed to handle the workload.
For more information about the application used all along the course, please refer to the Exercise App > Voting App link in the left panel.
Based on the principles explain in this module, try by your own to handle this steps. The development of a yaml file is recommended.
The file developed has to be stored in this directory : ~/data/votingapp/10_autoscaling
Manage the HorizontalPodAutoscaler of the worker Pods to :
Ensure that the worker has minimum one Pods
Ensure that the worker has maximum five Pods
Ensure that the Pods is autoscaled when the CPU is above 80%.
Create the HorizontalPodAutoscaler to manage the worker workload.