Working with the Cluster Autoscaler as a Standalone Program

Find out how to install, configure, and use the Kubernetes Cluster Autoscaler as a standalone program to automatically resize the managed node pools in a cluster you've created using Kubernetes Engine (OKE).

Using the Kubernetes Cluster Autoscaler as a standalone program rather than as a cluster add-on gives you complete control and responsibility for configuration and ongoing maintenance, including:

Installing a version of the Kubernetes Cluster Autoscaler that is compatible with the version of Kubernetes running on the cluster.
Specifying configuration arguments correctly.
Manually upgrading the Kubernetes Cluster Autoscaler when you upgrade a cluster to a new version of Kubernetes, to ensure the Kubernetes Cluster Autoscaler is compatible with the cluster's new Kubernetes version.

The instructions below describe how to run the Kubernetes Cluster Autoscaler as a standalone program to manage node pools:

Step 1: Setting Up an Instance Principal or Workload Identity Principal to Enable Cluster Autoscaler Access to Node Pools

To manage node pools, the Kubernetes Cluster Autoscaler performs actions on other Oracle Cloud Infrastructure service resources. To perform those actions on OCI service resources, the Kubernetes Cluster Autoscaler uses the credentials of an authorized actor (or principal). You can currently set up the following types of principal to enable the Kubernetes Cluster Autoscaler to perform actions on OCI service resources:

Instance principal: The Kubernetes Cluster Autoscaler uses the identity of the instance on which it is running.
Workload identity principal: The Kubernetes Cluster Autoscaler uses the identity of a workload resource running on a Kubernetes cluster.

Note the use of workload identity principals to enable the Kubernetes Cluster Autoscaler to access OCI services and resources:

is supported with enhanced clusters, but not with basic clusters.
is only supported with Cluster Autoscaler version 1.26 (or later)

Using instance principals to enable access to node pools

You can set up an instance principal to enable the Kubernetes Cluster Autoscaler to perform actions on OCI service resources.

To set up an instance principal:

Log in to the Console.
Create a new compartment-level dynamic group in the compartment to which the cluster belongs, containing the worker nodes (compute instances) in the cluster:
1. Follow the instructions in To create a dynamic group in the IAM documentation, and give the new dynamic group a name (for example, acme-oke-cluster-autoscaler-dyn-grp).
2. Enter a rule that includes the worker nodes in the compartment, in the format:
```
ALL {instance.compartment.id = '<compartment-ocid>'}
```
  where <compartment-ocid> is the OCID of the compartment to which the cluster belongs.
  
  For example:
```
ALL {instance.compartment.id = 'ocid1.compartment.oc1..aaaaaaaa23______smwa'}
```

Create a policy to allow worker nodes to manage node pools:

Follow the instructions in To create a policy in the IAM documentation, and give the new policy a name (for example, acme-oke-cluster-autoscaler-dyn-grp-policy).

Enter a policy statement to allow worker nodes to manage node pools (along with other policy statements related to initializing worker nodes), in the format:

Allow dynamic-group <dynamic-group-name> to manage cluster-node-pools in compartment <compartment-name>
Allow dynamic-group <dynamic-group-name> to manage instance-family in compartment <compartment-name>
Allow dynamic-group <dynamic-group-name> to use subnets in compartment <compartment-name>
Allow dynamic-group <dynamic-group-name> to read virtual-network-family in compartment <compartment-name>
Allow dynamic-group <dynamic-group-name> to use vnics in compartment <compartment-name>
Allow dynamic-group <dynamic-group-name> to inspect compartments in compartment <compartment-name>

where:

<dynamic-group-name> is the name of the dynamic group you created earlier. For example, acme-oke-cluster-autoscaler-dyn-grp. Note that if a dynamic group is not in the default identity domain, prefix the dynamic group name with the identity domain name, in the format dynamic-group '<identity-domain-name>'/'<dynamic-group-name>'. You can also specify the dynamic group using its OCID, in the format dynamic-group id <dynamic-group-ocid>.
<compartment-name> is the name of the compartment to which the cluster belongs. For example, acme-oke-cluster-autoscaler-compartment

For example:

Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to manage cluster-node-pools in compartment acme-oke-cluster-autoscaler-compartment
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to manage instance-family in compartment acme-oke-cluster-autoscaler-compartment
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to use subnets in compartment acme-oke-cluster-autoscaler-compartment
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to read virtual-network-family in compartment acme-oke-cluster-autoscaler-compartment
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to use vnics in compartment acme-oke-cluster-autoscaler-compartment
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to inspect compartments in compartment acme-oke-cluster-autoscaler-compartment

Note

If a node pool belongs to one compartment, and the network resources used by the node pool belong to a different compartment, you have to create policies in both compartments as follows:

In the node pool's compartment, create a policy with policy statements in the following format:

Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to manage cluster-node-pools in compartment <nodepool-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to manage instance-family in compartment <nodepool-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to use subnets in compartment <nodepool-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to use vnics in compartment <nodepool-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to inspect compartments in compartment <nodepool-compartment-name>

In the network resources' compartment, create a policy with policy statements in the following format:

Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to use subnets in compartment <network-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to read virtual-network-family in compartment <network-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to use vnics in compartment <network-compartment-name>
Allow dynamic-group acme-oke-cluster-autoscaler-dyn-grp to inspect compartments in compartment <network-compartment-name>

Using workload identity principals to enable access to node pools

You can set up a workload identity principal to enable the Kubernetes Cluster Autoscaler to perform actions on OCI service resources. Note that you can only use workload identity principals with enhanced clusters.

To set up a workload identity principal:

Obtain the OCID of the cluster (for example, using the Cluster details tab in the Console).
Follow the instructions in Creating a Policy in the IAM documentation, and give the new policy a name (for example, acme-oke-cluster-autoscaler-policy).

Enter policy statements to allow node pool management, in the format:

Allow any-user to manage cluster-node-pools in compartment <compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to manage instance-family in compartment <compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to use subnets in compartment <compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to read virtual-network-family in compartment <compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to use vnics in compartment <compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to inspect compartments in compartment <compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}

where:

<compartment-name> is the name of the compartment to which the cluster belongs. For example, acme-oke-cluster-autoscaler-compartment
<cluster-ocid> is the cluster's OCID that you obtained previously.

For example:

Allow any-user to manage cluster-node-pools in compartment acme-oke-cluster-autoscaler-compartment where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = 'ocid1.cluster.oc1.iad.aaaaaaaa______ska'}
Allow any-user to manage instance-family in compartment acme-oke-cluster-autoscaler-compartment where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = 'ocid1.cluster.oc1.iad.aaaaaaaa______ska'}
Allow any-user to use subnets in compartment acme-oke-cluster-autoscaler-compartment where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = 'ocid1.cluster.oc1.iad.aaaaaaaa______ska'}
Allow any-user to read virtual-network-family in compartment acme-oke-cluster-autoscaler-compartment where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = 'ocid1.cluster.oc1.iad.aaaaaaaa______ska'}
Allow any-user to use vnics in compartment acme-oke-cluster-autoscaler-compartment where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = 'ocid1.cluster.oc1.iad.aaaaaaaa______ska'}
Allow any-user to inspect compartments in compartment acme-oke-cluster-autoscaler-compartment where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = 'ocid1.cluster.oc1.iad.aaaaaaaa______ska'}

Note

If a node pool belongs to one compartment, and the network resources used by the node pool belong to a different compartment, you have to create policies in both compartments as follows:

In the node pool's compartment, create a policy with policy statements in the following format:

Allow any-user to manage cluster-node-pools in compartment <nodepool-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to manage instance-family in compartment <nodepool-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to use subnets in compartment <nodepool-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to use vnics in compartment <nodepool-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to inspect compartments in compartment <nodepool-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}

In the network resources' compartment, create a policy with policy statements in the following format:

Allow any-user to use subnets in compartment <network-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to read virtual-network-family in compartment <network-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to use vnics in compartment <network-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}
Allow any-user to inspect compartments in compartment <network-compartment-name> where ALL {request.principal.type='workload', request.principal.namespace ='kube-system', request.principal.service_account = 'cluster-autoscaler', request.principal.cluster_id = '<cluster-ocid>'}

Step 2: Copy and customize the Cluster Autoscaler configuration file

Step 2a: Copy the configuration file

In a text editor, create a file called cluster-autoscaler.yaml with the following content:

---
apiVersion: v1
kind: ServiceAccount
metadata:
  labels:
    k8s-addon: cluster-autoscaler.addons.k8s.io
    k8s-app: cluster-autoscaler
  name: cluster-autoscaler
  namespace: kube-system
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: cluster-autoscaler
  labels:
    k8s-addon: cluster-autoscaler.addons.k8s.io
    k8s-app: cluster-autoscaler
rules:
  - apiGroups: [""]
    resources: ["events", "endpoints"]
    verbs: ["create", "patch"]
  - apiGroups: [""]
    resources: ["pods/eviction"]
    verbs: ["create"]
  - apiGroups: [""]
    resources: ["pods/status"]
    verbs: ["update"]
  - apiGroups: [""]
    resources: ["endpoints"]
    resourceNames: ["cluster-autoscaler"]
    verbs: ["get", "update"]
  - apiGroups: [""]
    resources: ["nodes"]
    verbs: ["watch", "list", "get", "patch", "update"]
  - apiGroups: [""]
    resources:
      - "pods"
      - "services"
      - "replicationcontrollers"
      - "persistentvolumeclaims"
      - "persistentvolumes"
    verbs: ["watch", "list", "get"]
  - apiGroups: ["extensions"]
    resources: ["replicasets", "daemonsets"]
    verbs: ["watch", "list", "get"]
  - apiGroups: ["policy"]
    resources: ["poddisruptionbudgets"]
    verbs: ["watch", "list"]
  - apiGroups: ["apps"]
    resources: ["statefulsets", "replicasets", "daemonsets"]
    verbs: ["watch", "list", "get"]
  - apiGroups: ["storage.k8s.io"]
    resources: ["storageclasses", "csinodes", "volumeattachments"]
    verbs: ["watch", "list", "get"]
  - apiGroups: ["batch", "extensions"]
    resources: ["jobs"]
    verbs: ["get", "list", "watch", "patch"]
  - apiGroups: ["coordination.k8s.io"]
    resources: ["leases"]
    verbs: ["create"]
  - apiGroups: ["coordination.k8s.io"]
    resourceNames: ["cluster-autoscaler"]
    resources: ["leases"]
    verbs: ["get", "update"]
  - apiGroups: [""]
    resources: ["namespaces"]
    verbs: ["watch", "list"]
  - apiGroups: ["storage.k8s.io"]
    resources: ["csidrivers", "csistoragecapacities"]
    verbs: ["watch", "list"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: cluster-autoscaler
  namespace: kube-system
  labels:
    k8s-addon: cluster-autoscaler.addons.k8s.io
    k8s-app: cluster-autoscaler
rules:
  - apiGroups: [""]
    resources: ["configmaps"]
    verbs: ["create","list","watch"]
  - apiGroups: [""]
    resources: ["configmaps"]
    resourceNames: ["cluster-autoscaler-status", "cluster-autoscaler-priority-expander"]
    verbs: ["delete", "get", "update", "watch"]

---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: cluster-autoscaler
  labels:
    k8s-addon: cluster-autoscaler.addons.k8s.io
    k8s-app: cluster-autoscaler
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: cluster-autoscaler
subjects:
  - kind: ServiceAccount
    name: cluster-autoscaler
    namespace: kube-system

---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: cluster-autoscaler
  namespace: kube-system
  labels:
    k8s-addon: cluster-autoscaler.addons.k8s.io
    k8s-app: cluster-autoscaler
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: Role
  name: cluster-autoscaler
subjects:
  - kind: ServiceAccount
    name: cluster-autoscaler
    namespace: kube-system

---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: cluster-autoscaler
  namespace: kube-system
  labels:
    app: cluster-autoscaler
spec:
  replicas: 3
  selector:
    matchLabels:
      app: cluster-autoscaler
  template:
    metadata:
      labels:
        app: cluster-autoscaler
      annotations:
        prometheus.io/scrape: 'true'
        prometheus.io/port: '8085'
    spec:
      serviceAccountName: cluster-autoscaler
      containers:
        - image: iad.ocir.io/oracle/oci-cluster-autoscaler:{{ image tag }}
          name: cluster-autoscaler
          resources:
            limits:
              cpu: 100m
              memory: 300Mi
            requests:
              cpu: 100m
              memory: 300Mi
          command:
            - ./cluster-autoscaler
            - --v=4
            - --stderrthreshold=info
            - --cloud-provider=oci
            - --max-node-provision-time=25m
            - --nodes=1:5:{{ node pool ocid 1 }}
            - --nodes=1:5:{{ node pool ocid 2 }}
            - --scale-down-delay-after-add=10m
            - --scale-down-unneeded-time=10m
            - --unremovable-node-recheck-timeout=5m
            - --balance-similar-node-groups
            - --balancing-ignore-label=displayName
            - --balancing-ignore-label=hostname
            - --balancing-ignore-label=internal_addr
            - --balancing-ignore-label=oci.oraclecloud.com/fault-domain
          imagePullPolicy: "Always"

Save the cluster-autoscaler.yaml file you created.

Step 2b: Specify the node pools to manage

In the cluster-autoscaler.yaml file you created, specify the cluster's node pools that you want the Kubernetes Cluster Autoscaler to manage.

You can specify that you want the Kubernetes Cluster Autoscaler to manage a single node pool, or multiple node pools. The recommendation is to always have at least one node pool that is not managed by the Kubernetes Cluster Autoscaler to run critcal cluster add-ons, and to ensure the Kubernetes Cluster Autoscaler does not scale down the nodes on which it is running. Also note that it is your responsibility to manually scale any node pools you do not specify in the configuration file.

You specify the node pools that you want the Kubernetes Cluster Autoscaler to manage in one of two ways:

You can explicitly specify each node pool to manage, using the --nodes parameter to specify each node pool's OCID.
You can specify that the Kubernetes Cluster Autoscaler is to discover which node pool (or node pools) to manage, using the --node-group-auto-discovery parameter to specify the tags to match. You can specify both defined tags and freeform tags. For more information about adding tags to node pools, see Applying Tags to Node Pools. The Kubernetes Cluster Autoscaler manages node pools with tags that match the tags you specify. Note that the node-group-auto-discovery parameter is supported with Cluster Autoscaler version 1.30.3, version 1.31.1, and version 1.32.0, and later.

Note that you cannot specify both the --nodes parameter and the --node-group-auto-discovery parameter in the same cluster-autoscaler.yaml file. The two parameters are mutually exclusive alternatives.

To use the --nodes parameter to explicitly specify which node pools to manage:

In the cluster-autoscaler.yaml file you created, locate the following template line:
```
- --nodes=1:5:{{ node pool ocid 1 }}
```
The --nodes parameter has the following format:
```
--nodes=<min-nodes>:<max-nodes>:<nodepool-ocid>
```
where:
- <min-nodes> is the minimum number of nodes allowed in the node pool. The Kubernetes Cluster Autoscaler will not reduce the number of nodes below this number.
- <max-nodes> is the maximum number of nodes allowed in the node pool. The Kubernetes Cluster Autoscaler will not increase the number of nodes above this number. Make sure the maximum number of nodes you specify does not exceed the tenancy limits for the worker node shape defined for the node pool.
- <nodepool-ocid> is the OCID of the node pool to manage.
Change the value of the --nodes parameter to specify:
- The minimum number of nodes allowed in the node pool. For example, 1.
- The maximum number of nodes allowed in the node pool. For example, 5.
- The OCID of the node pool you want the Kubernetes Cluster Autoscaler to manage.
For example:
```
--nodes=1:5:ocid1.nodepool.oc1.iad.aaaaaaaaaeydq...
```
If you only want the Kubernetes Cluster Autoscaler to manage one node pool in the cluster, locate the following line in the cluster-autoscaler.yaml file and remove it:
```
- --nodes=1:5:{{ node pool ocid 2 }}
```
If you want the Kubernetes Cluster Autoscaler to manage a second node pool in the cluster, locate the following line in the cluster-autoscaler.yaml file and set appropriate values for the --nodes parameter:
```
- --nodes=1:5:{{ node pool ocid 2 }}
```
If you want the Kubernetes Cluster Autoscaler to manage more node pools, insert additional --nodes parameters in the cluster-autoscaler.yaml file and set appropriate values for them.
Save the cluster-autoscaler.yaml file.

To use the --node-group-auto-discovery parameter to specify that the Kubernetes Cluster Autoscaler is to discover which node pools to manage:

In the cluster-autoscaler.yaml file you created, locate the following template line:
```
- --nodes=1:5:{{ node pool ocid 1 }}
```
Delete the entire line specifying the --nodes parameter, and replace it with the following line:
```
- --node-group-auto-discovery=clusterId:{{ cluster ocid }},compartmentId:{{ compartment ocid }},nodepoolTags:{{ tagKey1 }}={{ tagValue1 }}&{{ tagKey2 }}={{ tagValue2 }},min:{{ min }},max:{{ max }}
```
The --node-group-auto-discovery parameter has the following format:
```
--node-group-auto-discovery=clusterId:{{<cluster-ocid>}},compartmentId:{{<compartment-ocid>}},nodepoolTags:{{<tagKey1>}}={{<tagValue1>}}&{{<tagKey2>}}={{<tagValue2>}},min:{{<min-nodes>}},max:{{<max-nodes>}}
```
where:
- <cluster-ocid> is the cluster in which to run the Kubernetes Cluster Autoscaler.
- <compartment-ocid> is the OCID of the compartment in which the node pool is located.
- {{<tagKey1>}}={{<tagValue1>}} specifies the name of the first tag to match, and the value of that tag to match.
- {{<tagKey2>}}={{<tagValue2>}} optionally specifies the name of a second tag to match, and the value of that tag to match. You can specify as many tags as required (you are not limited to two). If you specify multiple tags, then all tags have to match.
- <min-nodes> is the minimum number of nodes allowed in the node pool. The Kubernetes Cluster Autoscaler will not reduce the number of nodes below this number.
- <max-nodes> is the maximum number of nodes allowed in the node pool. The Kubernetes Cluster Autoscaler will not increase the number of nodes above this number. Make sure the maximum number of nodes you specify does not exceed the tenancy limits for the worker node shape defined for the node pool.
Change the value of the --node-group-auto-discovery parameter to specify:
- The cluster in which to run the Kubernetes Cluster Autoscaler.
- The OCID of the compartment in which the node pool is located.
- One or more tag names and tag values to match.
- The minimum number of nodes allowed in the node pool. For example, 1.
- The maximum number of nodes allowed in the node pool. For example, 5.
For example:
```
--node-group-auto-discovery=clusterId:ocid1.cluster.oc1.iad.aaaaaaaa______ixq,compartmentId:ocid1.compartment.oc1..aaaaaaaa______t7a,nodepoolTags:managedby=ca,min:1,max:5
```
Locate the following line in the cluster-autoscaler.yaml file and remove it:
```
- --nodes=1:5:{{ node pool ocid 2 }}
```
If you want the Kubernetes Cluster Autoscaler to manage more node pools, in different compartments, or with different tag names and tag values, or with different minimum and maximum numbers of allowed nodes, insert additional --node-group-auto-discovery parameters in the cluster-autoscaler.yaml file and set appropriate values for them.
For example:
```
--node-group-auto-discovery=clusterId:ocid1.cluster.oc1.iad.aaaaaaaa______ixq,compartmentId:ocid1.compartment.oc1..aaaaaaaa______t7a,nodepoolTags:managedby=ca,min:1,max:5
--node-group-auto-discovery=clusterId:ocid1.cluster.oc1.iad.aaaaaaaa______ixq,compartmentId:ocid1.compartment.oc1..aaaaaaaa______jyv,nodepoolTags:managedby=ca,min:2,max:6
```
Save the cluster-autoscaler.yaml file.

Step 2c: Include additional configuration settings

In the cluster-autoscaler.yaml file you created, add environment variables to specify how you have set up the Kubernetes Cluster Autoscaler to access OCI services and resources:

If you have set up an instance principal to enable the Kubernetes Cluster Autoscaler to access OCI services and resources, after the line imagePullPolicy: "Always" at the end of the file, add the following:

          env:
          - name: OKE_USE_INSTANCE_PRINCIPAL
            value: "true"
          - name: OCI_SDK_APPEND_USER_AGENT
            value: "oci-oke-cluster-autoscaler"

For example:

...
          imagePullPolicy: "Always"
          env:
          - name: OKE_USE_INSTANCE_PRINCIPAL
            value: "true"
          - name: OCI_SDK_APPEND_USER_AGENT
            value: "oci-oke-cluster-autoscaler"

If you have set up a workload identity principal to enable the Kubernetes Cluster Autoscaler to access OCI services and resources, after the line imagePullPolicy: "Always" at the end of the file, add the following:

          env:
          - name: OKE_USE_INSTANCE_PRINCIPAL
            value: "false"
          - name: OCI_USE_WORKLOAD_IDENTITY
            value: "true"
          - name: OCI_RESOURCE_PRINCIPAL_VERSION
            value: "2.2"
          - name: OCI_RESOURCE_PRINCIPAL_REGION
            value: "<cluster-region>"
          - name: OCI_SDK_APPEND_USER_AGENT
            value: "oci-oke-cluster-autoscaler"

where <cluster-region> is the region in which the cluster is located.

For example:

...
          imagePullPolicy: "Always"
          env:
          - name: OKE_USE_INSTANCE_PRINCIPAL
            value: "false"
          - name: OCI_USE_WORKLOAD_IDENTITY
            value: "true"
          - name: OCI_RESOURCE_PRINCIPAL_VERSION
            value: "2.2"
          - name: OCI_RESOURCE_PRINCIPAL_REGION
            value: "us-phoenix-1"
          - name: OCI_SDK_APPEND_USER_AGENT
            value: "oci-oke-cluster-autoscaler"

In the cluster-autoscaler.yaml file you created, confirm that the --cloud-provider parameter is set correctly for the version of Kubernetes running on the cluster. By default, the parameter assumes the cluster is running Kubernetes version 1.27 or later (or 1.23 or earlier) and is set to oci. If the cluster is running Kubernetes version 1.26, 1.25, or 1.24, change the value of the --cloud-provider parameter to oci-oke:
1. In the cluster-autoscaler.yaml file, locate the following line:
```
- --cloud-provider=oci
```
2. If the cluster is running Kubernetes version 1.26, 1.25, or 1.24, change the value of the --cloud-provider parameter to oci-oke :
```
- --cloud-provider=oci-oke
```
3. Save the cluster-autoscaler.yaml file.

In the cluster-autoscaler.yaml file you created, change the image path of the Kubernetes Cluster Autoscaler image to download from Oracle Cloud Infrastructure Registry. Images are available in a number of regions. For the best performance, choose the region closest to the one where the cluster is deployed:

In the cluster-autoscaler.yaml file, locate the following template line:
```
- image: iad.ocir.io/oracle/oci-cluster-autoscaler:{{ image tag }}
```

Change the image path to one of the following, according to the location and Kubernetes version of the cluster in which to run the Kubernetes Cluster Autoscaler:


Image Location	Kubernetes Version	Image Path
Germany Central (Frankfurt)	Kubernetes 1.31	fra.ocir.io/oracle/oci-cluster-autoscaler:1.31.3-2
Germany Central (Frankfurt)	Kubernetes 1.32	fra.ocir.io/oracle/oci-cluster-autoscaler:1.32.2-2
Germany Central (Frankfurt)	Kubernetes 1.33	fra.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
Germany Central (Frankfurt)	Kubernetes 1.34	fra.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
UK South (London)	Kubernetes 1.31	lhr.ocir.io/oracle/oci-cluster-autoscaler:1.31.3-2
UK South (London)	Kubernetes 1.32	lhr.ocir.io/oracle/oci-cluster-autoscaler:1.32.2-2
UK South (London)	Kubernetes 1.33	lhr.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
UK South (London)	Kubernetes 1.34	lhr.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
US East (Ashburn)	Kubernetes 1.31	iad.ocir.io/oracle/oci-cluster-autoscaler:1.31.3-2
US East (Ashburn)	Kubernetes 1.32	iad.ocir.io/oracle/oci-cluster-autoscaler:1.32.2-2
US East (Ashburn)	Kubernetes 1.33	iad.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
US East (Ashburn)	Kubernetes 1.34	iad.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
US West (Phoenix)	Kubernetes 1.31	phx.ocir.io/oracle/oci-cluster-autoscaler:1.31.3-2
US West (Phoenix)	Kubernetes 1.32	phx.ocir.io/oracle/oci-cluster-autoscaler:1.32.2-2
US West (Phoenix)	Kubernetes 1.33	phx.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3
US West (Phoenix)	Kubernetes 1.34	phx.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3

For example, if you want to run the Kubernetes Cluster Autoscaler in a Kubernetes 1.33 cluster located in the UK South region, specify the following image:

- image: lhr.ocir.io/oracle/oci-cluster-autoscaler:1.33.0-3

Tip

If you want to deploy the Kubernetes Cluster Autoscaler on a Kubernetes cluster that is not in the same region as any of the Oracle repositories containing Cluster Autoscaler images, we recommend you push the image to a repository that is in the same region as the cluster, as follows:

i. Pull the image from an Oracle repository using the docker pull command. See Pulling Images Using the Docker CLI.

ii. Tag the image (using the docker tag command), and then push the image to a repository in Oracle Cloud Infrastructure Registry that is in the same region as the cluster in which you want to run the Kubernetes Cluster Autoscaler (using the docker push command). See Pushing Images Using the Docker CLI.

iii. Specify the location of the image in the cluster-autoscaler.yaml file.

Note

If you want to deploy the Kubernetes Cluster Autoscaler on a Kubernetes cluster where you have enabled image verification, do not simply specify an image path from one of the Oracle repositories in the cluster-autoscaler.yaml file. Instead, do the following:

i. Pull the image from an Oracle repository using the docker pull command. See Pulling Images Using the Docker CLI.

iii. Sign the image using a master key and key version in the Vault service, creating an image signature. See Signing Images for Security.

iv. Specify the location of the signed image in the cluster-autoscaler.yaml file. Reference the image using the image digest rather than the image tag (see Enforcing the Use of Signed Images from Registry).

Save the cluster-autoscaler.yaml file.

In the cluster-autoscaler.yaml file you created, confirm that the default values of the CPU and memory limit parameters are sufficient for the number of node pools that you want the Kubernetes Cluster Autoscaler to manage. The default limits are relatively low, so consider increasing the limits if you want the Kubernetes Cluster Autoscaler to manage a large number of node pools. Note that it is your responsibility to set the limits to suitable values.
1. In the cluster-autoscaler.yaml file, locate the following lines:
```
          resources:
            limits:
              cpu: 100m
              memory: 300Mi
```
2. Set the CPU and memory limits to values that are appropriate for the number of node pools that you want the the Kubernetes Cluster Autoscaler to manage. For example:
```
          resources:
            limits:
              cpu: 200m
              memory: 600Mi
```
3. Save the cluster-autoscaler.yaml file.
In the cluster-autoscaler.yaml file you created, specify other parameters for the Kubernetes Cluster Autoscaler. For information about the parameters you can set, see Supported Kubernetes Cluster Autoscaler Parameters.
Save and close the cluster-autoscaler.yaml file.

Step 3: Deploy the Kubernetes Cluster Autoscaler in the cluster and confirm successful deployment

If you haven't already done so, follow the steps to set up the cluster's kubeconfig configuration file and (if necessary) set the KUBECONFIG environment variable to point to the file. Note that you must set up your own kubeconfig file. You cannot access a cluster using a kubeconfig file that a different user set up. See Setting Up Cluster Access.
Deploy the Kubernetes Cluster Autoscaler on the cluster by entering:
```
kubectl apply -f cluster-autoscaler.yaml
```
View the Kubernetes Cluster Autoscaler logs to confirm that it was successfully deployed and is currently monitoring the workload of node pools in the cluster, by entering:
```
kubectl -n kube-system logs -f deployment.apps/cluster-autoscaler
```
Identify which one of the three Kubernetes Cluster Autoscaler pods defined in the cluster-autoscaler.yaml file is currently performing actions, by entering:
```
kubectl -n kube-system get lease
```
Obtain a high-level view of the Kubernetes Cluster Autoscaler's state from the configmap in the kube-system namespace, by entering:
```
kubectl -n kube-system get cm cluster-autoscaler-status -oyaml
```

Step 4: View the Scaling Operation

You can watch the Kubernetes Cluster Autoscaler you have deployed as it automatically scales worker nodes in a node pool. To make the scaling operation more obvious, consider the following suggestions (note these are for observation purposes only, and might be contrary to recommendations shown in Recommendations when using the Kubernetes Cluster Autoscaler in Production Environments):

Observe a cluster that has a single node pool (the node pool being managed by the Kubernetes Cluster Autoscaler).
If the cluster you want to observe has more than one node pool, restrict pods to running on nodes on the single node pool being managed by the Kubernetes Cluster Autoscaler. See Assigning Pods to Nodes in the Kubernetes documentation.
Start with one node in the node pool being managed by the Kubernetes Cluster Autoscaler.
In the Kubernetes Cluster Autoscaler configuration file, you specify the maximum number of nodes allowed in the node pool. Make sure the maximum number of nodes you specify does not exceed the tenancy limit for the worker node shape defined for the node pool.

To view the Kubernetes Cluster Autoscaler automatically scaling worker nodes:

Confirm the current total number of worker nodes in the cluster by entering:
```
kubectl get nodes
```

Define a sample Nginx application by creating a file called nginx.yaml in a text editor, with the following content:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: nginx-deployment
spec:
  selector:
    matchLabels:
      app: nginx
  replicas: 2
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - name: nginx
        image: nginx:latest
        ports:
        - containerPort: 80
        resources:
          requests:
            memory: "500Mi"

Notice that a resource request limit has been set.

Deploy the sample application by entering:
```
kubectl create -f nginx.yaml
```
Increase the number of pods in the deployment to 100 (from 2) by entering:
```
kubectl scale deployment nginx-deployment --replicas=100
```
The Kubernetes Cluster Autoscaler now adds worker nodes to the node pool to meet the increased workload.
Observe the status of the deployment by entering:
```
kubectl get deployment nginx-deployment --watch
```
After a few minutes, view the increased total number of worker nodes in the cluster by entering:
```
kubectl get nodes
```
Note that the number of worker nodes that you see will depend on the worker node shape and the maximum number of nodes specified in the Kubernetes Cluster Autoscaler configuration file.

Step 5: Clean Up

Delete the sample Nginx application by entering:
```
kubectl delete deployment nginx-deployment
```
After ten minutes, confirm that the worker nodes have reduced to the original number, by entering:
```
kubectl get nodes
```

Note that after deleting the sample Nginx application and waiting, you might see fewer worker nodes but still more than the original number. This is probably because kube-system pods have been scheduled to run on those nodes. kube-system pods can prevent the Kubernetes Cluster Autoscaler from removing nodes because the Autoscaler's skip-nodes-with-system-pods parameter is set to true by default.

Oracle Cloud Infrastructure Documentation

Working with the Cluster Autoscaler as a Standalone Program

Step 1: Setting Up an Instance Principal or Workload Identity Principal to Enable Cluster Autoscaler Access to Node Pools

Using instance principals to enable access to node pools

Using workload identity principals to enable access to node pools

Step 2: Copy and customize the Cluster Autoscaler configuration file

Step 2a: Copy the configuration file

Step 2b: Specify the node pools to manage

Step 2c: Include additional configuration settings

Step 3: Deploy the Kubernetes Cluster Autoscaler in the cluster and confirm successful deployment

Step 4: View the Scaling Operation

Step 5: Clean Up