Placement Policy Scheduler Plugins

Most of cloud environments today provides cluster admins with ephemeral nodes (VMs). These nodes typically cost significantly less but they offer less reliability than their regular counterpart. Cluster admins are often torn between the choice of cost and reliability because of the innate inability of the default Kubernetes scheduler to place some of a specific workload pods on these nodes. Having the entire workload on ephemeral nodes risks the reliability of the workload when the cloud environment stops these nodes.

This scheduler enables cluster admins to offload some configurable percentage of their workloads on these nodes enabling them to decrease the cost of running these pods without affecting its reliability. The scheduler is implemented using the scheduler framework with two additional plugins. The scheduler will run side by side with existing kubernetes scheduler and users will have to specifically specify this scheduler to use for their workloads that perform the following:

A scorer plugin implemented with that will be used in case “best effort” policy enforcement.
- Extension points implemented: PreScore and Score
A filter plugin that will be used in case “force” policy enforcement.
- Extension points implemented: PreFilter and Filter

For more detail, see the design document.

NOTE: This code is in ALPHA status. It is not considered ready for production use.

Quick Start

Install

The container images for the scheduler plugin is available in the github container registry.

Quick Install

kubectl apply -f https://raw.githubusercontent.com/Azure/placement-policy-scheduler-plugins/main/deploy/kube-scheduler-configuration.yml

Result

customresourcedefinition.apiextensions.k8s.io/placementpolicies.placement-policy.scheduling.x-k8s.io created
configmap/pp-scheduler-config created
clusterrole.rbac.authorization.k8s.io/pp-plugins-scheduler created
clusterrolebinding.rbac.authorization.k8s.io/pp-plugins-scheduler created
rolebinding.rbac.authorization.k8s.io/pp-plugins-scheduler-as-kube-scheduler created
clusterrolebinding.rbac.authorization.k8s.io/pp-plugins-scheduler-as-kube-scheduler created
serviceaccount/pp-plugins-scheduler created
deployment.apps/pp-plugins-scheduler created

Helm

helm repo add placement-policy-scheduler-plugins https://azure.github.io/placement-policy-scheduler-plugins/charts
helm repo update

helm install -n kube-system [RELEASE_NAME] placement-policy-scheduler-plugins/placement-policy-scheduler-plugins

Example config

apiVersion: placement-policy.scheduling.x-k8s.io/v1alpha1
kind: PlacementPolicy
metadata:
  name: besteffort-must
spec:
  weight: 100
  enforcementMode: BestEffort
  podSelector:
    matchLabels:
      app: nginx
  nodeSelector:
    matchLabels:
      node: want
  policy:
    action: Must
    targetSize: 40%

enforcementMode: specifies how the policy will be enforced during scheduler. Values allowed for this field are:
- BestEffort (default): the policy will be enforced as best effort (scorer mode).
- Strict: the policy will be forced during scheduling.
nodeSelector: selects the nodes where the placement policy will apply on according to action.
podSelector: identifies which pods this placement policy will apply on
action: policy placement action that carries the following possible values:
- Must(default): based on the rule below pods must be placed on nodes selected by node selector MustNot: based on the rule pods
- MustNot be placed nodes selected by node selector'
targetSize: the number or percent of pods that can or cannot be placed on the node.
weight: allows the engine to decide which policy to use when pods match multiple policies.

Demo

1. Create a kind cluster with the following config

cat <<EOF | kind create cluster --name placement-policy --config=-
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
- role: control-plane
- role: worker
  kubeadmConfigPatches:
    - |
      kind: JoinConfiguration
      nodeRegistration:
        kubeletExtraArgs:
          node-labels: "node=want"
- role: worker
  kubeadmConfigPatches:
    - |
      kind: JoinConfiguration
      nodeRegistration:
        kubeletExtraArgs:
          node-labels: "node=want"
- role: worker
  kubeadmConfigPatches:
    - |
      kind: JoinConfiguration
      nodeRegistration:
        kubeletExtraArgs:
          node-labels: "node=unwant"
EOF

Output

Creating cluster "placement-policy" ...
 ✓ Ensuring node image (kindest/node:v1.21.1) 🖼
 ✓ Preparing nodes 📦 📦 📦 📦
 ✓ Writing configuration 📜
 ✓ Starting control-plane 🕹️
 ✓ Installing CNI 🔌
 ✓ Installing StorageClass 💾
 ✓ Joining worker nodes 🚜
Set kubectl context to "kind-placement-policy"
You can now use your cluster with:

kubectl cluster-info --context kind-placement-policy

Have a question, bug, or feature request? Let us know! https://kind.sigs.k8s.io/#community 🙂

The same node selector node: want will be used as node label for kind-worker and kind-worker2

Node labels

➜ kubectl get nodes -o=custom-columns="NAME:.metadata.name,LABEL:.metadata.labels['node']"
NAME                 LABEL
kind-control-plane   <none>
kind-worker          want
kind-worker2         want
kind-worker3         unwant

2. Follow the Install section to deploy placement-policy-scheduler-plugins as a secondary scheduler

3. Deploy a `PlacementPolicy` CRD

kubectl apply -f https://raw.githubusercontent.com/Azure/placement-policy-scheduler-plugins/main/examples/v1alpha1_placementpolicy_strict_must.yml

Result

placementpolicy.placement-policy.scheduling.x-k8s.io/strict-must created

4. Deploy a `ReplicaSet` that will create 10 replicas

kubectl apply -f https://raw.githubusercontent.com/Azure/placement-policy-scheduler-plugins/main/examples/demo_replicaset.yml

Result

replicaset.apps/nginx created

5. Get pods with matching labels

kubectl get po -o wide -l app=nginx

NAME          READY   STATUS    RESTARTS   AGE   IP           NODE           NOMINATED NODE   READINESS GATES
nginx-8cr58   1/1     Running   0          76s   10.244.3.4   kind-worker3   <none>           <none>
nginx-d7js5   1/1     Running   0          76s   10.244.1.2   kind-worker2   <none>           <none>
nginx-jt527   1/1     Running   0          76s   10.244.3.6   kind-worker3   <none>           <none>
nginx-m5c86   1/1     Running   0          76s   10.244.2.2   kind-worker    <none>           <none>
nginx-qxx6m   1/1     Running   0          76s   10.244.3.2   kind-worker3   <none>           <none>
nginx-rdlzx   1/1     Running   0          76s   10.244.2.3   kind-worker    <none>           <none>
nginx-skk5z   1/1     Running   0          76s   10.244.1.3   kind-worker2   <none>           <none>
nginx-vq598   1/1     Running   0          76s   10.244.3.7   kind-worker3   <none>           <none>
nginx-xzxsb   1/1     Running   0          76s   10.244.3.3   kind-worker3   <none>           <none>
nginx-zwrsk   1/1     Running   0          76s   10.244.3.5   kind-worker3   <none>           <none>

We will find the nodes which carry the same node selector defined in the strict-must PlacementPolicy have been assigned 40% of the workload as defined with targetSize.

6. Clean up

Delete kind cluster

kind delete cluster --name placement-policy

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
.github		.github
apis/v1alpha1		apis/v1alpha1
charts/placement-policy-scheduler-plugins		charts/placement-policy-scheduler-plugins
cmd/scheduler		cmd/scheduler
config		config
deploy		deploy
examples		examples
hack		hack
manifest_staging		manifest_staging
pkg		pkg
test		test
.gitignore		.gitignore
.golangci.yml		.golangci.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
PROJECT		PROJECT
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
go.mod		go.mod
go.sum		go.sum

License

Azure/placement-policy-scheduler-plugins

Folders and files

Latest commit

History

Repository files navigation

Placement Policy Scheduler Plugins

Quick Start

Install

Quick Install

Helm

Example config

Demo

1. Create a kind cluster with the following config

2. Follow the Install section to deploy placement-policy-scheduler-plugins as a secondary scheduler

3. Deploy a PlacementPolicy CRD

4. Deploy a ReplicaSet that will create 10 replicas

5. Get pods with matching labels

6. Clean up

Contributing

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages

3. Deploy a `PlacementPolicy` CRD

4. Deploy a `ReplicaSet` that will create 10 replicas