
Create worker pods through Deployments #730

Merged (5 commits) on Jun 23, 2023

Conversation

@Matt711 (Member) commented on Jun 8, 2023

Closes #603. Currently, if a worker group loses a pod, it isn't recreated by the operator. This PR solves that problem by creating a Deployment for each worker; the Deployment then creates the worker pod. The advantage of a Deployment is that we can define the number of replicas (one replica per worker in this case), and Kubernetes will recreate the worker pod if it is lost.
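
To illustrate the idea, here is a rough sketch (not the operator's actual template; the function name and fields are simplified for illustration) of a per-worker Deployment manifest with a single replica, so the Deployment controller recreates the Pod if it is deleted:

def build_worker_deployment(cluster_name: str, worker_name: str, image: str) -> dict:
    # Hypothetical sketch only: the real operator templates many more fields
    # (env, resources, scheduler address, etc.).
    labels = {
        "dask.org/cluster-name": cluster_name,
        "dask.org/component": "worker",
    }
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": worker_name, "labels": labels},
        "spec": {
            # One replica per worker: Kubernetes recreates the Pod if it goes away.
            "replicas": 1,
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {
                    "containers": [
                        {"name": "worker", "image": image},
                    ],
                },
            },
        },
    }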

@Matt711 (Member, Author) commented on Jun 8, 2023

One deployment per worker

@Matt711 marked this pull request as ready for review on June 9, 2023 22:16
@Matt711 (Member, Author) commented on Jun 9, 2023

Can we merge this after #711, pending review?

@tasansal (Contributor) commented

If this gets merged and a new release of dask-k8s goes out, this solves 90% of our problems 😍

Thanks for all the hard work!

@jacobtomlinson (Member) commented

CI failures are related to #738. Merging from main.

@jacobtomlinson (Member) left a comment

This looks great! I pulled it locally and tested it out and it seems to work great.

I'm keen to switch out kubernetes_asyncio for kr8s in daskworkergroup_replica_update, but we can do that in a follow-up to avoid scope creep on this PR.

I have suggested a change to the test that uses kr8s, but again we could just do this later.

Comment on lines +379 to +414
with kopf_runner as runner:
async with gen_cluster() as (cluster_name, ns):
scheduler_deployment_name = "simple-scheduler"
worker_deployment_name = "simple-default-worker"
service_name = "simple-scheduler"
while scheduler_deployment_name not in k8s_cluster.kubectl(
"get", "pods", "-n", ns
):
await asyncio.sleep(0.1)
while service_name not in k8s_cluster.kubectl("get", "svc", "-n", ns):
await asyncio.sleep(0.1)
while worker_deployment_name not in k8s_cluster.kubectl(
"get", "pods", "-n", ns
):
await asyncio.sleep(0.1)
k8s_cluster.kubectl(
"delete",
"pods",
"-l",
"dask.org/cluster-name=simple,dask.org/component=worker",
"-n",
ns,
)
k8s_cluster.kubectl(
"wait",
"--for=condition=Ready",
"-l",
"dask.org/cluster-name=simple,dask.org/component=worker",
"pod",
"-n",
ns,
"--timeout=60s",
)
assert worker_deployment_name in k8s_cluster.kubectl(
"get", "pods", "-n", ns
)
@jacobtomlinson (Member) commented

I spent some time tinkering with this test. I wrote this implementation, which uses kr8s and tests a little more thoroughly that the Pods get created, become Ready, get deleted, get recreated, and return to Ready.

This has also inspired a few tweaks to kr8s, so I may modify this a little further after it is merged.

Suggested change
with kopf_runner as runner:
async with gen_cluster() as (cluster_name, ns):
scheduler_deployment_name = "simple-scheduler"
worker_deployment_name = "simple-default-worker"
service_name = "simple-scheduler"
while scheduler_deployment_name not in k8s_cluster.kubectl(
"get", "pods", "-n", ns
):
await asyncio.sleep(0.1)
while service_name not in k8s_cluster.kubectl("get", "svc", "-n", ns):
await asyncio.sleep(0.1)
while worker_deployment_name not in k8s_cluster.kubectl(
"get", "pods", "-n", ns
):
await asyncio.sleep(0.1)
k8s_cluster.kubectl(
"delete",
"pods",
"-l",
"dask.org/cluster-name=simple,dask.org/component=worker",
"-n",
ns,
)
k8s_cluster.kubectl(
"wait",
"--for=condition=Ready",
"-l",
"dask.org/cluster-name=simple,dask.org/component=worker",
"pod",
"-n",
ns,
"--timeout=60s",
)
assert worker_deployment_name in k8s_cluster.kubectl(
"get", "pods", "-n", ns
)
api = await kr8s.asyncio.api()
with kopf_runner as runner:
async with gen_cluster() as (cluster_name, ns):
# Wait for worker Pods to be created
while True:
pods = await api.get(
"pods",
namespace=ns,
label_selector=f"dask.org/cluster-name={cluster_name},dask.org/component=worker",
)
if not pods:
await asyncio.sleep(0.1)
continue
break
# Store number of workers
n_pods = len(pods)
# Wait for worker Pods to be ready
await asyncio.gather(
*[pod.wait(conditions="condition=Ready", timeout=60) for pod in pods]
)
# Delete a worker Pod
await pods[0].delete()
# Wait for Pods to be recreated
while True:
pods = await api.get(
"pods",
namespace=ns,
label_selector=f"dask.org/cluster-name={cluster_name},dask.org/component=worker",
)
if len(pods) < n_pods:
await asyncio.sleep(0.1)
continue
break
# Wait for worker Pods to be ready
await asyncio.gather(
*[pod.wait(conditions="condition=Ready", timeout=60) for pod in pods]
)

@jacobtomlinson (Member) commented

I think I'm going to merge this and then update things in a follow-up.

@jacobtomlinson (Member) commented

Follow-up is #743.

@bstadlbauer (Collaborator) commented

@jacobtomlinson Would it be possible to create a release now that this is in? It would also really help us in the case of spot node preemption.

@jacobtomlinson (Member) commented

@bstadlbauer sure!

@briceruzand commented

Hi,
Have you studied the possibility of using only a ReplicaSet instead of a Deployment for the worker, in order to allow restartPolicy: Never? See #855 and #856.
Thanks

@jacobtomlinson (Member) commented

@briceruzand please don't comment on closed PRs. Open new issues instead.

Successfully merging this pull request may close these issues:

Worker group doesn't recover after pod deletion
5 participants