
refactor kubernetes providers: each block in kube refers to a pod #1073

Merged
9 commits merged into master on Jun 19, 2019

Conversation

ZhuozhaoLi (Contributor):

Fixes #853.

The Kubernetes provider now creates plain pods for each executor, rather than Deployments.
The scaling of pods is managed by Parsl.

See the discussions in #853 for details.
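The bookkeeping this change implies can be sketched as follows. This is an illustrative stand-in, not Parsl's actual provider code: the provider submits one pod per block and cancels pods by job id, while Parsl drives scaling by calling `submit`/`cancel`; all class and attribute names here are hypothetical.

```python
class PodProviderSketch:
    """Hypothetical minimal model of a per-block pod provider.

    A real provider would call the Kubernetes API; here we only record
    state to show the submit/cancel bookkeeping Parsl relies on.
    """

    def __init__(self):
        self.pods = {}   # job_id -> pod record (a real provider tracks k8s pods)
        self._next = 0

    def submit(self, cmd_string):
        # In the real provider this creates a pod running cmd_string;
        # Parsl calls this once per block when scaling out.
        job_id = str(self._next)
        self._next += 1
        self.pods[job_id] = {"cmd": cmd_string, "status": "RUNNING"}
        return job_id

    def cancel(self, job_ids):
        # Mirrors the PR's change: delete pods (not Deployments) per job id,
        # so no controller respawns them after Parsl scales in.
        for job in job_ids:
            self.pods.pop(job, None)
        return [True for _ in job_ids]
```

Because each block is a bare pod with no Deployment controller behind it, deleting the pod is sufficient to scale in; nothing recreates it.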

@@ -65,7 +65,7 @@ def __init__(self,
                  max_blocks=10,
                  parallelism=1,
                  worker_init="",
-                 deployment_name=None,
+                 pod_name=None,
Collaborator:

needs docstring

Contributor Author:

fixed

@@ -161,9 +159,10 @@ def cancel(self, job_ids):
        for job in job_ids:
            logger.debug("Terminating job/proc_id: {0}".format(job))
            # Here we are assuming that for local, the job_ids are the process id's
            self._delete_deployment(job)
Collaborator:

Maybe remove _delete_deployment from the source file?

Contributor Author:

fixed

formatted_cmd = template_string.format(command=cmd_string,
worker_init=self.worker_init)

self.deployment_obj = self._create_deployment_object(job_name,
Collaborator:

_create_deployment and _create_deployment_object are dead code that should be removed maybe now.

Contributor Author:

removed

port=80,
cmd_string=None,
volumes=[]):
""" Create a kubernetes non-deployment pod for the job.
Collaborator:

Refer to this as this a "pod", not a "non-deployment pod".

Contributor Author:

fixed

for volume in volumes:
volume_mounts.append(client.V1VolumeMount(mount_path=volume[1],
name=volume[0]))
# Configureate Pod template container
Collaborator:

"configure"?

Contributor Author:

fixed
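The quoted loop suggests that each entry of `volumes` is a `(name, mount_path)` pair. A small stand-in sketch of that convention, with `make_mount` as a hypothetical substitute for `client.V1VolumeMount` (which the real code uses):

```python
# Illustrative stand-in for the volume-mount loop under review.
# Assumption: volumes is a list of (name, mount_path) tuples, as the
# indexing volume[0] / volume[1] in the snippet implies.

def make_mount(name, mount_path):
    # Hypothetical substitute for kubernetes.client.V1VolumeMount
    return {"name": name, "mount_path": mount_path}

def build_volume_mounts(volumes):
    volume_mounts = []
    for volume in volumes:
        # volume[0] is the volume name, volume[1] the mount path
        volume_mounts.append(make_mount(name=volume[0], mount_path=volume[1]))
    return volume_mounts
```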

name=volume[0]))
# Configureate Pod template container
container = None
if security_context:
Collaborator:

I think the only purpose of this is to supply or not supply the security_context parameter. It looks (from the generated Python code of V1Container in https://github.com/kubernetes-client/python/blob/master/kubernetes/client/models/v1_container.py) like it is fine to pass None for security_context, so this if-statement could collapse into a single code path.

Contributor Author:

collapsed
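The pattern the reviewer suggests can be sketched generically: when a constructor's keyword argument defaults to `None`, pass the value through unconditionally instead of branching. `Container` below is a hypothetical stand-in for `kubernetes.client.V1Container`, used only to keep the sketch self-contained.

```python
# Hypothetical stand-in for kubernetes.client.V1Container, whose generated
# constructor accepts security_context=None.
def Container(image, security_context=None):
    return {"image": image, "security_context": security_context}

# Before the collapse: two code paths depending on security_context.
def make_container_branching(image, security_context):
    if security_context:
        return Container(image, security_context=security_context)
    return Container(image)

# After the collapse: a single code path, passing None straight through.
def make_container(image, security_context=None):
    return Container(image, security_context=security_context)
```

Both versions produce the same object for every input, which is why the if-statement is safe to remove.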

@benclifford (Collaborator):

This looks ok but I'd like someone to confirm they've definitely run this code and seen it scale correctly.

@ZhuozhaoLi (Contributor Author):

> This looks ok but I'd like someone to confirm they've definitely run this code and seen it scale correctly.

@benclifford I have tested this on petrelkube with DLHub. This includes the following tests:

  1. Pods terminated when the main parsl process exits
  2. Pods terminated when there is no task
  3. Pods scaled out after termination when there are tasks

@benclifford benclifford self-requested a review June 19, 2019 12:20
@benclifford benclifford merged commit 45ec82a into master Jun 19, 2019
@benclifford benclifford deleted the fix_kube_provider_#853 branch June 19, 2019 14:38
Successfully merging this pull request may close these issues.

scale out mechanism in HTEX not compatible with kubernetes provider