Create dev mode instructions for operator deployment #467

Tomcli · 2020-12-08T18:48:57Z

Currently, the operator always watching the Kubeflow resources to reconcile when something is missing. This is good for production environment, but not very friendly when we need to remove and test resources in our development and testing setup. It would be nice to have a dev_mode flag to disable the operator watcher for development.

/cc @moficodes

vpavlin · 2020-12-08T18:50:30Z

How would the reconcilation be triggered then? (I.e. what would the operator do if not watch:) )

moficodes · 2020-12-08T18:53:21Z

I think the goal is to make the operator more like kfctl. With the dev mode operator is just wrapping kfctl running the command once and thats it.

its useful for quickly iterating and testing the operator deployment.

moficodes · 2020-12-08T18:53:33Z

I can take a look at it.

moficodes · 2020-12-08T18:53:38Z

/assign

vpavlin · 2020-12-08T18:58:02Z

Why not just use kfctl then?

Or even better, use the operator-sdk tooling for development - https://github.com/operator-framework/getting-started#2-run-locally-outside-the-cluster

Tomcli · 2020-12-08T19:06:21Z

This is coming from one of our users who doesn't have much experience as a devops. We probably don't have to disable all the watchers, we only want to disable the watcher for monitoring the k8s resources https://github.com/kubeflow/kfctl/blob/master/pkg/controller/kfdef/kfdef_controller.go#L119

Tomcli · 2020-12-08T19:07:03Z

also, this is an opt-out feature, so it shouldn't change the behavior of the current operator deployment.

Tomcli · 2020-12-08T19:18:49Z

Why not just use kfctl then?

Or even better, use the operator-sdk tooling for development - https://github.com/operator-framework/getting-started#2-run-locally-outside-the-cluster

For most of our users, kfctl is sufficient in this case. However, we have some users that are using window or have very little experience with terminal. So able to use operator for development would be nice for them.

vpavlin · 2020-12-08T20:00:27Z

Can you help me to understand the use case again - maybe with more details? It sounds like there is a very specific case which would get treatment in the operator where it should rather be treated by educating the user(s).

Tomcli · 2020-12-08T22:04:43Z

Since the default behavior for operator now is to reapply the kfdef if there a delete event from any kfctl resource, users that made changes to the Kubeflow deployment with kubectl edit instead of updating kubeflow/manifests will lose their configuration. I do agree educating the users is the right approach, but I'm seeing some users are afraid to use operator when they see a big learning curve for deployment.

I suggest only use this flag for users that are deploying Kubeflow by themselves in a dev setup. So those who are interested in the Kubeflow project will be more committed to learn about kustomize and kfdef to deploy Kubeflow with the operator in the right way.

nakfour · 2020-12-10T19:58:37Z

@Tomcli I don't think it is a good idea to override a normal operator workflow to satisfy a small set of users. Another option they can do as @tumido pointed out is to install the operator, install Kubeflow and then pull down the operator pod instance to 0. This will remove the operator pod watching and doing the reconcile function. I am absolutely not a fan of adding code that breaks the fundamental function of an operator.

Tomcli · 2020-12-10T20:26:33Z

Thanks @nakfour, pull down the operator pod instance to 0 can be a good option. Then we probably want to add some instructions for:

How to stop watching kubeflow deployment (using kubectl, k8s/ocp ui to cover different audiences)
When to resume watching (e.g. deleting kubeflow, update kfdef)

Hopefully this way we should able to help out our users without changing the operator behaviors.

tumido · 2020-12-10T20:29:48Z

I was looking for this issue and couldn't find it.😁

Precisely as @nakfour says. My experience with dev setup, when working on adjusting ODH components, I've found out that only either manual kfctl or scaling down the operator after the initial deploy gives me the control I need.

If you need to test the operator interaction with your kfdef, the best way is to let it operate. And if you need to manually modify the manifests after the initial deploy, you should pause the operator - scaling it down is by far the most easy option.

This way you also have control over the updated manifests from the repositories specified in kfdef since the operator holds the repository cache in the pods, so when you scale it up again, you have the most fresh manifests available.

I think, if you need to do manual adjustmets, you need to turn the autopilot off first.

tumido · 2020-12-11T09:56:25Z

btw, @Tomcli this way the whole "dev mode" toggle experience can be as simple as this:

Disable operator

oc patch deployment opendatahub-operator -n openshift-operators -p '{"spec":{"replicas":0}}'

Enable operator

oc patch deployment opendatahub-operator -n openshift-operators -p '{"spec":{"replicas":1}}'

you can also alias it in you bash to something shorter, which makes it even more convenient to use. 🙂

Tomcli · 2020-12-11T17:28:28Z

Thanks @tumido, I can add these instructions to the kubeflow/website and close this issue.

k8s-ci-robot assigned moficodes Dec 8, 2020

Tomcli changed the title ~~Create a dev mode for operator deployment~~ Create a dev mode instructions for operator deployment Dec 10, 2020

Tomcli changed the title ~~Create a dev mode instructions for operator deployment~~ Create dev mode instructions for operator deployment Dec 10, 2020

moficodes removed their assignment Feb 5, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create dev mode instructions for operator deployment #467

Create dev mode instructions for operator deployment #467

Tomcli commented Dec 8, 2020

vpavlin commented Dec 8, 2020

moficodes commented Dec 8, 2020

moficodes commented Dec 8, 2020

moficodes commented Dec 8, 2020

vpavlin commented Dec 8, 2020

Tomcli commented Dec 8, 2020

Tomcli commented Dec 8, 2020

Tomcli commented Dec 8, 2020

vpavlin commented Dec 8, 2020

Tomcli commented Dec 8, 2020

nakfour commented Dec 10, 2020

Tomcli commented Dec 10, 2020

tumido commented Dec 10, 2020 •

edited

Loading

tumido commented Dec 11, 2020

Tomcli commented Dec 11, 2020

Create dev mode instructions for operator deployment #467

Create dev mode instructions for operator deployment #467

Comments

Tomcli commented Dec 8, 2020

vpavlin commented Dec 8, 2020

moficodes commented Dec 8, 2020

moficodes commented Dec 8, 2020

moficodes commented Dec 8, 2020

vpavlin commented Dec 8, 2020

Tomcli commented Dec 8, 2020

Tomcli commented Dec 8, 2020

Tomcli commented Dec 8, 2020

vpavlin commented Dec 8, 2020

Tomcli commented Dec 8, 2020

nakfour commented Dec 10, 2020

Tomcli commented Dec 10, 2020

tumido commented Dec 10, 2020 • edited Loading

tumido commented Dec 11, 2020

Disable operator

Enable operator

Tomcli commented Dec 11, 2020

tumido commented Dec 10, 2020 •

edited

Loading