-
Notifications
You must be signed in to change notification settings - Fork 487
Closed
Description
This is track issue for Katib 0.10 release for Kubeflow 1.2.
Related: kubeflow/kubeflow#5224.
Main features:
- Release v1beta1 Katib version.
- New Trial template design (Support new Trial Template in v1beta1 #1208).
- Extracting metrics in multiple ways (extracting metric value in multiple ways #1140).
- Resume Experiment from volume (Save Suggestion state in persistent volume #1250).
- Custom CRDs in Trial template ([Feature] Modify Job provider to support any kind of Kubernetes CRDs #1214).
- Support MPIJob (Add MPI operator horovod example #1342).
- Support Tekton (Add Tekton Pipeline example #1339).
- Early Stopping (Early stopping implementation #1330).
TODOs list for UI update:
- Support new Trial template ([UI Feature] Support new Trial template in Katib UI #1217).
- Support metric strategies ([UI Feature] Support metric strategies in submit Experiment page #1220).
- Support resume experiment ([UI Feature] Support resume experiment in submit Experiment page #1221).
TODOs list for manifest update:
- Update
kubeflow/manifests
with the latest changes and image versions.
TODOs list for website update:
- Update all reference to the new v1beta1 APIs.
- New Trial template design ([Documentation] New Trial template design #1341).
- Trial metadata injection ([Documentation] Information about Trial meta injection in template #1280).
- Default values for Experiment APIs ([Documentation] Controller logic for defaults Experiment API parameters #1286).
- Resume Experiment feature ([Documentation] Resume Experiment feature #1292).
- Metrics strategies info ([Documentation] Metrics strategies information #1310).
- Disable ephemeral storage from resources ([Documentation] Disable ephemeral storage for Suggestion and metrics collector resources #1358).
- Annotation for istio sidecar ([Documentation] Annotation to disable istio sidecar container #1359).
Let me know if we need to add/remove something.
I the meantime, we are working on new AWS test infra: #1356. We try to carry out it before the release.
/area release
/priority p1
/cc @gaocegege @johnugeorge @jlewi @Jeffwan
gaocegege, rui-vas and c-batac-batac-bata