Adding CCC based autoscaler files #109
pulasthi wants to merge 3 commits into AI-Hypercomputer:tpu7x-auto from
Signed-off-by: pulasthi <pulasthi@google.com>
Thanks @pulasthi! One question: when running automation_launch.sh, I see a lot of warnings. Should we remove them from the YAML?
We also need to add a README and update https://github.com/AI-Hypercomputer/accelerator-microbenchmarks/blob/tpu7x-auto/Ironwood/guides/automation/README.md to include the CCC solution.
This is a limitation in CCC when it is used in conjunction with Kueue. When CCC is used without Kueue, the only node selector needed is the one pointing to the CCC class. With only that selector, however, Kueue does not create the workload correctly, so the normal node selectors need to be added to get past Kueue. CCC then logs warnings for these, since the information is already present in the CCC template. I will file a bug about this and follow up with the CCC team, but for now we need to keep them.
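For illustration only, the two cases can be sketched as the node selector maps a pod template would carry. This is a minimal sketch, not the YAML from this PR: it assumes the standard GKE label keys `cloud.google.com/compute-class`, `cloud.google.com/gke-tpu-accelerator`, and `cloud.google.com/gke-tpu-topology`, and the helper name and all values are hypothetical placeholders.

```python
# Assumption: standard GKE label key used to select a custom compute class (CCC).
CCC_LABEL = "cloud.google.com/compute-class"


def build_node_selector(compute_class, extra_selectors=None, with_kueue=True):
    """Return a nodeSelector mapping for a pod template (hypothetical helper).

    Without Kueue, only the CCC class selector is included.
    With Kueue, the extra node selectors are added as well, which is the
    redundancy that CCC reports as warnings.
    """
    selector = {CCC_LABEL: compute_class}
    if with_kueue and extra_selectors:
        selector.update(extra_selectors)
    return selector


# Placeholder values; the real class name and TPU settings live in the PR's YAML.
ccc_only = build_node_selector("example-ccc-class", with_kueue=False)
ccc_with_kueue = build_node_selector(
    "example-ccc-class",
    extra_selectors={
        "cloud.google.com/gke-tpu-accelerator": "example-accelerator",
        "cloud.google.com/gke-tpu-topology": "example-topology",
    },
)
```

The second map is the one the YAML currently needs; the extra entries duplicate what the CCC template already specifies.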
Hi @junjieqian, I added the updated README file. Most of the content is the same as the current README.
Adding scripts for automated microbenchmark runs with CCC