Spot controller instance #1770
Replies: 2 comments 1 reply
-
Indeed, reducing the fixed cost of the controller has been something on the roadmap. For your use case, specifying skypilot/sky/templates/spot-controller.yaml.j2 Lines 5 to 6 in 8d6f6a9 like this resources:
disk_size: 50
cpus: 4+ should work. Caveat: if you have N vCPUs in the controller, the maximum concurrent spot jobs is 2*N. This is due to the constant set here. For a few jobs, using a smaller controller makes sense; for a larger number of jobs, the controller cost can be amortized. Let us know if the above works for you? |
Beta Was this translation helpful? Give feedback.
-
Thank you for your response. How much memory is actually needed if only running let's say 1 or 2 tasks at one time? For the m6 instance ratio of 4GB/cvpu, it looks like an average of 2 GB/tasks, but I guess there is a base cost. |
Beta Was this translation helpful? Give feedback.
-
What instance is actually needed for the spot controller?, if the comment in the template is still valid the instance looks like it's overkill, e.g. it's almost the price of a spot GPU instance or an on-demand basic GPU instance for inference.
If only using a single cloud provider can the requirements be reduced by editing the template file?
Beta Was this translation helpful? Give feedback.
All reactions