Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate resource allocation map from agent to manager #3515

Open
fregataa opened this issue Jan 22, 2025 — with Lablup-Issue-Syncer · 0 comments
Open

Migrate resource allocation map from agent to manager #3515

fregataa opened this issue Jan 22, 2025 — with Lablup-Issue-Syncer · 0 comments

Comments

@fregataa
Copy link
Member

fregataa commented Jan 22, 2025

Motivation  

  • InsufficientResource errors occur in the CREATING phase of session creation, while creating containers in agents. This error may result from:
    • Resource state mismatches between managers and agents (BA-586)
    • Resource fragmentation in agent.
      Since managers don't have visibility into agents' resource allocation maps, they can only schedule sessions by comparing the required resource amount with the total sum of remaining resources in agents. This leads to scheduling sessions to inappropriate agents that have fragmented resources

Objective  

  • Let scheduler consider allocation map and avoid scheduling sessions to resource fragmented agents

�Expected Sub Issue

  • Migrate resource allocation map from agent to manager
  • Update schedulers of manager to consider the allocation map
  • Update all accelerator plugins
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant