-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Default head node instance type for arm cluster is incorrect #206
Comments
cartalla
added a commit
that referenced
this issue
Mar 9, 2024
Add --RESEnvironmentName to the installer Ease initial integration with Research and Engineering Studio (RES). Automatically add the correct submitter security groups and configure the /home directory. Resolves #207 ============================ Update template config files Added more comments to clarify that these are examples that should be copied and customized by users. Added comments for typical configuration options. Deleted obsolete configs that were from v1. Resolves #203 ============================= Set default head node instance type based on architecture. Resolves #206 ============================== Clean up ansible-lint errors and warnings. Arm architecture cluster was failing because of an incorrect condition in the ansible playbook that is flagged by lint. ============================== Use vdi controller instead of cluster manager for users and groups info Cluster manager stopped being domain joined for some reason. ============================== Paginate describe_instances when creating head node a record. Otherwise, may not find the cluster head node instance. ============================== Add default MungeKeySecret. This should be the default or you can't access multiple clusters from the same server. ============================== Increase timeout for ssm command that configures submitters Need the time to compile slurm. ============================== Force slurm to be rebuilt for submitters of all os distributions even if they match the os of the cluster. Otherwise get errors because can't find PluginDir in the same location as when it was compiled. ============================== Paginate describe_instances in UpdateHeadNode lambda ============================== Add check for min memory of 4 GB for slurm controller
cartalla
added a commit
that referenced
this issue
Mar 9, 2024
Add --RESEnvironmentName to the installer Ease initial integration with Research and Engineering Studio (RES). Automatically add the correct submitter security groups and configure the /home directory. Resolves #207 ============================ Update template config files Added more comments to clarify that these are examples that should be copied and customized by users. Added comments for typical configuration options. Deleted obsolete configs that were from v1. Resolves #203 ============================= Set default head node instance type based on architecture. Resolves #206 ============================== Clean up ansible-lint errors and warnings. Arm architecture cluster was failing because of an incorrect condition in the ansible playbook that is flagged by lint. ============================== Use vdi controller instead of cluster manager for users and groups info Cluster manager stopped being domain joined for some reason. ============================== Paginate describe_instances when creating head node a record. Otherwise, may not find the cluster head node instance. ============================== Add default MungeKeySecret. This should be the default or you can't access multiple clusters from the same server. ============================== Increase timeout for ssm command that configures submitters Need the time to compile slurm. ============================== Force slurm to be rebuilt for submitters of all os distributions even if they match the os of the cluster. Otherwise get errors because can't find PluginDir in the same location as when it was compiled. ============================== Paginate describe_instances in UpdateHeadNode lambda ============================== Add check for min memory of 4 GB for slurm controller
cartalla
added a commit
that referenced
this issue
Mar 10, 2024
Add --RESEnvironmentName to the installer Ease initial integration with Research and Engineering Studio (RES). Automatically add the correct submitter security groups and configure the /home directory. Resolves #207 ============================ Update template config files Added more comments to clarify that these are examples that should be copied and customized by users. Added comments for typical configuration options. Deleted obsolete configs that were from v1. Resolves #203 ============================= Set default head node instance type based on architecture. Resolves #206 ============================== Clean up ansible-lint errors and warnings. Arm architecture cluster was failing because of an incorrect condition in the ansible playbook that is flagged by lint. ============================== Use vdi controller instead of cluster manager for users and groups info Cluster manager stopped being domain joined for some reason. ============================== Paginate describe_instances when creating head node a record. Otherwise, may not find the cluster head node instance. ============================== Add default MungeKeySecret. This should be the default or you can't access multiple clusters from the same server. ============================== Increase timeout for ssm command that configures submitters Need the time to compile slurm. ============================== Force slurm to be rebuilt for submitters of all os distributions even if they match the os of the cluster. Otherwise get errors because can't find PluginDir in the same location as when it was compiled. ============================== Paginate describe_instances in UpdateHeadNode lambda ============================== Add check for min memory of 4 GB for slurm controller
cartalla
added a commit
that referenced
this issue
Mar 20, 2024
Add --RESEnvironmentName to the installer Ease initial integration with Research and Engineering Studio (RES). Automatically add the correct submitter security groups and configure the /home directory. Automatically choose the subnets if not specified based on RES subnets. Resolves #207 ============================ Update template config files Added more comments to clarify that these are examples that should be copied and customized by users. Added comments for typical configuration options. Deleted obsolete configs that were from v1. Resolves #203 ============================= Set default head node instance type based on architecture. Resolves #206 ============================== Clean up ansible-lint errors and warnings. Arm architecture cluster was failing because of an incorrect condition in the ansible playbook that is flagged by lint. ============================== Use vdi controller instead of cluster manager for users and groups info Cluster manager stopped being domain joined for some reason. ============================== Paginate describe_instances when creating head node a record. Otherwise, may not find the cluster head node instance. ============================== Add default MungeKeySecret. This should be the default or you can't access multiple clusters from the same server. ============================== Increase timeout for ssm command that configures submitters Need the time to compile slurm. ============================== Force slurm to be rebuilt for submitters of all os distributions even if they match the os of the cluster. Otherwise get errors because can't find PluginDir in the same location as when it was compiled. ============================== Paginate describe_instances in UpdateHeadNode lambda ============================== Add check for min memory of 4 GB for slurm controller
cartalla
added a commit
that referenced
this issue
Mar 22, 2024
Add --RESEnvironmentName to the installer Ease initial integration with Research and Engineering Studio (RES). Automatically add the correct submitter security groups and configure the /home directory. Automatically choose the subnets if not specified based on RES subnets. Resolves #207 ============================ Update template config files Added more comments to clarify that these are examples that should be copied and customized by users. Added comments for typical configuration options. Deleted obsolete configs that were from v1. Resolves #203 ============================= Set default head node instance type based on architecture. Resolves #206 ============================== Clean up ansible-lint errors and warnings. Arm architecture cluster was failing because of an incorrect condition in the ansible playbook that is flagged by lint. ============================== Use vdi controller instead of cluster manager for users and groups info Cluster manager stopped being domain joined for some reason. ============================== Paginate describe_instances when creating head node a record. Otherwise, may not find the cluster head node instance. ============================== Add default MungeKeySecret. This should be the default or you can't access multiple clusters from the same server. ============================== Increase timeout for ssm command that configures submitters Need the time to compile slurm. ============================== Force slurm to be rebuilt for submitters of all os distributions even if they match the os of the cluster. Otherwise get errors because can't find PluginDir in the same location as when it was compiled. ============================== Paginate describe_instances in UpdateHeadNode lambda ============================== Add check for min memory of 4 GB for slurm controller ============================== Update documentation. Remove Regions from InstanceConfig. This was left over from legacy cluster. ParallelCluster doesn't support multiple regions.
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
Defaults to c6a.large and cluster fails to build.
The text was updated successfully, but these errors were encountered: