Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add NVIDIA_H100_80GB and TPU_V5_LITEPOD to AcceleratorType #4557

Merged
merged 16 commits into from
Aug 21, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
16 commits
Select commit Hold shift + click to select a range
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions packages/google-cloud-aiplatform/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -299,6 +299,7 @@ Samples are in the [`samples/`](https://github.com/googleapis/google-cloud-node/
| Tensorboard_service.list_tensorboard_time_series | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.list_tensorboard_time_series.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.list_tensorboard_time_series.js,packages/google-cloud-aiplatform/samples/README.md) |
| Tensorboard_service.list_tensorboards | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.list_tensorboards.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.list_tensorboards.js,packages/google-cloud-aiplatform/samples/README.md) |
| Tensorboard_service.read_tensorboard_blob_data | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_blob_data.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_blob_data.js,packages/google-cloud-aiplatform/samples/README.md) |
| Tensorboard_service.read_tensorboard_size | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_size.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_size.js,packages/google-cloud-aiplatform/samples/README.md) |
| Tensorboard_service.read_tensorboard_time_series_data | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_time_series_data.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_time_series_data.js,packages/google-cloud-aiplatform/samples/README.md) |
| Tensorboard_service.read_tensorboard_usage | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_usage.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.read_tensorboard_usage.js,packages/google-cloud-aiplatform/samples/README.md) |
| Tensorboard_service.update_tensorboard | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.update_tensorboard.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1/tensorboard_service.update_tensorboard.js,packages/google-cloud-aiplatform/samples/README.md) |
Expand Down Expand Up @@ -491,6 +492,7 @@ Samples are in the [`samples/`](https://github.com/googleapis/google-cloud-node/
| Pipeline_service.get_training_pipeline | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/pipeline_service.get_training_pipeline.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/pipeline_service.get_training_pipeline.js,packages/google-cloud-aiplatform/samples/README.md) |
| Pipeline_service.list_pipeline_jobs | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/pipeline_service.list_pipeline_jobs.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/pipeline_service.list_pipeline_jobs.js,packages/google-cloud-aiplatform/samples/README.md) |
| Pipeline_service.list_training_pipelines | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/pipeline_service.list_training_pipelines.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/pipeline_service.list_training_pipelines.js,packages/google-cloud-aiplatform/samples/README.md) |
| Prediction_service.count_tokens | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.count_tokens.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.count_tokens.js,packages/google-cloud-aiplatform/samples/README.md) |
| Prediction_service.explain | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.explain.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.explain.js,packages/google-cloud-aiplatform/samples/README.md) |
| Prediction_service.predict | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.predict.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.predict.js,packages/google-cloud-aiplatform/samples/README.md) |
| Prediction_service.raw_predict | [source code](https://github.com/googleapis/google-cloud-node/blob/main/packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.raw_predict.js) | [![Open in Cloud Shell][shell_img]](https://console.cloud.google.com/cloudshell/open?git_repo=https://github.com/googleapis/google-cloud-node&page=editor&open_in_editor=packages/google-cloud-aiplatform/samples/generated/v1beta1/prediction_service.raw_predict.js,packages/google-cloud-aiplatform/samples/README.md) |
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -36,8 +36,8 @@ message Context {
pattern: "projects/{project}/locations/{location}/metadataStores/{metadata_store}/contexts/{context}"
};

// Output only. The resource name of the Context.
string name = 1 [(google.api.field_behavior) = OUTPUT_ONLY];
// Immutable. The resource name of the Context.
string name = 1 [(google.api.field_behavior) = IMMUTABLE];

// User provided display name of the Context.
// May be up to 128 Unicode characters.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -318,4 +318,9 @@ message Scheduling {
// This feature can be used by distributed training jobs that are not
// resilient to workers leaving and joining a job.
bool restart_job_on_worker_restart = 3;

// Optional. Indicates if the job should retry for internal errors after the
// job starts running. If true, overrides
// `Scheduling.restart_job_on_worker_restart` to false.
bool disable_retries = 5 [(google.api.field_behavior) = OPTIONAL];
}
Original file line number Diff line number Diff line change
Expand Up @@ -369,6 +369,20 @@ message ExportDataOperationMetadata {
string gcs_output_directory = 2;
}

// Runtime operation information for
// [DatasetService.CreateDatasetVersion][google.cloud.aiplatform.v1.DatasetService.CreateDatasetVersion].
message CreateDatasetVersionOperationMetadata {
// The common part of the operation metadata.
GenericOperationMetadata generic_metadata = 1;
}

// Runtime operation information for
// [DatasetService.RestoreDatasetVersion][google.cloud.aiplatform.v1.DatasetService.RestoreDatasetVersion].
message RestoreDatasetVersionOperationMetadata {
// The common part of the operation metadata.
GenericOperationMetadata generic_metadata = 1;
}

// Request message for
// [DatasetService.ListDataItems][google.cloud.aiplatform.v1.DatasetService.ListDataItems].
message ListDataItemsRequest {
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -207,6 +207,10 @@ message PipelineJob {
// is from supported template registry.
PipelineTemplateMetadata template_metadata = 20
[(google.api.field_behavior) = OUTPUT_ONLY];

// Output only. The schedule resource name.
// Only returned if the Pipeline is created by Schedule API.
string schedule_name = 22 [(google.api.field_behavior) = OUTPUT_ONLY];
}

// Pipeline template metadata if
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -154,6 +154,10 @@ message PublisherModel {
// Optional. Request for access.
RegionalResourceReferences request_access = 9
[(google.api.field_behavior) = OPTIONAL];

// Optional. Open evaluation pipeline of the PublisherModel.
RegionalResourceReferences open_evaluation_pipeline = 11
[(google.api.field_behavior) = OPTIONAL];
}

// An enum representing the open source category of a PublisherModel.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -88,8 +88,8 @@ message Schedule {
CreatePipelineJobRequest create_pipeline_job_request = 14;
}

// Output only. The resource name of the Schedule.
string name = 1 [(google.api.field_behavior) = OUTPUT_ONLY];
// Immutable. The resource name of the Schedule.
string name = 1 [(google.api.field_behavior) = IMMUTABLE];

// Required. User provided name of the Schedule.
// The name can be up to 128 characters long and can consist of any UTF-8
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -222,6 +222,7 @@ message ListSchedulesRequest {
// descending order.
//
// Supported fields:
//
// * `create_time`
// * `start_time`
// * `end_time`
Expand Down Expand Up @@ -296,7 +297,9 @@ message ResumeScheduleRequest {
message UpdateScheduleRequest {
// Required. The Schedule which replaces the resource on the server.
// The following restrictions will be applied:
//
// * The scheduled request type cannot be changed.
// * The non-empty fields cannot be unset.
// * The output_only fields will be ignored if specified.
Schedule schedule = 1 [(google.api.field_behavior) = REQUIRED];

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -112,6 +112,15 @@ service TensorboardService {
option (google.api.method_signature) = "tensorboard";
}

// Returns the storage size for a given TensorBoard instance.
rpc ReadTensorboardSize(ReadTensorboardSizeRequest)
returns (ReadTensorboardSizeResponse) {
option (google.api.http) = {
get: "/v1/{tensorboard=projects/*/locations/*/tensorboards/*}:readSize"
};
option (google.api.method_signature) = "tensorboard";
}

// Creates a TensorboardExperiment.
rpc CreateTensorboardExperiment(CreateTensorboardExperimentRequest)
returns (TensorboardExperiment) {
Expand Down Expand Up @@ -510,6 +519,27 @@ message ReadTensorboardUsageResponse {
map<string, PerMonthUsageData> monthly_usage_data = 1;
}

// Request message for
// [TensorboardService.ReadTensorboardSize][google.cloud.aiplatform.v1.TensorboardService.ReadTensorboardSize].
message ReadTensorboardSizeRequest {
// Required. The name of the Tensorboard resource.
// Format:
// `projects/{project}/locations/{location}/tensorboards/{tensorboard}`
string tensorboard = 1 [
(google.api.field_behavior) = REQUIRED,
(google.api.resource_reference) = {
type: "aiplatform.googleapis.com/Tensorboard"
}
];
}

// Response message for
// [TensorboardService.ReadTensorboardSize][google.cloud.aiplatform.v1.TensorboardService.ReadTensorboardSize].
message ReadTensorboardSizeResponse {
// Payload storage size for the TensorBoard
int64 storage_size_byte = 1;
}

// Request message for
// [TensorboardService.CreateTensorboardExperiment][google.cloud.aiplatform.v1.TensorboardService.CreateTensorboardExperiment].
message CreateTensorboardExperimentRequest {
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,9 @@ enum AcceleratorType {
// Nvidia L4 GPU.
NVIDIA_L4 = 11;

// Nvidia H100 80Gb GPU.
NVIDIA_H100_80GB = 13;

// TPU v2.
TPU_V2 = 6;

Expand All @@ -61,4 +64,7 @@ enum AcceleratorType {

// TPU v4.
TPU_V4_POD = 10;

// TPU v5.
TPU_V5_LITEPOD = 12;
}
Original file line number Diff line number Diff line change
Expand Up @@ -36,8 +36,8 @@ message Context {
pattern: "projects/{project}/locations/{location}/metadataStores/{metadata_store}/contexts/{context}"
};

// Output only. The resource name of the Context.
string name = 1 [(google.api.field_behavior) = OUTPUT_ONLY];
// Immutable. The resource name of the Context.
string name = 1 [(google.api.field_behavior) = IMMUTABLE];

// User provided display name of the Context.
// May be up to 128 Unicode characters.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -332,4 +332,9 @@ message Scheduling {
// This feature can be used by distributed training jobs that are not
// resilient to workers leaving and joining a job.
bool restart_job_on_worker_restart = 3;

// Optional. Indicates if the job should retry for internal errors after the
// job starts running. If true, overrides
// `Scheduling.restart_job_on_worker_restart` to false.
bool disable_retries = 5 [(google.api.field_behavior) = OPTIONAL];
}
Original file line number Diff line number Diff line change
Expand Up @@ -369,6 +369,20 @@ message ExportDataOperationMetadata {
string gcs_output_directory = 2;
}

// Runtime operation information for
// [DatasetService.CreateDatasetVersion][google.cloud.aiplatform.v1beta1.DatasetService.CreateDatasetVersion].
message CreateDatasetVersionOperationMetadata {
// The common part of the operation metadata.
GenericOperationMetadata generic_metadata = 1;
}

// Runtime operation information for
// [DatasetService.RestoreDatasetVersion][google.cloud.aiplatform.v1beta1.DatasetService.RestoreDatasetVersion].
message RestoreDatasetVersionOperationMetadata {
// The common part of the operation metadata.
GenericOperationMetadata generic_metadata = 1;
}

// Request message for
// [DatasetService.ListDataItems][google.cloud.aiplatform.v1beta1.DatasetService.ListDataItems].
message ListDataItemsRequest {
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -37,10 +37,10 @@ message DeploymentResourcePool {
pattern: "projects/{project}/locations/{location}/deploymentResourcePools/{deployment_resource_pool}"
};

// Output only. The resource name of the DeploymentResourcePool.
// Immutable. The resource name of the DeploymentResourcePool.
// Format:
// `projects/{project}/locations/{location}/deploymentResourcePools/{deployment_resource_pool}`
string name = 1 [(google.api.field_behavior) = OUTPUT_ONLY];
string name = 1 [(google.api.field_behavior) = IMMUTABLE];

// Required. The underlying DedicatedResources that the DeploymentResourcePool
// uses.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -181,6 +181,20 @@ message DiskSpec {
int32 boot_disk_size_gb = 2;
}

// Represents the spec of [persistent
// disk][https://cloud.google.com/compute/docs/disks/persistent-disks] options.
message PersistentDiskSpec {
// Type of the disk (default is "pd-standard").
// Valid values: "pd-ssd" (Persistent Disk Solid State Drive)
// "pd-standard" (Persistent Disk Hard Disk Drive)
// "pd-balanced" (Balanced Persistent Disk)
// "pd-extreme" (Extreme Persistent Disk)
string disk_type = 1;

// Size in GB of the disk (default is 100GB).
int64 disk_size_gb = 2;
}

// Represents a mount configuration for Network File System (NFS) to mount.
message NfsMount {
// Required. IP address of the NFS server.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -166,10 +166,10 @@ message ResourcePool {
[(google.api.field_behavior) = OPTIONAL];
}

// Optional. The unique ID in a PersistentResource to refer the this resource
// Immutable. The unique ID in a PersistentResource to refer the this resource
// pool. User can specify it if need to use it, otherwise we will generate it
// automatically.
string id = 1 [(google.api.field_behavior) = OPTIONAL];
string id = 1 [(google.api.field_behavior) = IMMUTABLE];

// Required. Immutable. The specification of a single machine.
MachineSpec machine_spec = 2 [
Expand All @@ -183,11 +183,6 @@ message ResourcePool {
// Optional. Disk spec for the machine in this node pool.
DiskSpec disk_spec = 4 [(google.api.field_behavior) = OPTIONAL];

// Output only. The number of machines currently not in use by training jobs
// for this resource pool. Deprecated. Use `used_replica_count` instead.
int64 idle_replica_count = 5
[deprecated = true, (google.api.field_behavior) = OUTPUT_ONLY];

// Output only. The number of machines currently in use by training jobs for
// this resource pool. Will replace idle_replica_count.
int64 used_replica_count = 6 [(google.api.field_behavior) = OUTPUT_ONLY];
Expand All @@ -205,7 +200,7 @@ message ResourceRuntimeSpec {
ServiceAccountSpec service_account_spec = 2
[(google.api.field_behavior) = OPTIONAL];

// Ray cluster configuration.
// Optional. Ray cluster configuration.
// Required when creating a dedicated RayCluster on the PersistentResource.
RaySpec ray_spec = 1 [(google.api.field_behavior) = OPTIONAL];
}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ service PersistentResourceService {
option (google.api.oauth_scopes) =
"https://www.googleapis.com/auth/cloud-platform";

// Uploads a Model artifact into Vertex AI.
// Creates a PersistentResource.
rpc CreatePersistentResource(CreatePersistentResourceRequest)
returns (google.longrunning.Operation) {
option (google.api.http) = {
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -207,6 +207,10 @@ message PipelineJob {
// is from supported template registry.
PipelineTemplateMetadata template_metadata = 20
[(google.api.field_behavior) = OUTPUT_ONLY];

// Output only. The schedule resource name.
// Only returned if the Pipeline is created by Schedule API.
string schedule_name = 22 [(google.api.field_behavior) = OUTPUT_ONLY];
}

// Pipeline template metadata if
Expand Down
Loading