From 368745c7bc333a09b66d0879e1c167db1302c411 Mon Sep 17 00:00:00 2001 From: Kevin Lewin Date: Fri, 19 Jan 2024 12:57:15 -0500 Subject: [PATCH] fixing module name --- docs/eks/gpu-monitoring.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/eks/gpu-monitoring.md b/docs/eks/gpu-monitoring.md index b33bb806..bfc12c87 100644 --- a/docs/eks/gpu-monitoring.md +++ b/docs/eks/gpu-monitoring.md @@ -1,7 +1,7 @@ # Monitoring NVIDIA GPU Workloads -GPUs play an integral part in data intensive workloads. The base infrastructure module of the Observability Accelerator provides the ability to deploy the NVIDIA DCGM Exporter Dashboard. -The dashboard utilizes metrics scraped from the `/metrics` endpoint that are exposed when running the nvidia gpu operator and NVSMI binary. +GPUs play an integral part in data intensive workloads. The eks-monitoring module of the Observability Accelerator provides the ability to deploy the NVIDIA DCGM Exporter Dashboard. +The dashboard utilizes metrics scraped from the `/metrics` endpoint that are exposed when running the nvidia gpu operator with the [DCGM exporter](https://developer.nvidia.com/blog/monitoring-gpus-in-kubernetes-with-dcgm/) and NVSMI binary. !!!note In order to make use of this dashboard, you will need to have a GPU backed EKS cluster and deploy the [GPU operator](https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/latest/amazon-eks.html)