Workload resnet50: no IOs are generated at all with dali while benchmark reports it is running #61

Open
alexander272272 opened this issue Apr 8, 2024 · 1 comment

alexander272272 commented Apr 8, 2024

Steps to reproduce:

  • git clone -b v1.0-rc1 --recurse-submodules https://github.com/mlcommons/storage.git
  • pip3 install -r dlio_benchmark/requirements.txt
  • ./benchmark.sh datagen --workload resnet50 --accelerator-type h100 --num-parallel 8 --param dataset.num_files_train=1200 --param dataset.data_folder=/mnt/1/ifs/data/rosnet50_05_04_2024_x02
  • ./benchmark.sh run --hosts HOST --workload resnet50 --accelerator-type h100 --num-accelerators 2 --results-dir resultsdir-$(date +"%d-%m-%Y") --param dataset.num_files_train=1200 --param dataset.data_folder=/mnt/1/ifs/data/rosnet50_05_04_2024_x02

The benchmark shows progress, but no I/O reaches the NAS.
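
One way to confirm the missing I/O from the client side, as a rough sketch: assuming the NAS is mounted over NFS on Linux (the protocol is an assumption, it is not stated in this report), the per-mount counters in /proc/self/mountstats can be read before and after a run. The function name `nfs_read_bytes` and the mount point `/mnt/1` in the sample are hypothetical.

```python
def nfs_read_bytes(mountstats_text, mount_point):
    """Return the bytes read from the server for mount_point, or None if absent.

    Assumes the Linux NFS client layout of /proc/self/mountstats, where each
    mount starts with a "device ..." header and has a "bytes:" line whose
    fifth counter is the number of bytes read from the server.
    """
    in_target = False
    for line in mountstats_text.splitlines():
        fields = line.split()
        if line.startswith("device "):
            # Header: device <source> mounted on <mount_point> with fstype ...
            in_target = len(fields) >= 5 and fields[4] == mount_point
        elif in_target and fields and fields[0] == "bytes:":
            # bytes: normalread normalwrite directread directwrite
            #        serverread serverwrite readpages writepages
            return int(fields[5])
    return None


# Made-up sample in the mountstats layout, for illustration only.
sample = (
    "device nas:/ifs mounted on /mnt/1 with fstype nfs statvers=1.1\n"
    "\tbytes:\t10 20 0 0 4096 20 1 1\n"
)
print(nfs_read_bytes(sample, "/mnt/1"))
```

On the benchmark host, the text would come from reading /proc/self/mountstats once before and once after the run. If the two values are equal for the dataset mount, the run read nothing from the server.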

If the config in ./storage-conf/workload/resnet50_h100.yaml is changed as follows:
< framework: pytorch
---
> framework: tensorflow


<  data_loader: dali
---
>  data_loader: tensorflow

With that change, I/O is generated and can be captured on the wire.
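
For anyone who wants to apply the same two-line change without editing the file by hand, here is a minimal sketch. It assumes the file contains the `framework: pytorch` and `data_loader: dali` lines exactly as in the diff above. The function name `switch_to_tensorflow` and the sample fragment (including `section` and `other_key`) are hypothetical and not taken from the real config.

```python
import re


def switch_to_tensorflow(yaml_text):
    """Return yaml_text with framework and data_loader set to tensorflow.

    Only the two values shown in the diff are replaced; indentation and all
    other lines are left untouched.
    """
    text = re.sub(
        r"^([ \t]*framework:[ \t]*)pytorch[ \t]*$",
        r"\1tensorflow",
        yaml_text,
        flags=re.MULTILINE,
    )
    text = re.sub(
        r"^([ \t]*data_loader:[ \t]*)dali[ \t]*$",
        r"\1tensorflow",
        text,
        flags=re.MULTILINE,
    )
    return text


# Made-up fragment for illustration; the real file has more keys.
sample = "framework: pytorch\nsection:\n  data_loader: dali\n  other_key: 1\n"
print(switch_to_tensorflow(sample))
```

To use it on the real file, read ./storage-conf/workload/resnet50_h100.yaml, pass the text through the function, and write the result back, keeping a copy of the original.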

# cat /etc/os-release
PRETTY_NAME="Ubuntu 22.04.1 LTS"
NAME="Ubuntu"
VERSION_ID="22.04"
VERSION="22.04.1 LTS (Jammy Jellyfish)"
VERSION_CODENAME=jammy
ID=ubuntu
ID_LIKE=debian
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
UBUNTU_CODENAME=jammy

@zhenghh04
Contributor

We are aware of that. We are in the process of addressing the issue in the DLIO code.
