
Update Intel Thread Counts #22894

Merged: 2 commits merged into microsoft:main, Dec 6, 2024

Conversation

@A-Satti (Contributor) commented Nov 19, 2024

Description

The default thread count methodology used by onnxruntime did not account for new and upcoming Intel microarchitectures, leading to suboptimal thread counts. Optimizing the thread count for these microarchitectures yields gains on the majority of models across datatypes, with speedups of up to ~1.5x.

Motivation and Context

Applications should run on Intel hardware with the most performant thread configuration for the majority of models. With new microarchitectures, the thread count methodology must be adjusted to take advantage of their differences.
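For readers experimenting with this change, here is a minimal sketch (not part of this PR) of how an application selects the intra-op thread count through ONNX Runtime's public C++ API; `model.onnx` is a placeholder path:

```cpp
// Sketch: choosing the intra-op thread count via the public C++ API.
// SetIntraOpNumThreads(0) asks ORT to use its default heuristic, i.e.
// the methodology this PR tunes for newer Intel microarchitectures.
#include <onnxruntime_cxx_api.h>

int main() {
  Ort::Env env(ORT_LOGGING_LEVEL_WARNING, "thread-count-demo");
  Ort::SessionOptions opts;
  opts.SetIntraOpNumThreads(0);    // 0 = let ORT's heuristic decide
  // opts.SetIntraOpNumThreads(8); // or pin an explicit count to compare
  Ort::Session session(env, ORT_TSTR("model.onnx"), opts);
  return 0;
}
```

Pinning an explicit count and comparing it against the default is the simplest way to measure the effect of the new heuristic on a given model.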

@jywu-msft jywu-msft requested a review from liqunfu November 21, 2024 00:11
@tianleiwu (Contributor) commented:
/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline

@tianleiwu (Contributor) commented:
/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline

@tianleiwu (Contributor) commented:
/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

Azure Pipelines successfully started running 7 pipeline(s).

Azure Pipelines successfully started running 8 pipeline(s).

Azure Pipelines successfully started running 10 pipeline(s).

@tianleiwu merged commit f5293d2 into microsoft:main on Dec 6, 2024; 91 checks passed.
@snnn (Member) commented Dec 9, 2024

@A-Satti, this PR removes the "/Qspectre" compile flag, which is a critical security flag. Although the flag carries a performance penalty, we cannot trade security for performance. You are free to omit the flag in your private builds, but all of ORT's official binaries must be built with it. Please add it back.
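For context, /Qspectre enables MSVC's mitigations for Spectre variant 1 (bounds-check bypass). Below is a minimal CMake sketch, assumed for illustration rather than taken from ORT's actual build scripts, of keeping the flag on for all MSVC builds:

```cmake
# Sketch (not ORT's real build logic): always compile with the MSVC
# Spectre-mitigation flag so official binaries carry the protection.
if(MSVC)
  add_compile_options(/Qspectre)
endif()
```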

@A-Satti (Contributor, Author) commented Dec 10, 2024

Hi @snnn, I created #23060 to address this. It restores the original flag and removes only the stale Meteorlake flag and its comment.

ankitm3k pushed a commit to intel/onnxruntime that referenced this pull request on Dec 11, 2024 (the same commit message was pushed three times; duplicates omitted):

### Description
The default thread count methodology used by onnxruntime did not account for new and upcoming Intel microarchitectures, leading to suboptimal thread counts. Optimizing the thread count for these microarchitectures yields gains on the majority of models across datatypes, with speedups of up to ~1.5x.

### Motivation and Context
Applications should run on Intel hardware with the most performant thread configuration for the majority of models. With new microarchitectures, the thread count methodology must be adjusted to take advantage of their differences.