[QNN EP] Improve QDQ model accuracy tests #16916

adrianlizarraga · 2023-07-29T02:11:24Z

Description

Improves how unit tests measure the accuracy of QDQ models on QNN EP.
Adds tests for ops: Add, Mul, Abs¹, And¹, Or¹, Ceil¹, Cos¹

¹: Not previously supported due to missing node unit handling.

Motivation and Context

The new approach for testing QDQ operator accuracy requires running 3 inferences:

float model on CPU EP (baseline)
qdq model on CPU EP
qdq model on QNN EP

The units tests check that running the QDQ model on QNN EP (3) is at least as accurate (+- small tolerance) as running the QDQ model on CPU EP (2). We measure accuracy by comparing to the baseline (1).

This is essentially what we care about: is qnn ep as accurate as cpu ep. If not, it is worth investigating as a potential bug.

onnxruntime/core/providers/qnn/qnn_execution_provider.cc

onnxruntime/test/providers/qnn/argmaxmin_op_test.cc

onnxruntime/test/providers/qnn/max_pool_test.cpp

HectorSVC

HectorSVC

### Description Slightly increases the allowable error tolerance for ReduceProd tests on x64 Windows/Linux with the QNN CPU backend. ### Motivation and Context A recent [PR](#16916) updated the input range for ReduceProd tests, which uncovered an inaccuracy for ReduceProd on x64 Windows/Linux with the QNN CPU backend. This PR updates the allowable error tolerance and adds a TODO for investigation. This is needed to ensure the QNN_Nuget_Windows pipeline runs successfully.

### Description - Improves how unit tests measure the accuracy of QDQ models on QNN EP. - Adds tests for ops: Add, Mul, Abs1, And1, Or1, Ceil1, Cos1 1: Not previously supported due to missing node unit handling. ### Motivation and Context The new approach for testing QDQ operator accuracy requires running 3 inferences: 1. float model on CPU EP (baseline) 2. qdq model on CPU EP 3. qdq model on QNN EP The units tests check that running the QDQ model on QNN EP (3) is at least as accurate (+- small tolerance) as running the QDQ model on CPU EP (2). We measure accuracy by comparing to the baseline (1). This is essentially what we care about: is qnn ep as accurate as cpu ep. If not, it is worth investigating as a potential bug.

### Description Slightly increases the allowable error tolerance for ReduceProd tests on x64 Windows/Linux with the QNN CPU backend. ### Motivation and Context A recent [PR](#16916) updated the input range for ReduceProd tests, which uncovered an inaccuracy for ReduceProd on x64 Windows/Linux with the QNN CPU backend. This PR updates the allowable error tolerance and adds a TODO for investigation. This is needed to ensure the QNN_Nuget_Windows pipeline runs successfully.

### Description - Improves how unit tests measure the accuracy of QDQ models on QNN EP. - Adds tests for ops: Add, Mul, Abs1, And1, Or1, Ceil1, Cos1 1: Not previously supported due to missing node unit handling. ### Motivation and Context The new approach for testing QDQ operator accuracy requires running 3 inferences: 1. float model on CPU EP (baseline) 2. qdq model on CPU EP 3. qdq model on QNN EP The units tests check that running the QDQ model on QNN EP (3) is at least as accurate (+- small tolerance) as running the QDQ model on CPU EP (2). We measure accuracy by comparing to the baseline (1). This is essentially what we care about: is qnn ep as accurate as cpu ep. If not, it is worth investigating as a potential bug.

…soft#17078) ### Description Slightly increases the allowable error tolerance for ReduceProd tests on x64 Windows/Linux with the QNN CPU backend. ### Motivation and Context A recent [PR](microsoft#16916) updated the input range for ReduceProd tests, which uncovered an inaccuracy for ReduceProd on x64 Windows/Linux with the QNN CPU backend. This PR updates the allowable error tolerance and adds a TODO for investigation. This is needed to ensure the QNN_Nuget_Windows pipeline runs successfully.

adrianlizarraga added 28 commits July 24, 2023 18:14

Add tests

caed9a1

Fix merge conflicts from main

4f22f43

Rework new Asin, Sin, and Sign tests to use float inputs/outputs

afbe467

Add 3-way comparision between Cpu(f32), Cpu(QDQ), QNN(QDQ)

59948ed

Clean up

35b91ee

Switch argmin/argmax tests to new QDQ accuracy testing

eee35db

Convert averagepool tests (need to disable inaccurate tests)

fa5dcb4

Add explicit inputs to AvergePool tests

aeba185

Use new QDQ acc testing for BatchNorm. Found issues.

9fcba0b

Update conv tests to new accuracy testing

b6123fd

Update comments for QDQ accuracy testing func

ec41d97

Fix linter errors. Add include for math

d89e0b3

More linter fixes. Use difference cmath funcs

40ac48a

Update shape for And/Or tests

a610a6e

More lint error

1eed952

Update Gather op tests. Need to support testing scalar indices

9c6149f

Simplify accuracy computation

46f4793

Convert InstanceNorm to new testing approach

7de0d07

Update LeakyRelu tests to new accuracy testing approach

974bc77

Update LRN tests

f91aea1

Update MatMul tests

d48980f

Fix merge conflicts with main

2a05554

Reuse GetDataQuantParams function

f82fb22

Update MaxPool tests

e513d63

Update ReduceOp cpu tests

fd01af3

Update QDQ Reduce op tests

56fa59b

Explicitly handle quant params when data only has a single 0

7af9410

Update Resize tests to use new accuracy measuring approach

962bca6

adrianlizarraga marked this pull request as ready for review July 31, 2023 07:27

adrianlizarraga requested a review from HectorSVC July 31, 2023 07:27

adrianlizarraga requested a review from jywu-msft July 31, 2023 07:27

HectorSVC reviewed Jul 31, 2023

View reviewed changes

onnxruntime/core/providers/qnn/qnn_execution_provider.cc Show resolved Hide resolved

adrianlizarraga added 4 commits July 31, 2023 11:03

Add disabled inaccurate tests

7917a07

Add a way to override a test input's range

f2216ee

Move all reduce op tests into 1 file and rename it

32ea966

Merge latest commits from main

0742fd3

HectorSVC reviewed Aug 3, 2023

View reviewed changes

onnxruntime/test/providers/qnn/argmaxmin_op_test.cc Show resolved Hide resolved

HectorSVC reviewed Aug 3, 2023

View reviewed changes

onnxruntime/test/providers/qnn/max_pool_test.cpp Outdated Show resolved Hide resolved

HectorSVC previously approved these changes Aug 3, 2023

View reviewed changes

adrianlizarraga added 3 commits August 3, 2023 14:08

Merge latest commits from main

20edf00

Revert change in input shape

3878e79

Merge latest commits from main

a21251e

adrianlizarraga dismissed HectorSVC’s stale review via a21251e August 3, 2023 22:36

HectorSVC approved these changes Aug 3, 2023

View reviewed changes

adrianlizarraga merged commit 191f98a into main Aug 4, 2023

adrianlizarraga deleted the adrianl/qnn-add-unit-tests branch August 4, 2023 19:15

adrianlizarraga mentioned this pull request Aug 9, 2023

[QNN EP] Increase tolerance for ReduceProd test on x64 Windows #17078

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QNN EP] Improve QDQ model accuracy tests #16916

[QNN EP] Improve QDQ model accuracy tests #16916

adrianlizarraga commented Jul 29, 2023 •

edited

Loading

HectorSVC left a comment

HectorSVC left a comment

[QNN EP] Improve QDQ model accuracy tests #16916

[QNN EP] Improve QDQ model accuracy tests #16916

Conversation

adrianlizarraga commented Jul 29, 2023 • edited Loading

Description

Motivation and Context

HectorSVC left a comment

Choose a reason for hiding this comment

HectorSVC left a comment

Choose a reason for hiding this comment

adrianlizarraga commented Jul 29, 2023 •

edited

Loading