src: attr: quantization refactor (part 3) #2746

dzarukin · 2025-02-24T21:54:14Z

This is the finishing part of quantization refactor that beautifies skip_mask values for scales and zero-points and introduces a single class for both of them with ability to extend one over the other.

dzarukin · 2025-02-24T21:54:59Z

make test
enable arch_gpu_ampere
enable compiler_icx-oss

mgouicem

(nit) would it makes sense to also remove usage of DNNL_ARG_ATTR_OUTPUT_SCALES from primitive_exec_types.cpp and miopen_{convolution,inner_product}.cpp as part of this PR?

Fix dispatching to jit gemm implementation. Previous logic would update the variable which would skip entire verbose verification function.

dzarukin requested review from a team as code owners February 24, 2025 21:54

mgouicem approved these changes Feb 25, 2025

View reviewed changes

dzarukin force-pushed the dzarukin/quant_styling branch from cf9b2da to 4478b31 Compare February 26, 2025 00:12

github-actions bot added the component:api Codeowner: @oneapi-src/onednn-arch label Feb 26, 2025

dzarukin added 4 commits February 27, 2025 15:49

src: update skip_mask names

090a8ac

common: primitive_attr_quant: establish a base class for scales and zps

3ff86c7

src: remove unused DNNL_ARG_ATTR_OUTPUT_SCALES

fd8619f

fixup: src: move zero_points to quant_entry_t abstraction

e444113

Fix dispatching to jit gemm implementation. Previous logic would update the variable which would skip entire verbose verification function.

dzarukin force-pushed the dzarukin/quant_styling branch from 4478b31 to e444113 Compare February 27, 2025 23:49

sgeor255 approved these changes Feb 28, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

src: attr: quantization refactor (part 3) #2746

src: attr: quantization refactor (part 3) #2746

dzarukin commented Feb 24, 2025

dzarukin commented Feb 24, 2025

mgouicem left a comment

src: attr: quantization refactor (part 3) #2746

Are you sure you want to change the base?

src: attr: quantization refactor (part 3) #2746

Conversation

dzarukin commented Feb 24, 2025

dzarukin commented Feb 24, 2025

mgouicem left a comment

Choose a reason for hiding this comment