Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[igemm_dynamic] v4r1 bwd dynamic kernel (#272)
* [dynamic-kernel] add v4r1 generic dynamic kernel and solver, fwd fp32 * update tunable table * fix tidy for -abseil-string-find-startswith * fix tidy for readability-simplify-boolean-expr * add code of v4r1 dynamic fwd kc1x1 case * runnable code for v4r1 igemm 1x1 asm kernel case * modify igemm dynamic kernel call func: if kc1x1 kernel, remove the xy kernel args * add test_conv_for_dynamic_implicit_gemm to test dynamic kernel feature * register invoker for igemm_dynamic solver * fix hip-clang bug to run assembly kernel * put asm file in folder kernels/dynamic_igemm * add v4r1 bwd [skip ci] * add missing header include * split invoker into seperate conv direction * use conv_problem as invoker param, instead of conv ctx * remove kernel name check in invoker Co-authored-by: root <[email protected]> Co-authored-by: shaojiewang <[email protected]> Co-authored-by: Daniel Lowell <[email protected]>
- Loading branch information