rocBLAS 14.1.1 for ROCm 1.8.2
Changelist:
- update hgemm asm_full YAML file for performance; re-train hgemm hip_lite YAML file
- new YAML files with PreciseBoundsCheck disabled
- update hgemm asm_full YAML file, source and VW=2 for m,n,k <= 32
- update hgemm asm_full YAML file, source and VW=1 for m,n,k == 1
- add strided_batched tests for hgemm
- correct gemm test matrix initialization
- change cmake and source files to support hip-clang
- change from __fp16 to _Float16