Skip to content

Commit

Permalink
Rather than using the default algorithm, use KK_DENSE algorithm.
Browse files Browse the repository at this point in the history
The result is substantial performance improvements (10-30%) for the Floating Platform,
IEA15 Rotor, and IEA15 Rotor with 3000 elements on CPU.  Similar performance
improvements are seen on GPU, with the exception of the IEA15 with 3000 elements.
However, the performance of that test is dominated by the system solve, so the result
is no net change in performance.
  • Loading branch information
ddement committed Jan 24, 2025
1 parent c772ec4 commit 0bb6f70
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/solver/create_matrix_spgemm.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ template <typename CrsMatrixType, typename KernelHandle>
const CrsMatrixType& A, const CrsMatrixType& B, KernelHandle& handle
) {
auto C = CrsMatrixType{};
handle.create_spgemm_handle();
handle.create_spgemm_handle(KokkosSparse::SPGEMMAlgorithm::SPGEMM_KK_DENSE);
KokkosSparse::spgemm_symbolic(handle, A, false, B, false, C);
KokkosSparse::spgemm_numeric(handle, A, false, B, false, C);
return C;
Expand Down

0 comments on commit 0bb6f70

Please sign in to comment.