-
Notifications
You must be signed in to change notification settings - Fork 915
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA] Performance Updates for row/column conversions #10032
Comments
Tile sizes of 1 column, 1 row, 32 columns, and 32 rows were tried with little to no performance gain. Most saw a loss of performance. |
Changed shared memory sizes to 1/2, 1/4, and 1/16 of the available memory and performance was the same for column to row and degraded for row to column. |
This issue has been labeled |
This issue has been labeled |
Still relevant |
The row/column conversion code isn't as performance as desired. This is a list of things that should be tried to attempt to improve the performance.
The text was updated successfully, but these errors were encountered: