Retune radix sort, run length encoding, reduce by key, scan, select if, and histogram for SM70 and SM80 #208
typedef AgentRadixSortDownsweepPolicy <192, 39, DominantT, BLOCK_LOAD_TRANSPOSE, LOAD_DEFAULT, RADIX_RANK_MEMOIZE, BLOCK_SCAN_WARP_SCANS, SEGMENTED_RADIX_BITS> SegmentedPolicy;
typedef AgentRadixSortDownsweepPolicy <384, 11, DominantT, BLOCK_LOAD_TRANSPOSE, LOAD_DEFAULT, RADIX_RANK_MEMOIZE, BLOCK_SCAN_WARP_SCANS, SEGMENTED_RADIX_BITS - 1> AltSegmentedPolicy;
typedef AgentRadixSortDownsweepPolicy <128, 39, DominantT, BLOCK_LOAD_TRANSPOSE, LOAD_DEFAULT, RADIX_RANK_MEMOIZE, BLOCK_SCAN_WARP_SCANS, SEGMENTED_RADIX_BITS> SegmentedPolicy;
typedef AgentRadixSortDownsweepPolicy <256, 11, DominantT, BLOCK_LOAD_TRANSPOSE, LOAD_DEFAULT, RADIX_RANK_MEMOIZE, BLOCK_SCAN_WARP_SCANS, SEGMENTED_RADIX_BITS - 1> AltSegmentedPolicy;
};
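To make the difference between the two sets of typedefs in this hunk concrete, here is a minimal sketch that compares the nominal tile size (block threads times items per thread) of each variant. `DownsweepShapeSketch` is a hypothetical stand-in for the first two template arguments only; it is not the CUB type. As far as I understand, CUB may also rescale these nominal values by key size, which this sketch ignores.

```cpp
// Hypothetical stand-in for the first two template arguments of
// AgentRadixSortDownsweepPolicy. This is NOT the CUB type; it only
// models the nominal thread-block shape.
template <int BLOCK_THREADS_, int ITEMS_PER_THREAD_>
struct DownsweepShapeSketch
{
    static constexpr int BLOCK_THREADS    = BLOCK_THREADS_;
    static constexpr int ITEMS_PER_THREAD = ITEMS_PER_THREAD_;
    // Nominal number of keys one thread block handles per tile.
    static constexpr int TILE_ITEMS = BLOCK_THREADS_ * ITEMS_PER_THREAD_;
};

// The two SegmentedPolicy shapes that appear in the hunk.
using SegmentedA = DownsweepShapeSketch<192, 39>;
using SegmentedB = DownsweepShapeSketch<128, 39>;

// The two AltSegmentedPolicy shapes that appear in the hunk.
using AltSegmentedA = DownsweepShapeSketch<384, 11>;
using AltSegmentedB = DownsweepShapeSketch<256, 11>;

// In both pairs the items per thread stay the same and only the
// block size changes, so the nominal tile shrinks by one third.
static_assert(SegmentedA::TILE_ITEMS == 7488, "192 * 39");
static_assert(SegmentedB::TILE_ITEMS == 4992, "128 * 39");
static_assert(AltSegmentedA::TILE_ITEMS == 4224, "384 * 11");
static_assert(AltSegmentedB::TILE_ITEMS == 2816, "256 * 11");
```

The sketch says nothing about which shape is faster; that depends on occupancy and register pressure on the target architecture and has to be measured.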
Maybe we should add an SM80 policy?
Andy's PR adds one. Dunno if he tuned the non-onesweep parameters or just pulled them from the SM70 tunings.
cc: @canonizer
For the Policy800 in my PR, I copied the parameters from Policy700 and only tuned the onesweep-related parameters.
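As an illustration of "same as Policy700 except for the onesweep parameters", here is a minimal sketch that uses inheritance instead of copying. Every name here (`AgentPolicySketch`, `Policy700Sketch`, `Policy800Sketch`, `OnesweepPolicy`) is hypothetical and simplified, and the onesweep numbers are placeholders, not real tunings. The comment above describes copying the values, so this is an alternative way to express the same relationship, not what that PR does.

```cpp
#include <type_traits>

// Hypothetical, simplified stand-in for an agent policy. NOT a CUB type.
template <int BLOCK_THREADS_, int ITEMS_PER_THREAD_>
struct AgentPolicySketch
{
    static constexpr int BLOCK_THREADS    = BLOCK_THREADS_;
    static constexpr int ITEMS_PER_THREAD = ITEMS_PER_THREAD_;
};

// Hypothetical model of an SM70 tuning policy.
struct Policy700Sketch
{
    // Shape taken from one of the SegmentedPolicy lines in the diff.
    using SegmentedPolicy = AgentPolicySketch<128, 39>;
    // PLACEHOLDER values for illustration only, not real tunings.
    using OnesweepPolicy = AgentPolicySketch<1, 1>;
};

// Hypothetical model of an SM80 tuning policy: it inherits every
// member of the SM70 sketch and overrides only the onesweep policy.
struct Policy800Sketch : Policy700Sketch
{
    // PLACEHOLDER values for illustration only, not real tunings.
    using OnesweepPolicy = AgentPolicySketch<2, 2>;
};

// The non-onesweep parameters are shared with the SM70 sketch...
static_assert(std::is_same<Policy800Sketch::SegmentedPolicy,
                           Policy700Sketch::SegmentedPolicy>::value,
              "SegmentedPolicy is inherited unchanged");
// ...while the onesweep parameters differ.
static_assert(!std::is_same<Policy800Sketch::OnesweepPolicy,
                            Policy700Sketch::OnesweepPolicy>::value,
              "OnesweepPolicy is overridden");
```

A layout like this makes it visible at a glance which SM80 parameters were actually tuned and which are carried over from SM70, which is the question raised in this thread.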
6204466 to 8f881de
Retune radix sort, run length encoding, reduce by key, scan, select if, and histogram for SM70 and SM80.
8f881de to a3d89dc
DVS CL: 29273912
Most of the algorithms have been changed significantly since this PR was opened. We'll have to tune algorithms in a more systematic way. Further work is tracked by the following issue.