Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Use less thread blocks for find_uncached kernel
Summary: Find uncached takes quite long time and this often overalap with the main stream in prefetch mode. We do want it to be overlapped with other kernels so it has minimal effect on the main stream. This will likely make cache prefetch taking slightly longer (not significantly as prefetch spent longest time in data transfer), but will make overlapping main stream kernels faster. Differential Revision: D62401630
- Loading branch information