Flops look correct, but the bandwidth numbers for the 5-d local kernels look way off. A quick look at the code suggests that these aren't special-cased and the default `DslashCuda::bytes()` is being used, but I'll need to check for more than 30 seconds to work this out exactly.
```
Tuned block=(128,3,1), shared=24577 giving 71.00 Gflop/s, 1116.09 GB/s for N4quda16MDWFDslashPCCudaI7double2S1_EE with type=single-GPU,reconstruct=18,Dslash4pre
Tuned block=(32,4,1), shared=0 giving 170.86 Gflop/s, 347.94 GB/s for N4quda16MDWFDslashPCCudaI7double2S1_EE with type=single-GPU,reconstruct=18,Dslash4
Tuned block=(32,4,1), shared=16385 giving 155.89 Gflop/s, 181.87 GB/s for N4quda16MDWFDslashPCCudaI7double2S1_EE with type=single-GPU,reconstruct=18,Dslash5inv
Tuned block=(32,2,1), shared=24577 giving 64.99 Gflop/s, 895.91 GB/s for N4quda16MDWFDslashPCCudaI7double2S1_EE with type=single-GPU,reconstruct=18,Xpay,Dslash5
Executing 10 kernel loops...
```