Class member align problem #88
ghost assigned heikoburau on Oct 16, 2013
@psychocoderHPC Did #100 close that?
It looks very good; I can't find any byte-wise loads of pointers in the PTX code. I think we can close this issue!
Closed, thx @Heikman and @psychocoderHPC
psychocoderHPC pushed a commit to psychocoderHPC/picongpu that referenced this issue on May 9, 2019
a96b293 Merge pull request ComputationalRadiationPhysics#106 from fwyzard/cupla_accelerator_namespace 1af2b2b Merge pull request ComputationalRadiationPhysics#110 from tdd11235813/cmake-macro 663521c Changes function to macro 'CUPLA_ADD_EXECUTABLE'. a440a7c Fix namespace definition if two accelerators are selected 64f02bb Move all accelerator-specific code to an inline namespece bbef699 Merge pull request ComputationalRadiationPhysics#105 from fwyzard/tbb_IsThreadSeqAcc b463a25 Mark the TBB backend as "thread sequential" b45d4e9 Merge pull request ComputationalRadiationPhysics#102 from psychocoderHPC/topic-updateToLatestAlapkaDev 7e8b832 createTaskExec -> createTaskKernel 84b6dc1 update to latest alapka develop branch 037f7b2 Merge pull request ComputationalRadiationPhysics#100 from DerWaldschrat/dev 7add5ed fix -Wreorder in cuplaPitchedPtr (#4) 9fa745b Merge pull request ComputationalRadiationPhysics#99 from ax3l/topic-cmake312rootHints 041d454 Merge pull request ComputationalRadiationPhysics#98 from DerWaldschrat/dev-fix ecc5839 CMake: Honor _ROOT Env Hints 24d159a Added conditional check for backends. 
5822555 Add cudaEventBlockingSync to event creation 906cd41 Merge pull request ComputationalRadiationPhysics#97 from psychocoderHPC/topic-HIPBackend bc67259 alpaka backend: HIP 87fe58d Merge pull request ComputationalRadiationPhysics#96 from psychocoderHPC/fix-kernelSharedMemCalc 68877b5 fix dynamic shared mem definition 9452a70 Merge pull request ComputationalRadiationPhysics#95 from psychocoderHPC/topic-updateDocs 8b26918 update docs e822bfa Merge pull request ComputationalRadiationPhysics#89 from psychocoderHPC/topic-switchToAlpakaDevelop0.4.X 50cedbc alpaka renamings + cmake version incerase 5ecd227 update to alapka dev upcoming version 0.4.X 66a9240 Merge pull request ComputationalRadiationPhysics#94 from psychocoderHPC/topic-ciUpdateToCUDA8 3ba6104 CI: update CUDA from 7.5 to 8.0 5af56ce Merge pull request ComputationalRadiationPhysics#93 from psychocoderHPC/topic-updateCIToClang4 68a6555 Travis: CUDA Compiler Ignore 279bfd0 Travis: Xenial 9d583b7 CI: switch from clan 3.5 to 4.0 11eb874 Merge pull request ComputationalRadiationPhysics#92 from ax3l/topic-ciCMake3110 da617da Travis: CMake 3.11.0 da63524 Merge pull request ComputationalRadiationPhysics#91 from ComputationalRadiationPhysics/dev 66a2da7 Merge pull request ComputationalRadiationPhysics#88 from psychocoderHPC/topic-versionIncreaseTo0.1.1 e029584 version increase to 0.1.1 ffdb268 Merge pull request ComputationalRadiationPhysics#90 from psychocoderHPC/fix-tbbSelection 06030af fix TBB backend activation aa8a921 Merge pull request ComputationalRadiationPhysics#84 from psychocoderHPC/topic-cuplaGetErrorString 55c99b7 add implementation for `cuplaGetErrorString` c765052 Merge pull request ComputationalRadiationPhysics#86 from psychocoderHPC/topic-cuplaPeekAtLastError 49612bb Merge pull request ComputationalRadiationPhysics#87 from psychocoderHPC/topic-helpUpdatecuplaGetLastError c763de4 Merge pull request ComputationalRadiationPhysics#85 from psychocoderHPC/topic-cuplaStreamQuerySupport 5d19923 update 
documentation of `cuplaGetLastError` 3806b8f implement `cuplaPeekAtLastError()` 2e79816 add `cuplaStreamQuery()` 7c73dc8 Merge pull request ComputationalRadiationPhysics#83 from psychocoderHPC/fix-cuplaGetLastError b02299a fix `cuplaGetLastError` 1b7d7ea Merge pull request ComputationalRadiationPhysics#81 from tdd11235813/master b0adfd9 Updates alpaka to 0.3.5. git-subtree-dir: thirdParty/cupla git-subtree-split: a96b293dcb14662a57901b55dfedaa1cd615adc9
psychocoderHPC pushed a commit to psychocoderHPC/picongpu that referenced this issue on May 17, 2019
Members inside a class must be ordered by their size: large members first, then smaller ones. If you don't do this, you get undefined behavior with sm_13 and slow byte-by-byte copies inside a kernel on higher architectures.
This may be the cause of compiler bug #76. IMO both bugs point to the same code lines.
PTX example with bad sm_35 code:
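A minimal host-side C++ sketch of the ordering rule described above, assuming ordinary (non-packed) class layout. The type names `SmallFirst` and `LargeFirst` and the helpers `paddingBytes` and `printLayout` are hypothetical and not taken from the PIConGPU sources. It only illustrates how member order changes offsets and padding; it does not reproduce the sm_13 or sm_35 behavior reported here.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdio>

// Hypothetical types for illustration only; not taken from PIConGPU.

// Small member first: the compiler has to insert padding in front of
// `data` (and usually at the end) to keep the pointer naturally aligned.
struct SmallFirst
{
    std::uint8_t flag;
    double* data;
    std::uint16_t id;
};

// Largest member first, then smaller ones, as the issue recommends.
struct LargeFirst
{
    double* data;
    std::uint16_t id;
    std::uint8_t flag;
};

// Padding = object size minus the summed sizes of the three members.
template<typename T>
constexpr std::size_t paddingBytes()
{
    return sizeof(T)
        - (sizeof(double*) + sizeof(std::uint16_t) + sizeof(std::uint8_t));
}

// In the reordered class the pointer sits at the very start of the object.
static_assert(offsetof(LargeFirst, data) == 0, "pointer should come first");

// Print size, pointer offset and padding of both layouts.
// The concrete numbers depend on the target ABI.
void printLayout()
{
    assert(paddingBytes<LargeFirst>() <= paddingBytes<SmallFirst>());
    std::printf("SmallFirst: size=%zu offset(data)=%zu padding=%zu\n",
                sizeof(SmallFirst), offsetof(SmallFirst, data),
                paddingBytes<SmallFirst>());
    std::printf("LargeFirst: size=%zu offset(data)=%zu padding=%zu\n",
                sizeof(LargeFirst), offsetof(LargeFirst, data),
                paddingBytes<LargeFirst>());
}
```

Calling `printLayout()` from `main` shows the two layouts side by side. Whether a given device compiler emits byte-wise loads for a particular layout still has to be checked in the generated PTX.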