Double peak memory cost in `cast_memory_op` #4153

dyzheng · 2024-05-11T12:36:36Z

caic99 · 2024-05-12T14:11:15Z

Hi @dyzheng ,
I'm interested in this problem and wonder whether the type conversion really happens.
FYI, you can select lines of code and paste their permalink into the comment box to embed the code. This makes the referenced source easier to access.

abacus-develop/source/module_esolver/esolver_ks_pw.cpp

Lines 193 to 196 in b7e91aa

    
    this->kspw_psi = GlobalV::device_flag == "gpu"
                             || GlobalV::precision_flag == "single"
                         ? new psi::Psi<T, Device>(this->psi[0])
                         : reinterpret_cast<psi::Psi<T, Device>*>(this->psi);
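
To make the question concrete, here is a minimal sketch of the two branches in that ternary. It is an illustration under stated assumptions, not the ABACUS implementation: `ToyPsi`, `select_psi`, and `peak_bytes_during_copy` are hypothetical names, and the real `psi::Psi` constructor and `cast_memory_op` may allocate differently. The sketch assumes that the copy branch allocates a second buffer while the source is still alive, which is what would raise peak memory, whereas the `reinterpret_cast` branch only aliases the existing object.

```cpp
#include <cassert>
#include <complex>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for psi::Psi: owns a flat array of coefficients.
template <typename T>
struct ToyPsi {
    std::vector<T> data;

    ToyPsi() = default;
    explicit ToyPsi(std::size_t n) : data(n) {}

    // Converting constructor: allocates fresh storage and casts element-wise.
    // While it runs, the source and destination buffers are both alive.
    template <typename U>
    explicit ToyPsi(const ToyPsi<U>& other) : data(other.data.size()) {
        for (std::size_t i = 0; i < data.size(); ++i) {
            data[i] = static_cast<T>(other.data[i]);
        }
    }

    std::size_t bytes() const { return data.size() * sizeof(T); }
};

// Mirrors the shape of the ternary above (assumption: needs_copy stands for
// "device is gpu or precision is single").
// The alias branch is only meaningful when T matches the source element type.
template <typename T>
ToyPsi<T>* select_psi(ToyPsi<std::complex<double>>* src, bool needs_copy) {
    if (needs_copy) {
        return new ToyPsi<T>(*src);            // second allocation, caller must delete
    }
    return reinterpret_cast<ToyPsi<T>*>(src);  // no allocation, same object
}

// Bytes held at the moment the copy has finished: source plus destination.
template <typename Src, typename Dst>
std::size_t peak_bytes_during_copy(const ToyPsi<Src>& src) {
    ToyPsi<Dst> dst(src);
    return src.bytes() + dst.bytes();
}
```

In this toy model, a same-precision copy holds twice the source size at its peak, and a double-to-single conversion holds one and a half times the source size. Whether the real code path behaves this way depends on how `psi::Psi` and `cast_memory_op` manage their buffers.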

dyzheng assigned mohanchen May 11, 2024

mohanchen added the GPU & DCU & HPC label May 11, 2024

denghuilu mentioned this issue May 11, 2024

Enhance PSI Constructor: Lower Peak Device Memory Usage #4154

Merged

4 tasks

dyzheng closed this as completed in #4154 May 13, 2024

caic99 mentioned this issue May 15, 2024

Refactor: optimize cast_memory_op #4160

Merged

4 tasks

denghuilu linked a pull request May 15, 2024 that will close this issue

Refactor: optimize cast_memory_op #4160

Merged

4 tasks

WHUweiqingzhou assigned denghuilu May 20, 2024


dyzheng commented May 11, 2024 (edited by caic99)


Describe the bug

Expected behavior

To Reproduce

Environment

Additional Context

Task list for Issue attackers (only for developers)
