Split CUDA code (.cu) from CPU code (.cpp). #152

sergeyk · 2014-02-25T00:33:06Z

This will enable CPU-only Caffe compilation (#3), and go a long way toward fixing the OS X 10.9 CUDA compilation problems (#122).

Checklist [shelhamer]:

split layers into cpp and cu (#172, "Splitting source files between CUDA and CPU code.")
split enough for boost-eigen to build on osx (hide boost rng from nvcc) (#165, "MKL/non-MKL Reconciliation")
split and tag tests as CPU / GPU
split math_functions into cpp and cu
split common into normal and cpu-only singletons?

Follow-up: abstract CPU / GPU device computation #610

tdomhan · 2014-02-25T07:10:18Z

What's the plan for going about this? Split each layer into two subclasses, e.g. ConvLayerCPU and ConvLayerGPU, or have dummy Forward_GPU functions in the *.cpp files?
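To make the second option concrete, here is a minimal sketch of a layer whose .cpp file supplies a dummy GPU method when CUDA is unavailable. Everything in it is an assumption for illustration: `ScaleLayer`, the `Forward_cpu` / `Forward_gpu` signatures, and the `HYPOTHETICAL_WITH_CUDA` macro are made up and are not taken from the Caffe source.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical layer used only for illustration; not a real Caffe class.
class ScaleLayer {
 public:
  explicit ScaleLayer(float scale) : scale_(scale) {}
  void Forward_cpu(const std::vector<float>& bottom,
                   std::vector<float>* top) const;
  void Forward_gpu(const std::vector<float>& bottom,
                   std::vector<float>* top) const;

 private:
  float scale_;
};

// CPU implementation: would live in the .cpp file.
void ScaleLayer::Forward_cpu(const std::vector<float>& bottom,
                             std::vector<float>* top) const {
  assert(top != nullptr);
  top->resize(bottom.size());
  for (std::size_t i = 0; i < bottom.size(); ++i) {
    (*top)[i] = scale_ * bottom[i];
  }
}

#ifndef HYPOTHETICAL_WITH_CUDA
// Dummy GPU method: compiled into the .cpp file only when the real
// definition from the .cu file is not part of the build.
void ScaleLayer::Forward_gpu(const std::vector<float>&,
                             std::vector<float>*) const {
  throw std::runtime_error("Forward_gpu called in a CPU-only build");
}
#endif
```

With this layout the class declaration stays identical in both builds; only the translation unit that defines `Forward_gpu` changes, so callers do not need their own ifdefs.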

sergeyk · 2014-02-26T03:14:36Z

@erictzeng it would be good if you submitted a work-in-progress pull request for this, or shared your plan here, so that others can provide feedback before too much work is done.

shelhamer · 2014-02-26T03:16:37Z

Sorry, we talked off-list about this. He's waiting for #142 and #163 to be merged, but he has already made the split: it builds and the tests pass.

@erictzeng it'd be good to push and PR your current work to dev anyway, and we can look at it once the mentioned PRs are folded in.

shelhamer · 2014-02-27T06:57:20Z

Closing since #172 is in.

kloudkl · 2014-03-17T11:13:42Z

The discussion in #172 indicates that a CPU-only version is only possible when the CPU and CUDA code live in different classes or, better, in different namespaces, as was done in OpenCV. Do you agree?

tdomhan · 2014-03-17T11:50:20Z

@kloudkl what exactly did the OpenCV folks do? Something like having the namespaces opencv_cpu and opencv_gpu?

kloudkl · 2014-03-18T06:25:30Z

https://github.com/Itseez/opencv

kloudkl · 2014-03-18T06:28:45Z

The modules dir contains cuda and cudev namespaces to separate the GPU-related code. The platforms dir includes cmake files and build scripts specific to the most common platforms.

shelhamer · 2014-03-21T21:00:25Z

Splitting common into cpp and cuda code with a common interface that resolves the osx issue in #165 turns out to be intricate. An immediate frustration is that the Caffe singleton itself relies on curand and cublas. Although random number generation and blas can be abstracted into strategies for cpu/gpu operation, this doesn't handle (1) switching modes or (2) mixed cpu+gpu implementations.

Instead of totally splitting, the simplest approach could be to implement two Caffe singletons

the normal Caffe singleton, already coded in common.cpp
a cpu-only Caffe singleton, to be coded in common_cpu.cpp

that both obey the same interface. However, the cpu-only singleton will have no-ops for gpu methods and will instead complain through warnings when gpu methods are called or gpu args are passed. In this way the rest of Caffe, such as the solver, tools, and the like, can stay the same. Instead of ifdefs all around, the changes will be isolated to the Caffe singletons.
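A rough sketch of what the cpu-only singleton could look like, under stated assumptions: the method names (`Get`, `mode`, `set_mode`, `SetDevice`) and the `Brew` enum are modeled loosely on this discussion and should be treated as illustrative, and `warning_count` is a hypothetical helper added only so the behavior can be checked. This is not the code from any PR.

```cpp
#include <iostream>

// Hypothetical cpu-only variant of the singleton (the common_cpu.cpp side).
// It exposes the same interface as the normal one, but gpu requests are
// no-ops that emit a warning.
class Caffe {
 public:
  enum Brew { CPU, GPU };

  static Caffe& Get() {
    static Caffe instance;  // constructed on first use
    return instance;
  }

  static Brew mode() { return Get().mode_; }

  // Requesting GPU mode warns and leaves the singleton in CPU mode.
  static void set_mode(Brew requested) {
    if (requested == GPU) {
      Warn("set_mode(GPU)");
      return;
    }
    Get().mode_ = requested;
  }

  // Device selection has no meaning without a GPU: warn and do nothing.
  static void SetDevice(int device_id) {
    (void)device_id;
    Warn("SetDevice");
  }

  // Hypothetical helper, present only to make the sketch testable.
  static int warning_count() { return Get().warnings_; }

 private:
  Caffe() : mode_(CPU), warnings_(0) {}
  Caffe(const Caffe&) = delete;
  Caffe& operator=(const Caffe&) = delete;

  static void Warn(const char* what) {
    ++Get().warnings_;
    std::cerr << "warning: " << what << " ignored in cpu-only build\n";
  }

  Brew mode_;
  int warnings_;
};
```

Because callers such as the solver or tools would only see `Caffe::set_mode` and friends, the choice between the two singletons could be made purely by which source file is compiled in.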

shelhamer · 2014-07-18T11:06:56Z

Done in #561. #610 will carry it further to simplify the code and pave the way for different hardware backends.

sergeyk assigned erictzeng Feb 25, 2014

shelhamer mentioned this issue Feb 26, 2014

Sort out models, data, and tools #142

Merged

shelhamer mentioned this issue Feb 26, 2014

MKL/non-MKL Reconciliation #165

Merged

erictzeng mentioned this issue Feb 27, 2014

Splitting source files between CUDA and CPU code. #172

Merged

shelhamer mentioned this issue Feb 27, 2014

boost-eigen branch doesn't build on OSX 10.7, 10.9 (10.8 untested) #122

Closed

shelhamer closed this as completed Feb 27, 2014

shelhamer mentioned this issue Feb 27, 2014

Concat layer #125

Merged

shelhamer reopened this Mar 10, 2014

sergeyk added this to the 1.0 milestone Mar 13, 2014

kloudkl mentioned this issue Mar 17, 2014

How to run a pretrained model on CPU-only machine #211

Closed

sergeyk removed this from the 1.0 milestone Apr 22, 2014

shelhamer closed this as completed Jul 18, 2014

gheinrich pushed a commit to gheinrich/caffe that referenced this issue May 30, 2016

Merge pull request BVLC#152 from drnikolaev/caffe-0.15-timing-names

cf75318

Add printing GPU name in timing mode
