Feature/use cudnn #7141
Conversation
@@ -73,8 +73,7 @@ cc_test(var_type_inference_test SRCS var_type_inference_test.cc DEPS op_registry
 cc_library(selected_rows SRCS selected_rows.cc DEPS tensor)
 cc_test(selected_rows_test SRCS selected_rows_test.cc DEPS selected_rows)

-cc_library(init SRCS init.cc DEPS gflags device_context place stringpiece)
+cc_library(init SRCS init.cc DEPS gflags device_context place stringpiece operator)
init does rely on operator
void DummyTrans(const platform::DeviceContext* ctx,
                const KernelTypePair& kernel_pair, const Variable& in,
                Variable* out) {
  PADDLE_ENFORCE(in.IsType<Tensor>(), "Only Support Tensor transform!.");
Since all Tensors are actually LoDTensors, this should be in.IsType<LoDTensor>().
OK, will fix it in the next PR.
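For reference, a minimal sketch of the suggested change (only the type check differs from the snippet above; the rest of the dummy transform is elided):

```cpp
// Sketch of the suggested fix: check for LoDTensor instead of Tensor, since
// every Tensor handled by the framework is in practice a LoDTensor.
void DummyTrans(const platform::DeviceContext* ctx,
                const KernelTypePair& kernel_pair, const Variable& in,
                Variable* out) {
  PADDLE_ENFORCE(in.IsType<LoDTensor>(), "Only Support LoDTensor transform!");
  // ... rest of the dummy transform unchanged ...
}
```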
#endif
}

void UseALL() {
Since UseCUDNN calls UseCUDA, and UseCUDA calls UseMKLDNN, and so on, UseALL is not needed. We can call UseCUDNN directly.
Actually, each UseXXX recursively calls the previous UseXXX. But having UseALL just call UseCUDNN would look odd, so this makes the intent clearer. In any case this interface should be removed in the future; we should ONLY allow users to configure ops through their attributes.
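To make the chaining described in these two comments concrete, here is an illustrative sketch; the kKernelPriority container and the (place, library) string pairs are stand-ins, not the PR's actual types:

```cpp
#include <string>
#include <utility>
#include <vector>

// Stand-in for the real priority storage: each entry is a (place, library)
// preference, ordered from least to most preferred.
static std::vector<std::pair<std::string, std::string>> kKernelPriority;

void UseCPU() { kKernelPriority.push_back({"CPU", "Plain"}); }

void UseMKLDNN() {
  UseCPU();  // each Use* first applies the previous, weaker preference
#ifdef PADDLE_WITH_MKLDNN
  kKernelPriority.push_back({"CPU", "MKLDNN"});
#endif
}

void UseCUDA() {
  UseMKLDNN();
#ifdef PADDLE_WITH_CUDA
  kKernelPriority.push_back({"CUDA", "Plain"});
#endif
}

void UseCUDNN() {
  UseCUDA();
#ifdef PADDLE_WITH_CUDA
  kKernelPriority.push_back({"CUDA", "CUDNN"});
#endif
}

// Functionally UseALL only needs to forward to UseCUDNN, which is why the
// review above calls it redundant; it is kept as an explicit entry point.
void UseALL() { UseCUDNN(); }
```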
  if ((actual_kernel_key == candidate_key) ||
      (kernels.count(candidate_key) &&
       trans_map.GetNullable(candidate_pair))) {
    expected_kernel_key = candidate_key;
The default Priority will overwrite the user's configuration. We should strictly obey the user's configuration first; only if the user does not provide a preference should we pick a kernel key guided by the default Priority.
Yes, this does not obey the user-configuration-first rule. Will fix it in the next PR.
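A hypothetical sketch of the agreed rule (the KernelKey struct and the is_registered callback are illustrative stand-ins, not the framework's real OpKernelType machinery):

```cpp
#include <functional>
#include <string>
#include <vector>

// Illustrative stand-in for a kernel key.
struct KernelKey {
  std::string place;
  std::string library;
};

// Pick a kernel key obeying the user's configuration first; only fall back
// to the default priority list when the user expressed no preference.
KernelKey ChooseKernelKey(
    const std::vector<KernelKey>& user_configured,
    const std::vector<KernelKey>& default_priority,
    const KernelKey& actual,
    const std::function<bool(const KernelKey&)>& is_registered) {
  for (const auto& key : user_configured) {
    if (is_registered(key)) return key;  // user's choice always wins
  }
  for (const auto& key : default_priority) {
    if (is_registered(key)) return key;  // default Priority as fallback
  }
  return actual;  // otherwise keep the key inferred from the inputs
}
```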
The costs of the different DataTrans are not the same; from small to large they are: DataType, Layout, CPU<->GPU. When choosing candidate_key, these costs should be taken into account.
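An illustrative sketch of the ordering being proposed here; the struct and the numeric weights are invented purely to show the relative ranking:

```cpp
#include <string>

// Invented weights that only encode the relative ordering described above:
// a data-type cast is cheapest, a layout transform (e.g. MKLDNN <-> kPlain)
// costs more, and a CPU <-> GPU copy costs the most.
struct CandidateKey {
  std::string data_type;
  std::string layout;
  std::string place;
};

int TransformCost(const CandidateKey& from, const CandidateKey& to) {
  int cost = 0;
  if (from.data_type != to.data_type) cost += 1;
  if (from.layout != to.layout) cost += 10;
  if (from.place != to.place) cost += 100;
  return cost;  // candidates with lower cost would be preferred
}
```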
I think we are over-complicating the problem. So far the only needs are CPU <-> GPU and MKLDNNLayout <-> kPlain. In the next PR it is enough to let the user choose whether to use them via the op's attribute; taking cost into account would turn this into something like TensorFlow's cost model.
Premature optimization is the root of all evil.
Since this PR blocks another one, some of the fixes will be done together in a follow-up PR. Thanks! These fixes will be done in #6660.