Design doc of compile time register gradient operators #4517
Conversation
doc/design/register_grad_op.md
> ## Problem
>
> Since we separate users program in two stages, compile time and runtime, we should record and look up the mapping relationship between an operator and its gradient operators when compile. However, we register this relationship in runtime by these `OpInfo` fields.
The execution of a neural network topology in PaddlePaddle is separated into two stages: compile-time and run-time.
At compile-time, a ProgramDesc is generated. At run-time, the ProgramDesc is executed on specific hardware. We can refer to the design of computation graphs.
The gradient operator's OpDesc is also generated at compile-time, so we have to find the mapping between an operator's OpDesc and its gradient operator's OpDesc.
However, we currently establish the mapping between an operator and its gradient operator at run-time, in the OpInfo class.
The Problem Posed

In our current operator registration mechanism, for each operator the programmer registers a gradient operator creator function, which takes a C++ operator instance and returns the corresponding gradient operator instance.
However, since we have decided to separate the compilation and execution of DL models, we need to reshape the creator so that it takes a protobuf OpDesc message and returns one or more corresponding OpDesc messages.
Moreover, the new registration mechanism needs to support the fact that an operator's gradient computation might be a composition of several operators.