Conv Shift Operator #4591

Merged: 6 commits merged into PaddlePaddle:develop from the conv_shift branch on Oct 10, 2017

Conversation

@mkliegl (Contributor) commented on Oct 4, 2017:

This closes #4574.

This is an unoptimized, Eigen-based port of the existing code.

Examples of possible future optimizations include:

  • For M >> log N or N >> log M, it could make sense to use FFT to do circular convolution.
  • Instead of a loop over the batch dimension on the outside, it could be faster to do a batchwise operation inside the loop like out.col(i) = x.col(index) * y.col(j). This would make the most sense if we either had column-major storage order or if the batch dimension were last rather than first.

I would probably leave such optimizations to the future when we can profile an actual use case and see whether the improvements are worth it.
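
For readers unfamiliar with the operator, here is a minimal, loop-based sketch of the circular convolution being discussed. This is only an illustration under stated assumptions, not the code in this PR: the function name ConvShiftForward, the plain row-major Eigen matrices, and the shapes x = [batch, M], y = [batch, N] with N odd and N <= M are all assumptions made for the sketch.

// Unoptimized circular convolution ("conv shift"): for each batch row,
// out[i] = sum_j x[(i + j - (N-1)/2) mod M] * y[j].
#include <Eigen/Dense>

using RowMatrix =
    Eigen::Matrix<float, Eigen::Dynamic, Eigen::Dynamic, Eigen::RowMajor>;

RowMatrix ConvShiftForward(const RowMatrix& x, const RowMatrix& y) {
  const Eigen::Index batch = x.rows();
  const Eigen::Index width = x.cols();    // M
  const Eigen::Index y_width = y.cols();  // N, assumed odd and <= M
  const Eigen::Index half = (y_width - 1) / 2;

  RowMatrix out = RowMatrix::Zero(batch, width);
  for (Eigen::Index b = 0; b < batch; ++b) {        // batch loop
    for (Eigen::Index i = 0; i < width; ++i) {      // output position
      for (Eigen::Index j = 0; j < y_width; ++j) {  // kernel position
        // Circular indexing; adding width keeps the value non-negative.
        const Eigen::Index index = (i + j - half + width) % width;
        out(b, i) += x(b, index) * y(b, j);
      }
    }
  }
  return out;
}

The FFT and batchwise-column ideas above would replace the two inner loops; the sketch deliberately keeps the explicit triple-loop structure that the description calls unoptimized.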

@dzhwinter Would you mind taking a look? This is my first PR to this repo - your feedback would be much appreciated!

@mkliegl requested a review from dzhwinter on October 4, 2017 at 18:04
@mkliegl (Contributor, Author) commented on Oct 4, 2017:

I think this code does not work for GPU - I will try to fix that. For an overall review, it may be better to wait until that is done. (That said, this code will probably stay about the same for CPU, so I would appreciate feedback on that any time.)

@dzhwinter (Contributor) left a comment:

Great job! Thanks for the detailed comments! There are only a few pieces of code that need to be fixed. :)

namespace paddle {
namespace operators {

using Tensor = framework::Tensor;
A reviewer commented on the lines above:

Using using in a header file violates the Google style; see the details in the Namespaces section of the style guide:

Do not use Namespace aliases at namespace scope in header files except in explicitly marked internal-only namespaces, because anything imported into a namespace in a header file becomes part of the public API exported by that file.

By the way, I think

using framework::Tensor;

is enough here.

Some old operators set a wrong example here, sorry for that.

Another contributor replied:

I just saw this comment by chance.

The Google style says:

Do not use Namespace aliases at namespace scope in header files

I think:

using Tensor = framework::Tensor

this usage does not violate the Google style quoted above, because it is a type alias, not a namespace alias. But I admit that using a type alias at such a large scope is not a good thing, so you are right...
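
To make the outcome of this exchange concrete, here is a rough sketch of the layout it points toward. The file names and the surrounding declarations are assumptions, not the actual diff: nothing is imported at namespace scope in the header, and the shorthand is confined to the .cc file.

// conv_shift_op.h (hypothetical name) -- no using declarations or aliases at
// namespace scope, so nothing extra becomes part of the header's public API.
namespace paddle {
namespace operators {

// ... operator and kernel declarations, written against framework::Tensor ...

}  // namespace operators
}  // namespace paddle

// conv_shift_op.cc (hypothetical name) -- the shorthand lives only in this
// translation unit, which is what the Google style guide allows.
namespace paddle {
namespace operators {

using framework::Tensor;  // or the alias form: using Tensor = framework::Tensor;

// ... definitions here may refer to Tensor directly ...

}  // namespace operators
}  // namespace paddle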

size_t y_width = y_dims[1];
size_t y_half_width = (y_width - 1) / 2;

// The below trades code duplication for efficiency (keeping the if
A reviewer commented on the lines above:

Good point! An if condition inside a three-fold for loop would cost a lot of time.
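
To illustrate why the branch matters, and one way of avoiding it: for a fixed output position, the N source elements of x form at most two contiguous runs, so the wrap-around can be handled by splitting the inner loop once rather than testing a condition (or taking a modulo) on every element. The sketch below is hypothetical, single-row, and not the code in this PR; the function and variable names are made up for illustration.

#include <algorithm>
#include <cstddef>
#include <vector>

// Circular convolution of one row: out[i] = sum_j x[(i + j - half) mod M] * y[j],
// with the wrap handled by splitting the inner loop instead of branching per element.
void ConvShiftRowSplit(const std::vector<float>& x, const std::vector<float>& y,
                       std::vector<float>* out) {
  const size_t M = x.size();
  const size_t N = y.size();  // assumed odd and <= M
  const size_t half = (N - 1) / 2;
  out->assign(M, 0.0f);
  for (size_t i = 0; i < M; ++i) {
    const size_t start = (i + M - half) % M;          // one modulo per output element
    const size_t first_len = std::min(N, M - start);  // length of the run before any wrap
    float acc = 0.0f;
    for (size_t j = 0; j < first_len; ++j) {          // contiguous run starting at x[start]
      acc += x[start + j] * y[j];
    }
    for (size_t j = first_len; j < N; ++j) {          // remainder after wrapping to x[0] (empty if no wrap)
      acc += x[j - first_len] * y[j];
    }
    (*out)[i] = acc;
  }
}

The comment quoted above suggests the actual kernel instead duplicates a small amount of code to keep the check out of the hot path; either way, the point is the same: the per-element if disappears from the innermost loop.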

Markus Kliegl added 5 commits on October 10, 2017 (00:23). From the commit messages:

Limitations:
  • both gradient outputs must be specified and are always computed
  • explicit for loops => could be optimized in various ways (e.g., different memory layout)

fix case when not all output gradients desired
@mkliegl (Contributor, Author) commented on Oct 10, 2017:

@dzhwinter @lcy-seso

Thank you for the feedback! I moved the using declaration you mentioned into the *.cc file and wrote an initial GPU implementation. I also cleaned the code up a little bit. The code is not heavily optimized, but it passes the tests. I think it is ready to be merged.

@lcy-seso (Contributor) commented:

This PR LGTM, quite clean and well commented~

@lcy-seso (Contributor) left a review:

LGTM. Thank you.

@mkliegl merged commit a281b38 into PaddlePaddle:develop on Oct 10, 2017
@mkliegl deleted the conv_shift branch on October 10, 2017 at 18:24