-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix concat_op #9337
Fix concat_op #9337
Conversation
self.axis = 1 | ||
|
||
|
||
class TestConcatOp2(OpTest): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Seems that this case is somehow duplicated with the above one, how about change to test axis==0?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No, there are two CUDA kernels for Concat
, they represent two case, TestConcatOp
only covers one, TestConcatOp2
can covers the other.
outputs_ptr[i] = outputs[i].data<T>(); | ||
} | ||
|
||
T** outs_gpu = | ||
T** dev_out_gpu_data = | ||
reinterpret_cast<T**>(outputs_data.CUDAMutableData(context.GetPlace())); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please double check that whether static_cast
is more suitable.
4efe843
to
b68d53e
Compare
… feature/fix_concat
b68d53e
to
aca9180
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
No description provided.