
feat: support flatten and reshape via shuffle_layer #2354

Merged (11 commits into pytorch:main on Oct 6, 2023)

Conversation

@zewenli98 (Collaborator) commented Sep 29, 2023

Description

Support flatten and reshape via shuffle_layer

Fixes #2214 (Expose IShuffleLayer in dynamo.conversion.impl)

Type of change

Please delete options that are not relevant and/or add your own.

  • New feature (non-breaking change which adds functionality)

Checklist:

  • My code follows the style guidelines of this project (You can use the linters)
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas and hacks
  • I have made corresponding changes to the documentation
  • I have added tests to verify my fix or my feature
  • New and existing unit tests pass locally with my changes
  • I have added the relevant labels to my PR so that relevant reviewers are notified

@github-actions github-actions bot added labels on Sep 29, 2023: component: api [Python] (Issues re: Python API), component: conversion (Issues re: Conversion stage), component: converters (Issues re: Specific op converters), component: dynamo (Issues relating to the `torch.compile` or `torch._dynamo.export` paths), component: tests (Issues re: Tests)
@github-actions github-actions bot requested a review from peri044 September 29, 2023 03:58
@zewenli98 (Collaborator, Author):

@bowang007

@zewenli98 (Collaborator, Author):

When I test flatten via torch.flatten(inputs, start_dim, end_dim), I get the error: AssertionError: False is not true : expected ops {<OpOverload(op='aten.flatten', overload='using_ints')>}, actual ops {<OpOverload(op='aten.view', overload='default')>, <OpOverloadPacket(op='aten.sym_size')>, <built-in function mul>}
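
(For context, a minimal sketch of the decomposition behind that error; the module, the input shape, and the use of torch.export are illustrative assumptions rather than the actual test harness. Tracing torch.flatten lowers it to aten.view, plus sym_size/mul shape arithmetic when dynamic shapes are involved, so no aten.flatten node reaches the converter.)

import torch


class FlattenModule(torch.nn.Module):
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.flatten(x, 1, 2)


# Exporting lowers torch.flatten; inspect the graph to see the resulting ops.
exported = torch.export.export(FlattenModule(), (torch.randn(2, 3, 4, 5),))
print(exported.graph)  # expect aten.view nodes rather than aten.flatten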

@zewenli98 zewenli98 self-assigned this Sep 29, 2023
@zewenli98 zewenli98 requested review from gs-olive and removed request for peri044 September 29, 2023 17:15
@github-actions github-actions bot requested a review from peri044 September 29, 2023 18:53
@zewenli98 zewenli98 force-pushed the shuffle_dynamo_converter branch from b414928 to d212e51 Compare September 30, 2023 01:37
@gs-olive (Collaborator) left a comment:


Is there a specific reason, performance or otherwise, why flatten should need a different implementation than reshape when using static shapes? Specifically, we can comment out the flatten implementation for now, and any converters needing flatten for static shapes can just use a reshape and flatten the dimensions themselves.

As an alternative, @zewenli98, you can add a utility flatten_dims, which will flatten the dimensions of an input tensor into a reshape-usable form, then you can have @bowang007's converter test that utility.

@zewenli98 (Collaborator, Author):

Is there a specific reason - performance or otherwise, why flatten should need a different implementation than reshape, when using static shapes?

Thanks for the advice! I did this because I noticed there's a flatten op in this schema. I thought our goal was to support these native_functions as much as possible. Anyway, I'll comment out the flatten converter and wrap reshape to perform the flatten op.

@gs-olive (Collaborator) commented Oct 3, 2023

Generally, the focus is to cover as much of this operation set as possible: https://pytorch.org/docs/stable/ir.html#core-aten-ir, though if there are operators that show up which we can directly convert as opposed to lowering, that is certainly a good thing to have.

@zewenli98 zewenli98 requested a review from gs-olive October 3, 2023 22:42
@gs-olive (Collaborator) left a comment:


I do still think flatten_dims can be a utility which gives the shape to pass to reshape. That way, it can get tested as a utility and not as a converter (see tests/py/dynamo/conversion/test_converter_utils.py). Added a suggestion on syntax.

py/torch_tensorrt/dynamo/conversion/impl/shuffle.py (review thread, outdated and resolved)
@zewenli98 (Collaborator, Author) commented Oct 4, 2023

I do still think flatten_dims can be a utility which gives the shape to pass to reshape.

I tried implementing flatten_dims in converter_utils.py. Since flatten_dims needs to call reshape, it caused a circular import. That's why I moved it to the shuffle file. Anyway, I will rewrite it as flatten_dims.

@zewenli98 zewenli98 requested a review from gs-olive October 4, 2023 04:34
@gs-olive (Collaborator) commented Oct 4, 2023

@zewenli98 I see, thanks for the details. To clarify, I was intending for flatten_dims not to change the network at all, since changing the network requires the function to be in impl/. I was thinking it could instead be something like:

# Assumed imports to make the sketch self-contained; the import paths for
# TRTTensor and get_positive_dim are assumptions based on the existing
# converter utilities.
from typing import Tuple, Union

import numpy as np
import torch
from torch_tensorrt.dynamo.conversion.converter_utils import get_positive_dim
from torch_tensorrt.fx.types import TRTTensor


def flatten_dims(
    input: Union[TRTTensor, torch.Tensor, np.ndarray],
    start_dim: int,
    end_dim: int,
) -> Tuple[int, ...]:
    # Shape-only utility: it computes the flattened shape and never adds
    # layers to the TensorRT network, so it can live outside impl/.
    shape = input.shape
    dim_size = len(shape)
    # Normalize negative dims (e.g. -1) into positive indices.
    start_dim = get_positive_dim(start_dim, dim_size)
    end_dim = get_positive_dim(end_dim, dim_size)

    # Collapse every dimension in [start_dim, end_dim] into a single one.
    num_elements = 1
    for i in range(start_dim, end_dim + 1):
        num_elements *= shape[i]

    new_shape = tuple(shape[:start_dim]) + (num_elements,) + tuple(shape[end_dim + 1 :])

    return new_shape

Then, the user can call flatten_dims to get back the flattened dimension shape, which they can then pass to reshape themselves.
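
(As a usage sketch: the tensor shape is illustrative, flatten_dims is the utility sketched above, and the reshape call at the end is shown schematically; its exact signature is an assumption rather than the merged API.)

import torch

x = torch.randn(2, 3, 4, 5)
new_shape = flatten_dims(x, start_dim=1, end_dim=2)
assert new_shape == (2, 12, 5)  # dims 1 and 2 (3 * 4) are collapsed into one

# A converter that needs a static-shape flatten can then emit a single reshape, e.g.:
# out = impl.shuffle.reshape(network, target, source_ir, f"{name}_flatten", x_trt, new_shape)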

@zewenli98 (Collaborator, Author):

@gs-olive Thanks a lot! This makes more sense. Modified!

@zewenli98 zewenli98 force-pushed the shuffle_dynamo_converter branch from 5847be9 to d48d611 Compare October 4, 2023 22:44
@zewenli98 zewenli98 force-pushed the shuffle_dynamo_converter branch from 4fc3a8b to 9846f23 Compare October 5, 2023 20:34
@@ -65,6 +68,7 @@ def forward(self, x):
self.run_test_with_dynamic_shape(
TestModule(target_shape),
input_specs,
expected_ops={torch.ops.aten.view.default},
Review comment (Collaborator):

This line can be removed, in accordance with the new testing PR
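
(For clarity, the call from the diff context above once that argument is dropped; the closing parenthesis and any remaining keyword arguments are assumed.)

self.run_test_with_dynamic_shape(
    TestModule(target_shape),
    input_specs,
)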

@zewenli98 zewenli98 requested a review from gs-olive October 6, 2023 20:07
@gs-olive (Collaborator) left a comment:


Looks good to me, pending CI pass.

@gs-olive (Collaborator) commented Oct 6, 2023

Relevant tests pass locally on Torch 2.1.0. Merging to main.

@gs-olive gs-olive merged commit d375d10 into pytorch:main Oct 6, 2023
12 of 14 checks passed
@zewenli98 (Collaborator, Author):

@bowang007 Please consult this PR for the shuffle op.

@bowang007 (Collaborator):

@zewenli98 Thanks! Let me update the PR accordingly.
