TorchVision Roadmap - 2021 H1 #3221
Can we revisit the various workarounds for tracing Faster R-CNN / Mask R-CNN? There are many such special code paths for tracing mode, added to support ONNX export. They were written a year ago, when PyTorch's ONNX export had many limitations. In particular, the snippet below performs a repeated concat inside a loop, which is inefficient (vision/torchvision/models/detection/roi_heads.py, lines 454 to 461 at 3d60f49).
Now that PyTorch's ONNX export supports lists and in-place assignment, I think many of those workarounds can be removed. Even where removal isn't possible, they can be made more efficient thanks to the improved ONNX export support in recent PyTorch. For example, the snippet above can be replaced with one that uses a single batched concat. I confirmed that ONNX export of Mask R-CNN works with this change applied. This is also what TVM prefers.
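To illustrate the difference, here is a minimal sketch of the two patterns. The tensor shapes are illustrative only, not torchvision's actual ones, and this is not the literal roi_heads.py code:

```python
import torch

# Illustrative stand-in for per-image mask tensors (shapes are made up).
masks = [torch.rand(2, 1, 28, 28) for _ in range(4)]

# Loop-based pattern: concatenates inside the loop, allocating and
# copying a new intermediate tensor on every iteration (O(n^2) copying).
out_loop = masks[0]
for m in masks[1:]:
    out_loop = torch.cat((out_loop, m), dim=0)

# Batched pattern: one concat over the whole list, a single allocation.
out_batched = torch.cat(masks, dim=0)

assert torch.equal(out_loop, out_batched)
```

Both produce the same result; the batched form avoids the quadratic copying and traces to a single ONNX `Concat` node.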
Just linking a few trackers here.
How about adding RandAugment, in addition to the already implemented AutoAugment?
Hi, I was just wondering if there are any plans to add support for the QNNPACK backend to the quantized ResNet50 model? (This would enable quantized ResNet50 on AArch64, which isn't supported by the FBGEMM backend.)
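For context, selecting the quantized backend is done globally via `torch.backends.quantized.engine`. A minimal sketch (which engines are listed depends on the PyTorch build; QNNPACK is typically present in AArch64/mobile builds):

```python
import torch

# Inspect which quantized kernel engines this PyTorch build supports.
engines = torch.backends.quantized.supported_engines
print(engines)

# Switch to QNNPACK when available; quantized models run after this
# point will dispatch to QNNPACK kernels instead of FBGEMM.
if "qnnpack" in engines:
    torch.backends.quantized.engine = "qnnpack"
```

Backend support for a given pretrained quantized model (such as ResNet50's weights) is a separate question from kernel availability, which is what the comment above is asking about.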
Is the newly released d2go mentioned here?
Thoughts about FastAutoAugment?
Implemented in #3792.
Rotated Boxes RFC #2761. |
Can you please open a new issue so that we can track this?
Yes, exactly.
Can you open a new issue so that we can track it down and discuss it there?
Is there any interest in adding MoViNets or other state-of-the-art models for video understanding?
Comments and suggestions are welcome! :)
@Atze00 There's no current plan to support MoViNets, but regarding video understanding, we're planning to support optical flow with a RAFT implementation.
Could you clarify what you mean? I'm not sure this answers your question, but in general, when we implement a model in torchvision we provide the architecture, the pre-trained weights, and a training recipe.
@NicolasHug My question was based on the fact that I've been working on implementing MoViNets in PyTorch, using the weights released by the authors in TF. I was wondering whether the code alone would be enough to discuss a possible addition of this architecture to the roadmap. Unfortunately, I don't have the resources or the experience necessary to reproduce state-of-the-art results in video understanding.
@fmassa @NicolasHug Shall we close this issue in favor of #4187? Since only the Rotated Boxes RFC is still open, we could consider moving it to H2, or omitting it if nobody wants to pick it up.
Moved the pending items to H2 in #4187; closing this issue.
This issue is used to keep track of the TorchVision roadmap and to gather feedback from our users.
For the 2021-H2 roadmap, please see #4187