3D NMS and RoiAlign for volumetric data #2402

mibaumgartner · 2020-07-07T14:58:53Z

🚀 Feature

3D data gains more and more popularity inside the deep learning community. As a consequence it would be great to have a unified 3D NMS and 3D ROI Align for future and current projects like MONAI .

Motivation

Information added from @mjorgecardoso
Medical imaging is a huge field of research, with conferences such as ISMRM (5k+ attendees), MICCAI (2.5k+), ISBI (1.5k+). Volumetric neural network operations (convolutions, pooling, etc), are common and supported in PyTorch (see here https://pytorch.org/docs/master/generated/torch.nn.Conv3d.html).

Spatial dimensions summarised:
N = batch size, C = channels, H = height, W = width, D = depth / T = time

Typically found in 2D: [N, C, H, W]

Typically found in 2d + time (video): [N, C, T, H, W]
Expected behaviour: operations are only applied along the spatial dimensions (H, W) and NOT along T

Typically found in 3d (volumetric): [N, C, D, H, W] (sometimes also [N, C, H, W, D] as in medicaldetectiontoolkit)
Expected behaviour: operations are applied along all spatial dimensions (D,H,W)

Pitch

Add support for NMS and RoiAlign for volumetric data and define the right conventions and proper documentation to make clear which function needs to be used in which case.

For backward compatibility nms and roialign should be kept as an alias for their plain 2d counterparts. Moving forward, there could be two functions nms2d and nms3d (like typically found in pytorch e.g. Conv2d and Conv3d). I'm not quite sure what the optimal way of handling/naming the video case is (maybe a flag inside the 3d versions?).

Alternatives

Additional context

#2337
#1678
@pfjaeger

naga-karthik · 2022-04-18T20:27:20Z

Hello, I am wondering what's the status of this issue? Are 3D NMS and 3D ROI Align going to be implemented in future version of torchvision anytime soon? As the OP mentioned, having access to 3D versions of the above ops would make it convenient to train models on volumetric (medical) data. Thanks!

datumbox · 2022-04-19T08:47:38Z

@naga-karthik Thanks for the interest. Right now we don't have the bandwidth to investigate and implement the proposed features. We are a small team and we are currently tackling other more high-priority issues (new Datasets API, new Transforms API etc). Rest assured we will definitely review this on the next planning session.

etasnadi · 2024-04-29T17:24:05Z

Dear All,

If anyone is considering to implement this in torchvision, I have a working 3D RoiAlign kernel implemented in Tensnorflow that could be directly ported back into PyTorch. You can pull the 3D kernels from here: https://github.com/etasnadi/roi_align_3D.

etasnadi · 2024-04-29T19:26:59Z

Might worth considering https://github.com/TimothyZero/MedVision/tree/main for the torch version also.

mibaumgartner mentioned this issue Jul 7, 2020

3D NMS and ROI Align #2337

Open

mibaumgartner mentioned this issue Nov 9, 2020

[Feature request] Volumetric Detection Ops Project-MONAI/MONAI#1205

Closed

fmassa added module: ops needs discussion labels Feb 21, 2021

NicolasHug mentioned this issue Apr 14, 2021

Whether RoiAlign could add support for 3D data such as medical data? #3669

Closed

oke-aditya mentioned this issue Jun 14, 2021

[RFC] TorchVision with Batteries included - Phase 1 #3911

Closed

16 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3D NMS and RoiAlign for volumetric data #2402

3D NMS and RoiAlign for volumetric data #2402

mibaumgartner commented Jul 7, 2020

naga-karthik commented Apr 18, 2022

datumbox commented Apr 19, 2022

etasnadi commented Apr 29, 2024

etasnadi commented Apr 29, 2024 •

edited

Loading

3D NMS and RoiAlign for volumetric data #2402

3D NMS and RoiAlign for volumetric data #2402

Comments

mibaumgartner commented Jul 7, 2020

🚀 Feature

Motivation

Pitch

Alternatives

Additional context

naga-karthik commented Apr 18, 2022

datumbox commented Apr 19, 2022

etasnadi commented Apr 29, 2024

etasnadi commented Apr 29, 2024 • edited Loading

etasnadi commented Apr 29, 2024 •

edited

Loading