inference_detector_by_patches using RotatedDetector (TensorRT) #728

maoye237 · 2023-02-14T00:15:21Z

Has anyone been able to get the existing inference_detector_by_patches function to work with RotatedDetector? If so, can you let me know whether I am going down the right path?

I am trying to run rotated detection with the TensorRT backend on a large image, but the existing inference_detector_by_patches function can't be used with RotatedDetector, at least not without significant modification (or at least I can't figure out how).

Can someone tell me whether I am on the right track, or whether there is an easier way to run inference on large images using RotatedDetector?

From what I have observed, I need to split the image into overlapping sections, run inference on each section, combine the results, and then apply NMS. Is that somewhat in the right ballpark? Any examples or advice would be much appreciated.
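To make the first three steps concrete, here is a minimal sketch of the split / infer / combine part, written so it does not depend on any particular backend. Everything in it is an assumption for illustration: `window_starts`, `make_windows`, `detect_by_windows` and the `predict_window` callback are hypothetical names, not MMRotate or MMDeploy APIs, the boxes are assumed to be `(cx, cy, w, h, angle, score)` tuples in patch coordinates, and the 1024/824 defaults are only example values.

```python
def window_starts(length, size, step):
    """Start offsets along one axis; the last window sits flush with the border."""
    if length <= size:
        return [0]
    starts = list(range(0, length - size, step))
    starts.append(length - size)
    return starts


def make_windows(width, height, size, step):
    """Overlapping (x0, y0, x1, y1) windows covering the image.

    Neighbouring windows overlap by (size - step) pixels.
    """
    return [(x, y, min(x + size, width), min(y + size, height))
            for y in window_starts(height, size, step)
            for x in window_starts(width, size, step)]


def detect_by_windows(width, height, predict_window, size=1024, step=824):
    """Run a detector on every window and shift results to image coordinates.

    `predict_window(x0, y0, x1, y1)` is a hypothetical callback that crops the
    window, runs the backend of your choice on it, and returns an iterable of
    (cx, cy, w, h, angle, score) boxes in *patch* coordinates.
    """
    merged = []
    for x0, y0, x1, y1 in make_windows(width, height, size, step):
        for cx, cy, w, h, angle, score in predict_window(x0, y0, x1, y1):
            # Only the centre moves; size and angle are unaffected by the crop.
            merged.append((cx + x0, cy + y0, w, h, angle, score))
    return merged
```

With a NumPy image, the callback would typically crop `img[y0:y1, x0:x1]` and pass the crop to the TensorRT-backed model. The returned list still contains duplicates from the overlap regions, so an NMS pass is needed afterwards.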

def inference_detector_by_patches(model,
                                  img,
                                  sizes,
                                  steps,
                                  ratios,
                                  merge_iou_thr,
                                  bs=1):
    """inference patches with the detector.

    Split huge image(s) into patches and inference them with the detector.
    Finally, merge patch results on one huge image by nms.

    Args:
        model (nn.Module): The loaded detector.
        img (str | ndarray or): Either an image file or loaded image.
        sizes (list): The sizes of patches.
        steps (list): The steps between two patches.
        ratios (list): Image resizing ratios for multi-scale detecting.
        merge_iou_thr (float): IoU threshold for merging results.
        bs (int): Batch size, must greater than or equal to 1.

    Returns:
        list[np.ndarray]: Detection results.
    """
    assert bs >= 1, 'The batch size must greater than or equal to 1'
    cfg = model.cfg
    device = next(model.parameters()).device  # model device
    cfg = cfg.copy()
    # set loading pipeline type
    cfg.data.test.pipeline[0].type = 'LoadPatchFromImage'
    cfg.data.test.pipeline = replace_ImageToTensor(cfg.data.test.pipeline)
    test_pipeline = Compose(cfg.data.test.pipeline)

    if not isinstance(img, np.ndarray):
        img = mmcv.imread(img)
    height, width = img.shape[:2]
    sizes, steps = get_multiscale_patch(sizes, steps, ratios)
    windows = slide_window(width, height, sizes, steps)

    results = []
    start = 0
    while True:
        # prepare patch data
        patch_datas = []
        if (start + bs) > len(windows):
            end = len(windows)
        else:
            end = start + bs
        for window in windows[start:end]:
            data = dict(img=img, win=window.tolist())
            data = test_pipeline(data)
            patch_datas.append(data)
        data = collate(patch_datas, samples_per_gpu=len(patch_datas))
        # just get the actual data from DataContainer
        data['img_metas'] = [
            img_metas.data[0] for img_metas in data['img_metas']
        ]
        data['img'] = [img.data[0] for img in data['img']]
        if next(model.parameters()).is_cuda:
            # scatter to specified GPU
            data = scatter(data, [device])[0]
        else:
            for m in model.modules():
                assert not isinstance(
                    m, RoIPool
                ), 'CPU inference with RoIPool is not supported currently.'

        # forward the model
        with torch.no_grad():
            results.extend(model(return_loss=False, rescale=True, **data))

        if end >= len(windows):
            break
        start += bs

    results = merge_results(
        results,
        windows[:, :2],
        img_shape=(width, height),
        iou_thr=merge_iou_thr,
        device=device)
    return results
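The last step of the function above merges the per-patch results by NMS. For anyone who wants to see what such a merge could look like for rotated boxes without relying on the PyTorch model object, below is a rough pure-Python sketch. It is not the `merge_results` implementation: `rbox_corners`, `polygon_area`, `clip_convex`, `rotated_iou` and `rotated_nms` are hypothetical helpers, boxes are assumed to be `(cx, cy, w, h, angle)` with the angle in radians, and the NMS is class-agnostic for brevity. The overlap is computed by clipping one box polygon against the other (Sutherland-Hodgman).

```python
import math


def rbox_corners(cx, cy, w, h, angle):
    """Corner points of a rotated box; `angle` is in radians."""
    c, s = math.cos(angle), math.sin(angle)
    local = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    return [(cx + x * c - y * s, cy + x * s + y * c) for x, y in local]


def polygon_area(poly):
    """Shoelace formula (positive for the vertex order produced above)."""
    return 0.5 * sum(x0 * y1 - x1 * y0
                     for (x0, y0), (x1, y1) in zip(poly, poly[1:] + poly[:1]))


def clip_convex(subject, clipper):
    """Sutherland-Hodgman clipping of one convex polygon by another."""
    out = subject
    for (ax, ay), (bx, by) in zip(clipper, clipper[1:] + clipper[:1]):
        inp, out = out, []
        if not inp:
            break
        # Signed side of every vertex relative to clip edge a->b (>= 0 is inside).
        sides = [(bx - ax) * (py - ay) - (by - ay) * (px - ax) for px, py in inp]
        for i, (p, sp) in enumerate(zip(inp, sides)):
            j = (i + 1) % len(inp)
            q, sq = inp[j], sides[j]
            if sp >= 0:
                out.append(p)
            if sp * sq < 0:  # edge p->q crosses the clip line
                t = sp / (sp - sq)
                out.append((p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1])))
    return out


def rotated_iou(box_a, box_b):
    """IoU of two (cx, cy, w, h, angle) boxes."""
    inter = max(0.0, polygon_area(clip_convex(rbox_corners(*box_a),
                                              rbox_corners(*box_b))))
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


def rotated_nms(dets, iou_thr):
    """Greedy NMS over (cx, cy, w, h, angle, score) tuples; returns kept indices."""
    order = sorted(range(len(dets)), key=lambda i: dets[i][5], reverse=True)
    keep = []
    for i in order:
        if all(rotated_iou(dets[i][:5], dets[k][:5]) <= iou_thr for k in keep):
            keep.append(i)
    return keep
```

The IoU does not depend on whether angles are measured clockwise or counter-clockwise as long as both boxes use the same convention. This loop is quadratic and slow in Python; in practice a compiled op such as `nms_rotated` from `mmcv.ops` is probably the better choice, and the sketch is mainly useful for checking results on a few boxes.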
