GitHub - ldkong1205/Robo3D: [ICCV 2023] Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

Lingdong Kong^1,2,*    Youquan Liu^1,3,*    Xin Li^1,4,*    Runnan Chen^1,5    Wenwei Zhang^1,6
Jiawei Ren⁶    Liang Pan⁶    Kai Chen¹    Ziwei Liu⁶
¹Shanghai AI Laboratory    ²National University of Singapore    ³Hochschule Bremerhaven    ⁴East China Normal University    ⁵The University of Hong Kong    ⁶S-Lab, Nanyang Technological University

About

Robo3D is an evaluation suite heading toward robust and reliable 3D perception in autonomous driving. With it, we probe the robustness of 3D detectors and segmentors under out-of-distribution (OoD) scenarios against corruptions that occur in the real-world environment. Specifically, we consider natural corruptions happen in the following cases:

Adverse weather conditions, such as fog, wet ground, and snow;
External disturbances that are caused by motion blur or result in LiDAR beam missing;
Internal sensor failure, including crosstalk, possible incomplete echo, and cross-sensor scenarios.



Clean	Fog	Wet Ground

Snow	Motion Blur	Beam Missing

Crosstalk	Incomplete Echo	Cross-Sensor

Visit our project page to explore more examples. 🚘

Updates

[2024.05] - Check out the technical report of this competition: The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition 🚙.
[2024.05] - The slides of the 2024 RoboDrive Workshop are available here ⤴️.
[2024.05] - The video recordings are available on YouTube ⤴️ and Bilibili ⤴️.
[2024.05] - We are glad to announce the winning teams of the 2024 RoboDrive Challenge:
- Track 1: Robust BEV Detection
  - 🥇 DeepVision, 🥈 Ponyville Autonauts Ltd, 🥉 CyberBEV
- Track 2: Robust Map Segmentation
  - 🥇 SafeDrive-SSR, 🥈 CrazyFriday, 🥉 Samsung Research
- Track 3: Robust Occupancy Prediction
  - 🥇 ViewFormer, 🥈 APEC Blue, 🥉 hm.unilab
- Track 4: Robust Depth Estimation
  - 🥇 HIT-AIIA, 🥈 BUAA-Trans, 🥉 CUSTZS
- Track 5: Robust Multi-Modal BEV Detection
  - 🥇 safedrive-promax, 🥈 Ponyville Autonauts Ltd, 🥉 HITSZrobodrive
[2024.01] - The toolkit tailored for the 2024 RoboDrive Challenge has been released. 🛠️
[2023.12] - We are hosting the RoboDrive Challenge at ICRA 2024. 🚙
[2023.09] - Intend to improve the OoD robustness of your 3D perception models? Check out our recent work, Seal 🦭, an image-to-LiDAR self-supervised pretraining framework that leverages off-the-shelf knowledge from vision foundation models for cross-modality representation learning.
[2023.07] - Robo3D was accepted to ICCV 2023! 🎉
[2023.03] - We establish "Robust 3D Perception" leaderboards on Paper-with-Code: ¹KITTI-C, ²SemanticKITTI-C, ³nuScenes-C, and ⁴WOD-C. Join the challenge today! 🙋
[2023.03] - The KITTI-C, SemanticKITTI-C, and nuScenes-C datasets are ready for download at the OpenDataLab platform. Kindly refer to this page for more details on preparing these datasets. 🍻
[2023.01] - Launch of the Robo3D benchmark. In this initial version, we include 12 detectors and 22 segmentors, evaluated on 4 large-scale autonomous driving datasets (KITTI, SemanticKITTI, nuScenes, and Waymo Open) with 8 corruption types across 3 severity levels.

Taxonomy




Fog	Wet Ground	Snow	Motion Blur



Beam Missing	Crosstalk	Incomplete Echo	Cross-Sensor

Video Demo

Demo 1	Demo 2	Demo 3

Link ^⤴️	Link ^⤴️	Link ^⤴️

Installation

For details related to installation, kindly refer to INSTALL.md.

Data Preparation

Our datasets are hosted by OpenDataLab.

OpenDataLab is a pioneering open data platform for the large AI model era, making datasets accessible. By using OpenDataLab, researchers can obtain free formatted datasets in various fields.

Kindly refer to DATA_PREPARE.md for the details to prepare the ¹KITTI, ²KITTI-C, ³SemanticKITTI, ⁴SemanticKITTI-C, ⁵nuScenes, ⁶nuScenes-C, ⁷WOD, and ⁸WOD-C datasets.

Getting Started

To learn more usage about this codebase, kindly refer to GET_STARTED.md.

Model Zoo

LiDAR Semantic Segmentation

SqueezeSeg, ICRA 2018. ^[Code]

SqueezeSegV2, ICRA 2019. ^[Code]

MinkowskiNet, CVPR 2019. ^[Code]

RangeNet++, IROS 2019. ^[Code]

KPConv, ICCV 2019. ^[Code]

SalsaNext, ISVC 2020. ^[Code]

RandLA-Net, CVPR 2020. ^[Code]

PolarNet, CVPR 2020. ^[Code]

3D-MiniNet, IROS 2020. ^[Code]

SPVCNN, ECCV 2020. ^[Code]

Cylinder3D, CVPR 2021. ^[Code]

FIDNet, IROS 2021. ^[Code]

RPVNet, ICCV 2021.

CENet, ICME 2022. ^[Code]

CPGNet, ICRA 2022. ^[Code]

2DPASS, ECCV 2022. ^[Code]

GFNet, TMLR 2022. ^[Code]

PCB-RandNet, arXiv 2022. ^[Code]

PIDS, WACV 2023. ^[Code]

SphereFormer, CVPR 2023. ^[Code]

WaffleIron, ICCV 2023. ^[Code]

FRNet, arXiv 2023. ^[Code]

LiDAR Panoptic Segmentation

DS-Net, CVPR 2021. ^[Code]

Panoptic-PolarNet, CVPR 2021. ^[Code]

3D Object Detection

SECOND, Sensors 2018. ^[Code]

PointPillars, CVPR 2019. ^[Code]

PointRCNN, CVPR 2019. ^[Code]

Part-A2, T-PAMI 2020.

PV-RCNN, CVPR 2020. ^[Code]

3DSSD, CVPR 2020. ^[Code]

SA-SSD, CVPR 2020. ^[Code]

CenterPoint, CVPR 2021. ^[Code]

PV-RCNN++, IJCV 2022. ^[Code]

SphereFormer, CVPR 2023. ^[Code]

Benchmark

LiDAR Semantic Segmentation

The mean Intersection-over-Union (mIoU) is consistently used as the main indicator for evaluating model performance in our LiDAR semantic segmentation benchmark. The following two metrics are adopted to compare among models' robustness:

mCE (the lower the better): The average corruption error (in percentage) of a candidate model compared to the baseline model, which is calculated among all corruption types across three severity levels.
mRR (the higher the better): The average resilience rate (in percentage) of a candidate model compared to its "clean" performance, which is calculated among all corruption types across three severity levels.