This work has been accepted by IEEE Transactions on Robotics (TRO) 🎉 [pdf] [video].
We propose LCR-Net to tackle both LiDAR-based loop detection and registration. It exploits novel feature extraction and a 3D rotary attention mechanism to precisely estimate similarities and 6-DoF poses between pairs of LiDAR scans.
LCR-Net has been integrated into SLAM systems, serving both as a loop closing module and relocalization module. This integration ensures robust and accurate online LiDAR SLAM in outdoor driving environments.
If you use our implementation in your academic work, please cite the corresponding paper:
@ARTICLE{shi2024tro,
author={Shi, Chenghao and Chen, Xieyuanli and Xiao, Junhao and Dai, Bin and Lu, Huimin},
journal={IEEE Transactions on Robotics},
title={Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM},
year={2024},
volume={40},
number={},
pages={2620-2640},
doi={10.1109/TRO.2024.3386363}}
The 3D-RoFormer follows RDMNet, please cite:
@article{shi2023tits,
author={Shi, Chenghao and Chen, Xieyuanli and Lu, Huimin and Deng, Wenbang and Xiao, Junhao and Dai, Bin},
journal={IEEE Transactions on Intelligent Transportation Systems},
title={RDMNet: Reliable Dense Matching Based Point Cloud Registration for Autonomous Driving},
year={2023},
volume={24},
number={10},
pages={11372-11383},
doi={10.1109/TITS.2023.3286464}}
python setup.py build develop
The code has been tested with {Ubuntu 20.04, torch 1.8.0, cuda 11.4, and python 3.8.8}/{Ubuntu 22.04, torch 2.4.1, cuda 12.4, and python 3.8.19}
For quick test, we provide 4 pre-trained models here.
For mainland China friends, you may use a faster link, extract code: m8bh.
Below is an notation for each model.
./weights/best-model-lc.tar # Dense point matching head trained on close-loop point cloud pairs, denoted as
$LCR-Net^\dagger$ in our paper. ./weights/best-model-reg.tar # Dense point matching head trained on continuous point cloud pairs, denoted as$LCR-Net^\diamond$ in our paper../weights/best-model-ld.tar # Global description head with backbone pre-trained on continuous point cloud pairs, denoted as
$LCR-Net^\diamond$ in our paper../weights/best-model-mixed.tar # Full LCR-Net trained on mixed point cloud pairs, denoted as
$LCR-Net$ in our paper.
Unzip weights and put them in weights
directory, then run:
python3 demo/demo.py
This script returns the dissimilarity between the two scans and calculates their relative pose:
Test pos_idx: 3854 and anc_idx: 958
L2 feature distance: 0.809192
Estimated transformation: [[ 3.8640183e-01 -9.2232913e-01 -1.6882520e-03 -5.1863933e+00] [ 9.2216253e-01 3.8629568e-01 1.9789029e-02 5.1413069e+00] [-1.7599827e-02 -9.2033548e-03 9.9980271e-01 -8.8044703e-02] [ 0.0000000e+00 0.0000000e+00 0.0000000e+00 1.0000000e+00]]
Use --vis=True
to enable visualization, and you will get the following in sequence:
Carefully examining figures b) and d), you can find that as long as there are geometrically salient object near the nodes, they are likely to be moved to the object, thereby enhancing the ability to recognize and match the same object as a whole.
We also provide a case for two scans (4481,958) that have no correlation. The network will output a significant dissimilarity.
This repo currently offers support for KITTI, KITTI360, Apollo, MulRan, and Ford campus datasets for continuous registration testing; KITTI and KITTI360 for closed-loop registration testing; and KITTI and Ford campus for loop detection testing. Following instructions to prepare data and ground truth files.
We employ a two-stage training approach, which initially trains the network for the registration task, followed by training the global description head for the candidate retrieval.
To begin with the initial training phase for the registration task, run the following command:
python3 experiments/registration/trainval_mixed.py
Saving the best model in the registration task, and then you can train the global description head. Run the following command:
python3 experiments/loop_detection/trainval_loop_detection.py --snapshot=path/to/best-model.tar
Saving the best model in the loop_detection task, and then use this script to merge the two snapshots:
python3 experiments/lcrnet/merge_model.py --model_reg=path/to/best-model-reg.tar --model_reg=path/to/best-model-ld.tar
python3 experiments/loop_detection/test_loop_detection.py --snapshot=./weights/best-model-mixed.tar --dataset=kitti
This script generates and saves global descriptors. Use --dataset
to specify the dataset; currently, loop detection testing supports the KITTI and Ford campus datasets. Use --snapshot=./weights/best-model-reg.tar
to reproduce the results of
To evaluate the loop detection performance, run:
python3 experiments/loop_detection/eval_loop_detection_overlap_dataset.py --dataset=kitti
python3 experiments/registration/test_loop_closure.py --snapshot=./weights/best-model-mixed.tar
This script returns registration results and middle outputs. Use --seq
to specify the test sequence, --lc_text_root
to specify ground truth root, --dataset
to select dataset. Currently, the loop closing test supports the KITTI and KITTI360 datasets.
python3 experiments/registration/test_loop_closure.py --snapshot=./weights/best-model-mixed.tar --dataset=kitti --seq=8 --lc_text_root=/loop_distance4
Use --snapshot=./weights/best-model-lc.tar
to reproduce the results of
To evaluate the loop closing performance, run:
python3 experiments/registration/eval.py --dataset=kitti
Use--method=ransac
to specify RANSAC as the pose estimation algorithm.
python3 experiments/registration/test_registration.py --snapshot=./weights/best-model-mixed.tar --dataset=kitti --reg_text_root=/icp10
This script returns registration results and middle outputs. Use --reg_text_root
to specify ground truth root, --dataset
to select datasets. Currently, the loop closing test supports the KITTI, KITTI360, Apollo, Ford campus, and Mulran datasets.
Use --snapshot=./weights/best-model-reg.tar
to reproduce the results of
python3 experiments/registration/eval.py --dataset=kitti --subset=continuous
Use--method=ransac
to specify RANSAC as the pose estimation algorithm.
All the results presented below are evaluated using the same model
Datasets (overlap>0.3) | AUC | F1max | Recall@1 | Recall@1% |
---|---|---|---|---|
KITTI Seq.00 | 0.958 | 0.922 | 0.937 | 0.993 |
Ford campus | 0.972 | 0.920 | 0.932 | 0.987 |
Datasets (overlap>0.3) | RR | RYE(°) | RTE(cm) |
---|---|---|---|
KITTI Seq.00 | 100 | 0.10 | 5 |
KITTI Seq.08 | 100 | 0.34 | 8 |
KITTI-360 Seq.09 | 100 | 0.14 | 10 |
KITTI-360 Seq.02 | 98.64 | 0.27 | 21 |
LCRNet is the only method achieves a 100% registration recall on the test data (except KITTI-360 Seq.02). We find that the majority of the failed samples in KITTI-360 Seq.02 are attributable to issues with the ground truth data.
Datasets (distance>10m) | RR | RRE(°) | RTE(cm) |
---|---|---|---|
KITTI | 99.82 | 0.19 | 3.9 |
KITTI-360 | 99.94 | 0.24 | 5.8 |
Apollo | 100 | 0.09 | 3.4 |
Ford campus | 100 | 0.16 | 6.6 |
Mulran | 98.22 | 0.17 | 7.4 |
To test on your own data, it is recommended to implement a Dataset
as in experiments/lcrnet/datasets/loop_closure/kitti/dataset_demo.py
, and generate your ground truth text referring to generate_kitti_loop_pairs.py
or generate_kitti_pairs.py
in the data/Kitti
directory.
Use --dataset_root
to specify the root directory for the dataset, --gt_text_root
to specify the root directory for the ground truth data, --seq
to specify the sequence. Run the following command:
python3 demo/demo.py --dataset_root=/your/Dataset/root --gt_text_root=/your/ground/truth/text/root --seq=0
In practical applications, when ground truth is unavailable, you might only be interested in obtaining the network's similarity estimates or pose estimates. We have provided corresponding scripts in the experiments/inference
directory.
To generate global descriptors for a set of data, please refer to the following example:
python3 experiments/inference/infer_loop_detection_descriptor_generation.py --dataset=kitti --seq=8
Afterward, utilize the following command to output the most similar candidate frames for each individual frame:
python3 experiments/inference/infer_loop_detection_find_top1.py --dataset=kitti --seq=8
To generate pose estimates between any two frames, please refer to the file to set the index of the data you wish to test, and then follow the example below to generate the pose estimates:
python3 experiments/inference/infer_registration.py --dataset=kitti --seq=8
We thank the following authors for open sourcing their methods.
This project is free software made available under the MIT License. For details see the LICENSE file.