An Evaluation of CNN Models and Data Augmentation Techniques in Hierarchical Localization of Mobile Robots

Authors: J.J. Cabrera, O.J. Céspedes, S. Cebollada, O. Reinoso, L. Payá
Journal: Evolving Systems (2024)
Publisher: Springer-Verlag
ISSN: 1868-6486
DOI: 10.1007/s12530-024-09604-6
arXiv: 2407.10596
YouTube: Video link

Introduction

This work presents an evaluation of CNN models and data augmentation techniques to carry out the hierarchical localization of a mobile robot using omnidirectional images. An ablation study of different state-of-the-art CNN models used as the backbone is presented, and a variety of data augmentation visual effects are proposed to address the visual localization of the robot. The proposed method is based on adapting and retraining a CNN with a dual purpose:

To perform a rough localization step where the model predicts the room from which an image was captured.
To address the fine localization step by retrieving the most similar image from the visual map among those in the previously predicted room through a pairwise comparison between descriptors obtained from an intermediate layer of the CNN.
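The two steps above can be sketched in a few lines. This is a minimal illustration, not the repository's code: it assumes the CNN has already produced per-room scores and a descriptor for the query image, and it assumes Euclidean distance for the pairwise descriptor comparison. The names `coarse_step`, `fine_step`, `localize`, and the toy `visual_map` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

# One visual-map entry: (room_id, (x, y) capture position, descriptor).
# This layout is an assumption made for the sketch.
MapEntry = Tuple[int, Tuple[float, float], Sequence[float]]


def coarse_step(room_scores: Sequence[float]) -> int:
    """Rough localization: pick the room with the highest classifier score."""
    return max(range(len(room_scores)), key=lambda i: room_scores[i])


def fine_step(query: Sequence[float], room: int,
              visual_map: List[MapEntry]) -> MapEntry:
    """Fine localization: nearest descriptor among map images of that room."""
    candidates = [entry for entry in visual_map if entry[0] == room]
    if not candidates:
        raise ValueError(f"no map images stored for room {room}")
    # Euclidean distance is assumed here; other metrics could be swapped in.
    return min(candidates, key=lambda entry: math.dist(query, entry[2]))


def localize(room_scores, query_descriptor, visual_map):
    """Run both steps and return the predicted room and estimated position."""
    room = coarse_step(room_scores)
    _, position, _ = fine_step(query_descriptor, room, visual_map)
    return room, position


# Toy data standing in for real CNN outputs.
visual_map = [
    (0, (0.0, 0.0), [1.0, 0.0, 0.0]),
    (0, (1.0, 0.5), [0.8, 0.2, 0.0]),
    (1, (5.0, 2.0), [0.0, 1.0, 0.0]),
    (1, (6.0, 2.5), [0.0, 0.7, 0.3]),
]
room, position = localize([0.1, 0.9], [0.0, 0.9, 0.1], visual_map)
print(room, position)
```

Restricting the search to the predicted room keeps the number of descriptor comparisons small, at the cost of an unrecoverable error whenever the room prediction is wrong.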

We evaluate the impact of different state-of-the-art CNN models such as ConvNeXt for the proposed localization. Various data augmentation visual effects are separately employed for training the model, and their impact is assessed. The performance of the resulting CNNs is evaluated under real operation conditions, including changes in lighting conditions.

Comparison with Other Methods

Model	Cloudy error	Night error	Sunny error
AlexNet + DA [1]	0.29 m	0.29 m	0.69 m
EfficientNet [2]	0.24 m	0.33 m	0.44 m
Triplet VGG16 [3]	0.25 m	0.28 m	0.40 m
ConvNeXt Large (ours)	0.22 m	0.26 m	0.83 m
ConvNeXt Large + DA (ours)	0.22 m	0.27 m	0.57 m
HOG [4]	-	0.45 m	0.82 m
GIST [4]	-	1.07 m	0.88 m

[1] Cabrera, J.J., Cebollada, S., Flores, M., Reinoso, O., Payá, L.: Training, optimization and validation of a CNN for room retrieval and description of omnidirectional images. SN Computer Science 3(4), 1–13 (2022)

[2] Rostkowska, M., Skrzypczyński, P.: Optimizing appearance-based localization with catadioptric cameras: Small-footprint models for real-time inference on edge devices. Sensors 23(14), 6485 (2023)

[3] Alfaro, M., Cabrera, J.J., Jiménez, L.M., Reinoso, O., Payá, L.: Hierarchical localization with panoramic views and triplet loss functions (2024)

[4] Cebollada, S., Payá, L., Jiang, X., Reinoso, O.: Development and use of a convolutional neural network for hierarchical appearance-based localization. Artificial Intelligence Review 55(4), 2847–2874 (2022)
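The errors in the comparison table are given in metres. As a rough sketch of how such a figure could be computed, the snippet below averages the Euclidean distance between each estimated position and its ground-truth position. This is an assumption about the metric, not a description of `eval/evaluate.py`; the function name `mean_localization_error` and the sample positions are hypothetical.

```python
import math
from statistics import mean


def mean_localization_error(estimated, ground_truth):
    """Average Euclidean distance between paired positions (same units in and out)."""
    if len(estimated) != len(ground_truth):
        raise ValueError("position lists must have the same length")
    return mean(math.dist(e, g) for e, g in zip(estimated, ground_truth))


# Hypothetical (x, y) positions in metres, not results from the paper.
estimated = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0)]
ground_truth = [(0.3, 0.4), (1.0, 1.0), (2.0, 1.0)]
error = mean_localization_error(estimated, ground_truth)
print(f"{error:.2f} m")
```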

Citation

If you find this work useful, please consider citing:

@article{Cabrera2024CNNLocalization,
title={An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots},
author={J.J. Cabrera and O.J. Céspedes and S. Cebollada and O. Reinoso and L. Payá},
journal={Evolving Systems},
year={2024},
publisher={Springer-Verlag},
issn={1868-6486},
doi={10.1007/s12530-024-09604-6}
}

Repository Structure

The repository is structured as follows:

├── config
│   ├── config.py
│   └── parameters.yaml
├── eval
│   ├── evaluate.py
│   ├── evaluation_utils.py
│   ├── save_all_test_errors.py
│   └── validation.py
├── train
│   ├── run_train.py
│   └── training_module.py
├── datasets.py
├── models.py
├── README.md
└── requirements.txt

Getting Started

Prerequisites

Ensure you have the following installed:

Python 3.8+
PyTorch
NumPy
Matplotlib

You can install the required packages using:

pip install -r requirements.txt

Usage

Dataset

This project uses the Freiburg part of the COsy Localization Database (COLD), divided into training, validation, and test sets. Six data augmentation effects are applied individually to the training set. The dataset used in this research can be downloaded from: https://drive.google.com/drive/folders/1izX9LsE9f34q3cq2UbUEcneBxfFwPKh9?usp=sharing

Configuration:

Adjust the dataset path (dataset_folder) and the training parameters in config/parameters.yaml as needed.

dataset_folder: '/media/arvc/DATOS/Juanjo/Datasets/Friburgo/'

train_batch_size: 16
validation_batch_size: 512
num_classes: 9
epochs: 30

models_to_train: ['AlexNet', 'resnet_152', 'convnext', 'resnext', 'efficientnet', 'mobilenet']
DA_training_sequences: ['noDA', 'DA1', 'DA2', 'DA3', 'DA4', 'DA5', 'DA6']

models_to_test: ['AlexNet', 'resnet_152', 'convnext', 'resnext', 'efficientnet', 'mobilenet']
DA_testing_sequences: ['noDA', 'DA1', 'DA2', 'DA3', 'DA4', 'DA5', 'DA6']
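To get a feel for how many training runs these lists imply, the sketch below enumerates every model and training-sequence pair. It assumes each listed model is trained once per sequence; whether `train/run_train.py` actually iterates over the full grid is not stated here. The dictionary mirrors the values above, and `training_runs` is a hypothetical helper.

```python
from itertools import product

# Values copied from the config/parameters.yaml excerpt above.
config = {
    "models_to_train": ["AlexNet", "resnet_152", "convnext",
                        "resnext", "efficientnet", "mobilenet"],
    "DA_training_sequences": ["noDA", "DA1", "DA2", "DA3",
                              "DA4", "DA5", "DA6"],
}


def training_runs(cfg):
    """Return every (model, sequence) pair, assuming a full grid is trained."""
    return list(product(cfg["models_to_train"], cfg["DA_training_sequences"]))


runs = training_runs(config)
print(len(runs))          # number of model/sequence combinations
print(runs[0], runs[-1])  # first and last combination
```

Shortening either list in the configuration reduces the number of runs proportionally, which is useful for a quick smoke test before a full training sweep.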

Training:

Run the training script:

python3 train/run_train.py

Evaluation:

Evaluate the trained model:

python3 eval/evaluate.py

Repository: juanjo-cabrera/IndoorLocalizationSingleCNN
