This project implements an end-to-end solution for pallet detection and segmentation using YOLOv11 and SAM2 (Segment Anything Model 2). It includes tools for annotation, training, inference, and evaluation.
The complete dataset is available on Google Drive:
Fine-tuned model weights on Google Drive:
- Create and activate a virtual environment:
python -m venv venv_sam2
source venv_sam2/bin/activate # Linux/Mac
# or
.\venv_sam2\Scripts\activate # Windows
- Install dependencies:
pip install -r requirements.txt
torch>=2.0.0
ultralytics>=8.0.0
segment-anything-2==0.1.0
supervision>=0.3.0
opencv-python>=4.8.0
numpy>=1.24.0
ROS 2 Humble (system dependency for the ROS2 package; installed separately, not via pip)
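Both inference scripts rely on GPU half precision, so it is worth confirming that PyTorch can see a CUDA device before running anything. A quick sanity check (not part of the repo):

```python
import torch

# FP16 inference in the scripts below requires a CUDA-capable GPU.
print("torch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))
```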
pallet_vision/
├── data/
│ ├── dataset/ # Detection dataset
│ └── dataset-segment/ # Segmentation dataset
├── models/
│ └── weights/ # Trained model weights
├── results/ # Inference results
├── scripts/
│ ├── detection/
│ │ ├── annotation/
│ │ │ └── pallet_annotator.py
│ │ ├── network/
│ │ │ └── pallet_object_detection.py
│ │ └── inference/
│ │ └── detection_inference_display.py # ✓ Available
│ ├── segmentation/
│ │ ├── annotation/
│ │ │ ├── sam2_pallets.py
│ │ │ └── det2_segformat.py
│ │ ├── network/
│ │ │ └── segment_network.py
│ │ └── inference/
│ │ └── segmentation_inference_display.py # ✓ Available
│ └── utils/
│ ├── yolo_label_normalizer.py
│ ├── dataset_preparation.py
│ ├── datasize.py
│ ├── dataset_augment.py
│ ├── dataset_download.py
│ └── optimize_yolo.py
└── requirements.txt
Run single-image detection inference with live visualization:
python scripts/detection/inference/detection_inference_display.py --weights models/weights/best_detect.pt --image data/dataset/test/imagesmain/pallet_362.jpg
Script features:
- Real-time visualization of detection results
- Displays confidence scores for each detection
- Side-by-side view of original and annotated images
- GPU acceleration with half precision
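For reference, the core of such a display flow with the Ultralytics API looks roughly like the sketch below; the paths, confidence threshold, and window handling are placeholders rather than the script's exact code:

```python
import cv2
from ultralytics import YOLO

# Load the fine-tuned detection weights (placeholder path).
model = YOLO("models/weights/best_detect.pt")

image_path = "data/dataset/test/imagesmain/pallet_362.jpg"

# device=0 selects the first GPU; half=True enables FP16 inference on CUDA.
results = model.predict(image_path, device=0, half=True, conf=0.25)

# plot() renders boxes and confidence scores onto a copy of the image.
annotated = results[0].plot()
original = cv2.imread(image_path)

# Side-by-side view of original and annotated images.
cv2.imshow("Pallet detection", cv2.hconcat([original, annotated]))
cv2.waitKey(0)
cv2.destroyAllWindows()
```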
Run single-image segmentation inference with live visualization:
python scripts/segmentation/inference/segmentation_inference_display.py --weights models/weights/best_segment.pt --image data/dataset/test/imagesmain/pallet_362.jpg
Script features:
- Real-time visualization of segmentation masks
- Different colors for each instance
- Side-by-side view of original and segmented images
- GPU acceleration with half precision
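A rough sketch of per-instance mask coloring with the Ultralytics results API (illustrative, not the exact script; paths are placeholders):

```python
import cv2
import numpy as np
from ultralytics import YOLO

model = YOLO("models/weights/best_segment.pt")  # placeholder path
result = model.predict("data/dataset/test/imagesmain/pallet_362.jpg",
                       device=0, half=True)[0]

overlay = result.orig_img.copy()
if result.masks is not None:
    rng = np.random.default_rng(0)
    # One binary mask per detected instance, at model input resolution.
    for mask in result.masks.data.cpu().numpy():
        color = rng.integers(0, 256, size=3)
        m = cv2.resize(mask, (overlay.shape[1], overlay.shape[0])) > 0.5
        # Blend a distinct color into each instance region.
        overlay[m] = (0.5 * overlay[m] + 0.5 * color).astype(np.uint8)

# Side-by-side view of original and segmented images.
cv2.imshow("Pallet segmentation", cv2.hconcat([result.orig_img, overlay]))
cv2.waitKey(0)
cv2.destroyAllWindows()
```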
- mAP@50: 0.59144
- Precision: 0.64713
- Recall: 0.51344
- Mean IoU Overall: 0.0400
- Max IoU Overall: 0.9516
- Min IoU Overall: 0.0000
- Median IoU: 0.0131
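These statistics are aggregated over predicted/ground-truth mask pairs; the underlying per-pair metric is the standard intersection-over-union. A minimal reference implementation (assumed for illustration; not necessarily the repo's evaluation code):

```python
import numpy as np

def mask_iou(pred: np.ndarray, gt: np.ndarray) -> float:
    """IoU between two binary masks of the same shape."""
    pred, gt = pred.astype(bool), gt.astype(bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        return 0.0  # both masks empty: define IoU as 0
    return float(np.logical_and(pred, gt).sum() / union)
```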
/ros2_package/pallet
This package provides functionality for publishing and processing images in ROS2, with support for both test images and live camera feed.
- ROS2 Humble
- Python 3.8+
- OpenCV
- PyTorch
- Ultralytics YOLO
- Initialize ROS2 environment:
source /opt/ros/humble/setup.bash
- Create and build workspace:
# Create workspace
mkdir -p ~/ros2_ws/src
cd ~/ros2_ws/src
# Clone or copy the pallet package
# [Copy pallet package into src folder]
# Build the workspace
cd ~/ros2_ws
colcon build
# Source the workspace
source install/setup.bash
Play the ROS bag at 1Hz:
ros2 bag play <path-to-your-rosbag> --rate 0.067
Example:
ros2 bag play /home/user/bags/internship_assignment_sample_bag --rate 0.067
This rate setting yields approximately 1 Hz playback (0.067 × 15 Hz ≈ 1 Hz), assuming the bag was recorded at ~15 Hz.
In a new terminal:
source /opt/ros/humble/setup.bash
source ~/ros2_ws/install/setup.bash
ros2 run pallet infer
The inference node will display:
- Original image (resized)
- Segmentation results
- Detection results
- Terminal output showing detection counts and confidence scores
- Adjust the bag playback rate if you need different visualization speeds:
  - --rate 0.033 for 0.5 Hz (one image every 2 seconds)
  - --rate 0.017 for 0.25 Hz (one image every 4 seconds)

In general, rate ≈ target frequency / recorded frequency; 1/15 ≈ 0.067 gives the 1 Hz playback above.
If you encounter the QoS warning:
[WARN] []: New publisher discovered on topic '/robot1/zed2i/left/image_rect_color', offering incompatible QoS
This is expected and handled by the node's QoS settings.
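The warning arises because the bag publishes the camera topic with BEST_EFFORT reliability, while a default ROS2 subscription requests RELIABLE. Subscribing with a matching best-effort QoS profile avoids the mismatch. A minimal sketch of such a subscription (class name, callback, and cv_bridge usage are illustrative, not the package's actual code):

```python
import rclpy
from rclpy.node import Node
from rclpy.qos import QoSProfile, ReliabilityPolicy, HistoryPolicy
from sensor_msgs.msg import Image
from cv_bridge import CvBridge

class PalletInfer(Node):
    def __init__(self):
        super().__init__("pallet_infer")
        self.bridge = CvBridge()
        # BEST_EFFORT matches sensor-stream publishers and avoids the QoS warning.
        qos = QoSProfile(reliability=ReliabilityPolicy.BEST_EFFORT,
                         history=HistoryPolicy.KEEP_LAST, depth=1)
        self.create_subscription(Image,
                                 "/robot1/zed2i/left/image_rect_color",
                                 self.on_image, qos)

    def on_image(self, msg: Image) -> None:
        frame = self.bridge.imgmsg_to_cv2(msg, desired_encoding="bgr8")
        # Detection/segmentation inference on `frame` would run here.
        self.get_logger().info(f"received frame {frame.shape[1]}x{frame.shape[0]}")

def main():
    rclpy.init()
    rclpy.spin(PalletInfer())

if __name__ == "__main__":
    main()
```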
If you use this project in your research, please cite:
Uthiralakshmi Sivaraman. Pallet Detection and Segmentation. GitHub. https://github.com/UthiraS/pallet_vision (retrieved [date]).
@software{pallet_detection_2024,
author = {Uthiralakshmi Sivaraman},
title = {Pallet Detection and Segmentation},
year = {2024},
publisher = {GitHub},
url = {https://github.com/UthiraS/pallet_vision}
}
- SAM2 by Meta AI Research
- YOLOv11
- Roboflow for the initial dataset annotation