Michael Schwimmbeck, Serouj Khajarian, and Stefanie Remmele (University of Applied Sciences Landshut)
How To Use HOLa (YouTube video):
Network Architecture:
- Install all packages listed in requirements.txt. All code is based on python 3.9 interpreter.
- Download SAM model to ./ckpt. HOLa uses SAM-VIT-H (sam_vit_h_4b8939.pth).
- Download DeAOT/AOT model to ./ckpt. HOLa uses R50-DeAOT-L (R50_DeAOTL_PRE_YTB_DAV.pth).
- Create a HoloLens 2 calibration file with ./viewer/pv_extrinsic_calibration.py and save it to ./hololens_recordings/Calib
- Install the HOLa Unity app (Releases: HOLa_1.0.0.0_ARM64.appx) on your HoloLens 2. (Alternatively, you can open ./unity in Unity, build the project as app and install the HOLa app on your HoloLens 2 by yourself).
- Make sure that both computer and HoloLens 2 are connected to the same Wi-Fi. Enter your HoloLens IP address as "General settings" -> host in main.py.
- Set a take number in "General settings".
- Run HOLa on your HoloLens 2.
- Run main.py on your computer and follow the console instructions.
Recording is based on HL2SS. Data are acquired in PV, RGB, Depth and Pose format. Pointcloud format is switched off by default as it decreases recording performance. You can either switch on the pointcloud stream in ./viewer/HololensStreaming.py or generate pointclouds offline with ./viewer/OfflinePointcloudGenerator.py.
- Adjust the HOLa sphere cursor onto the object of interest. Make sure to have an appropriate distance from the object such that the object is clearly visible in the initially recorded frame.
- While keeping the sphere cursor on the object of interest, trigger recording by saying "Start". Wait until the cursor color changes to red (indicating active recording). After this, you can move the cursor off the object.
- When you wish to stop recording, simply say "Stop". The cursor color changes back to white to indicate finalized recording.
- Find recorded data in ./hololens_recordings.
Make sure to run this mode on a computer with GPU storage sufficient for running Segment Anything (SAM). After the process has finished, you can find all object masks per frame in ./assets. Note: If you wish to set a custom seedpoint for labeling subsequent to recording, you can specifiy the seedpoint in "General settings" in main.py.
Licenses for borrowed code can be found in licenses.md file.
- DeAOT/AOT - https://github.com/yoxu515/aot-benchmark
- SAM - https://github.com/facebookresearch/segment-anything
- SAM-Track - https://github.com/z-x-yang/Segment-and-Track-Anything
- HL2SS - https://github.com/jdibenes/hl2ss
The project is licensed under the AGPL-3.0 license. To utilize or further develop this project for commercial purposes through proprietary means, permission must be granted by us (as well as the owners of any borrowed code).
Please consider citing the related paper(s) in your publications if it helps your research.
@inproceedings{schwimmbeck2024hola,
title={HOLa: HoloLens Object Labeling},
author={Schwimmbeck, Michael and Khajarian, Serouj and Holzapfel, Konstantin and Schmidt, Johannes and Remmele, Stefanie},
booktitle={Current Directions in Biomedical Engineering},
volume={10},
number={4},
pages={571--574},
year={2024},
organization={De Gruyter}
}
@article{cheng2023segment,
title={Segment and Track Anything},
author={Cheng, Yangming and Li, Liulei and Xu, Yuanyou and Li, Xiaodi and Yang, Zongxin and Wang, Wenguan and Yang, Yi},
journal={arXiv preprint arXiv:2305.06558},
year={2023}
}
@article{kirillov2023segment,
title={Segment anything},
author={Kirillov, Alexander and Mintun, Eric and Ravi, Nikhila and Mao, Hanzi and Rolland, Chloe and Gustafson, Laura and Xiao, Tete and Whitehead, Spencer and Berg, Alexander C and Lo, Wan-Yen and others},
journal={arXiv preprint arXiv:2304.02643},
year={2023}
}
@inproceedings{yang2022deaot,
title={Decoupling Features in Hierarchical Propagation for Video Object Segmentation},
author={Yang, Zongxin and Yang, Yi},
booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
year={2022}
}
@inproceedings{yang2021aot,
title={Associating Objects with Transformers for Video Object Segmentation},
author={Yang, Zongxin and Wei, Yunchao and Yang, Yi},
booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
year={2021}
}
@article{dibene2022hololens,
title={HoloLens 2 Sensor Streaming},
author={Dibene, Juan C and Dunn, Enrique},
journal={arXiv preprint arXiv:2211.02648},
year={2022}
}