Voice_Assistant

Voice Assistant for Visually Impaired Built on Raspberry Pi 5

This code works on Windows, Mac, etc. Simply edit where camera footage is pulled from

Uses a wakeword model for activation. The wakeword is 'Suno'. The wakeword model is saved under 'optimizer2.pt'.

After waking up the model, give an input (via speaking) of either an object (ie. person) or give the command 'all objects'

The code will announce all objects as well as their depths and orientations.

In order for depth to work, you must have a 2 camera system and calibrate it. Collect images using calibration_images.py and create stereomap.xml by running stereo_calibration.py.

Enjoy!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Help		Help
images		images
LICENSE		LICENSE
README.md		README.md
calibration.py		calibration.py
calibration_images.py		calibration_images.py
command_output.wav		command_output.wav
dataset.py		dataset.py
engine.py		engine.py
optimizer2.pt		optimizer2.pt
requirements.txt		requirements.txt
stereoMap.xml		stereoMap.xml
stereo_calibration.py		stereo_calibration.py
triangulation.py		triangulation.py
wakeword_temp		wakeword_temp
yolo11n.pt		yolo11n.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice_Assistant

About

Releases

Packages

Languages

License

shivenPython2023/Voice_Assistant

Folders and files

Latest commit

History

Repository files navigation

Voice_Assistant

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages