GitHub - kaitolucifer/posenet-python: A Python port of Google TensorFlow.js PoseNet (Real-time Human Pose Estimation)

PoseNet Python

This repository contains a pure Python implementation (multi-pose only) of the Google TensorFlow.js Posenet model. For a (slightly faster) PyTorch implementation that followed from this, see (https://github.com/rwightman/posenet-pytorch)

I first adapted the JS code more or less verbatim and found the performance was low so made some vectorized numpy/scipy version of a few key functions (named _fast).

Further optimization is possible

The base MobileNet models have a throughput of 200-300 fps on a GTX 1080 Ti (or better)
The multi-pose post processing code brings this rate down significantly. With a fast CPU and a GTX 1080+:
- A literal translation of the JS post processing code dropped performance to approx 30fps
- My 'fast' post processing results in 90-110fps
A Cython or pure C++ port would be even better...

Install

A suitable Python 3.x environment with a recent version of Tensorflow is required.

Development and testing was done with Python 3.9.13 and Tensorflow 2.10.0 on Linux and Mac OS(Apple Silicon).

A environment setup as below should suffice:

pip install -r requirements.txt

Usage

There are three demo apps in the root that utilize the PoseNet model. They are very basic and could definitely be improved.

The first time these apps are run (or the library is used) model weights will be downloaded from the TensorFlow.js version and converted on the fly.

For all demos, the model can be specified with the '--model` argument by using its ordinal id (0-3) or integer depth multiplier (50, 75, 100, 101). The default is the 101 model.

image_demo.py

Image demo runs inference on an input folder of images and outputs those images with the keypoints and skeleton overlayed.

python image_demo.py --model 101 --image_dir ./images --output_dir ./output

A folder of suitable test images can be downloaded by first running the get_test_images.py script.

benchmark.py

A minimal performance benchmark based on image_demo. Images in --image_dir are pre-loaded and inference is run --num_images times with no drawing and no text output.

webcam_demo.py

The webcam demo uses OpenCV to capture images from a connected webcam. The result is overlayed with the keypoints and skeletons and rendered to the screen. The default args for the webcam_demo assume device_id=0 for the camera and that 1280x720 resolution is possible.

Credits

The original model, weights, code, etc. was created by Google and can be found at https://github.com/tensorflow/tfjs-models/tree/master/posenet

This port and my work is in no way related to Google.

The Python conversion code that started me on my way was adapted from the CoreML port at https://github.com/infocom-tpo/PoseNet-CoreML

TODO (someday, maybe)

More stringent verification of correctness against the original implementation
Performance improvements (especially edge loops in 'decode.py')
OpenGL rendering/drawing
Comment interfaces, tensor dimensions, etc
Implement batch inference for image_demo

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
posenet		posenet
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
NOTICE.txt		NOTICE.txt
README.md		README.md
benchmark.py		benchmark.py
get_test_images.py		get_test_images.py
image_demo.py		image_demo.py
requirements.txt		requirements.txt
webcam_demo.py		webcam_demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PoseNet Python

Install

Usage

image_demo.py

benchmark.py

webcam_demo.py

Credits

TODO (someday, maybe)

About

Releases

Packages

Languages

License

kaitolucifer/posenet-python

Folders and files

Latest commit

History

Repository files navigation

PoseNet Python

Install

Usage

image_demo.py

benchmark.py

webcam_demo.py

Credits

TODO (someday, maybe)

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages