Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pretrained CPM model output resolution is only 24x24 #61

Open
sfktrifork opened this issue Feb 3, 2019 · 2 comments
Open

Pretrained CPM model output resolution is only 24x24 #61

sfktrifork opened this issue Feb 3, 2019 · 2 comments
Labels
enhancement New feature or request

Comments

@sfktrifork
Copy link

sfktrifork commented Feb 3, 2019

(Debugged using the iOS demo app at https://github.com/tucan9389/PoseEstimation-CoreML)

The output dimension of the pretrained CPM model when converted to CoreML is 96x96; however, the resolution of its prediction is ostensibly only 1/4th of that. All coordinates of predicted keypoint positions are multiples of 4, e.g. (4, 56), (24, 28), etc. This effectively means a prediction whose accuracy is 4 times worse than expected.

In debugging, I looked at the predicted position of the top point. Notice how the position is always a multiple of 4 for both x and y.

Max top point is: (44,24) with confidence 0.68310546875
w/h: 96/96
Max top point is: (44,24) with confidence 0.6572265625
w/h: 96/96
Max top point is: (44,24) with confidence 0.677734375
w/h: 96/96
Max top point is: (44,24) with confidence 0.72900390625
w/h: 96/96
Max top point is: (48,20) with confidence 0.13720703125
w/h: 96/96
Max top point is: (84,84) with confidence 0.021026611328125

Why does this occur? Does this have to do with the pretrained model itself? Should I train the network myself to yield a higher resolution?

@tucan9389
Copy link
Contributor

tucan9389 commented Feb 4, 2019

I have the same problem here.
@edvardHua Do you have any idea?
img_1254

@edvardHua
Copy link
Owner

edvardHua commented Feb 7, 2019

Indeed, the network architectures has a huge margin of improvement.
We could follow the tips of paper Convolutional Neural Networks at Constrained Time Cost to optimize it....
But it takes time.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants