Similar idea as ICRA 2022: Augmented Pointing Gesture Estimation for Human-Robot Interaction
LINK: https://ieeexplore.ieee.org/abstract/document/9811617
The paper above uses the forearm direction as the pointing direction, but pointing happens when we look at something and aim the fingertip at it. The forearm direction may therefore not be the most intuitive representation of a pointing action, especially when the arm is not fully stretched out. Our line of sight, i.e. the line passing through the eyes and the fingertip, should be the reference.
Open question: eye recognition?
It is hard to locate the hand when the arm is not fully stretched out, because of possible occlusion.
Precision is also a problem: in this situation the two points (eye and fingertip) are close together relative to the camera-to-person distance (under 2 m), so a small error in their coordinates causes a large change in the pointing angle.
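A rough back-of-the-envelope check of this sensitivity (a minimal sketch; the 20 cm eye-to-fingertip baseline and 1 cm keypoint error below are assumed numbers, not measurements):

```python
import numpy as np

# Assumed numbers for illustration: eye-to-fingertip distance with a bent arm,
# and a typical keypoint localization error perpendicular to the pointing ray.
baseline = 0.20   # m, eye-to-fingertip distance
kp_error = 0.01   # m, error in the fingertip position

# Angular error of the eye->fingertip ray caused by that keypoint error
angle_err_deg = np.degrees(np.arctan2(kp_error, baseline))   # ~2.9 deg

# At 2 m from the person, that angle becomes a positional error of
target_dist = 2.0  # m
pos_err = target_dist * np.tan(np.radians(angle_err_deg))    # ~0.10 m

print(f"angular error ~ {angle_err_deg:.1f} deg, "
      f"error at {target_dist} m ~ {pos_err * 100:.0f} cm")
```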
- [x] human pose estimation: arm and eye center (midpoint of the two eyes)
ongoing:
AlphaPose: https://github.com/MVIG-SJTU/AlphaPose
Model: pretrained_models/multi_domain_fast50_dcn_combined_256x192.pth
Config: configs/halpe_coco_wholebody_136/resnet/256x192_res50_lr1e-3_2x-dcn-combined.yaml
Parameters/flags: https://github.com/MVIG-SJTU/AlphaPose/blob/master/docs/run.md
Output Format: https://github.com/MVIG-SJTU/AlphaPose/blob/master/docs/output.md
- [ ] hand pose estimation: deciding which keypoint to use as the end point of the sight line
transform keypoints from the image frame (camera) to the world frame (see the sketch after this list)
- [ ] coordinate transform
- [ ] hand-eye calibration
- [ ] jitter smoothing (depending on performance)
- [ ] compensation (optional)
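A minimal sketch of the image-to-world transform step, assuming a pinhole camera model with known intrinsics and a camera-to-world extrinsic obtained from hand-eye calibration (the intrinsic and extrinsic values below are placeholders):

```python
import numpy as np

def pixel_to_world(u, v, depth, K, T_world_cam):
    """Deproject a pixel (u, v) with depth (m) into the world frame.

    K           : 3x3 camera intrinsic matrix
    T_world_cam : 4x4 homogeneous camera-to-world transform
                  (e.g. from hand-eye calibration)
    """
    fx, fy = K[0, 0], K[1, 1]
    cx, cy = K[0, 2], K[1, 2]
    # Back-project with the pinhole model: point in the camera frame
    p_cam = np.array([(u - cx) * depth / fx,
                      (v - cy) * depth / fy,
                      depth,
                      1.0])
    # Transform into the world (robot) frame
    return (T_world_cam @ p_cam)[:3]

# Example with placeholder intrinsics/extrinsics:
K = np.array([[615.0, 0.0, 320.0],
              [0.0, 615.0, 240.0],
              [0.0,   0.0,   1.0]])
T_world_cam = np.eye(4)   # identity until calibration is done
print(pixel_to_world(350, 200, 1.2, K, T_world_cam))
```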
After obtaining the pointing position / line of sight, how do we interact with the robot arm?
- project the pointing ray onto the X-Y plane, see what is there, and decide how to interact with / manipulate it (sketch below)
- collision checking: intersect the line of sight with objects in the workspace, then interact
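A minimal sketch of the projection idea, assuming the eye and fingertip are already expressed in the world frame and the workspace table lies in the z = 0 plane:

```python
import numpy as np

def pointing_target_on_table(eye_w, fingertip_w, table_z=0.0):
    """Intersect the eye->fingertip ray with the horizontal plane z = table_z.

    eye_w, fingertip_w : 3D points in the world frame, shape (3,)
    Returns the (x, y, z) intersection, or None if the ray is parallel to
    or pointing away from the plane.
    """
    direction = fingertip_w - eye_w
    if abs(direction[2]) < 1e-9:
        return None               # ray parallel to the plane
    t = (table_z - eye_w[2]) / direction[2]
    if t <= 0:
        return None               # plane is behind the pointing direction
    return eye_w + t * direction

# Example with made-up coordinates (metres):
eye = np.array([0.0, 0.0, 1.5])
tip = np.array([0.3, 0.1, 1.2])
print(pointing_target_on_table(eye, tip))   # where the ray hits the table
```

Collision checking against known objects could reuse the same ray, e.g. intersecting it with each object's bounding box and taking the closest hit.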
2022.3.22
running:
python scripts/demo_inference.py --cfg configs/halpe_coco_wholebody_136/resnet/256x192_res50_lr1e-3_2x-dcn-combined.yaml --checkpoint pretrained_models/multi_domain_fast50_dcn_combined_256x192.pth --vis --webcam 6
The model here is FastPose (DCN), expecting around 10 fps. Other models are listed in the AlphaPose model zoo docs.
The `--webcam` parameter selects the input stream by device id:

id | camera |
---|---|
0 | laptop webcam |
5 | depth or stereo stream of the D435i |
6 | color stream of the D435i |
2022.3.23
zhiyuan: gesture-guided pick-and-place {learning from demonstration}
use hand orientation to disambiguate the intended action
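One possible way to define "hand orientation" from the 21 hand keypoints (a sketch only; it assumes the standard OpenPose-style hand ordering, where index 0 is the wrist and index 9 is the middle-finger MCP joint):

```python
import numpy as np

def hand_orientation(hand_kps):
    """Rough palm/pointing direction of one hand.

    hand_kps: (21, 2) or (21, 3) array of hand keypoints, assuming the
    OpenPose-style ordering (0 = wrist, 9 = middle-finger MCP).
    Returns a unit vector from the wrist towards the middle-finger MCP.
    """
    hand_kps = np.asarray(hand_kps, dtype=float)
    v = hand_kps[9] - hand_kps[0]
    return v / np.linalg.norm(v)
```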
Error: when using a .mp4 file as input, the detection batch size (default 5) needs to be reduced to prevent crashing; setting it to 1 works:
python scripts/demo_inference.py --cfg configs/halpe_coco_wholebody_136/resnet/256x192_res50_lr1e-3_2x-dcn-combined.yaml --checkpoint pretrained_models/multi_domain_fast50_dcn_combined_256x192.pth --video examples/demo/bsktb.mp4 --outdir examples/res/ --save_video --detbatch 1
2022.3.27
GPU memory overflow
check memory usage with
nvidia-smi
kill all unnecessary processes (by PID)
watch -n 0.1 -d nvidia-smi  # refresh every 0.1 s
In writer.py, the result variable is a dict that stores the image name and a result. That result is a list of length 1 whose single element is a dict of length 5, structured as:
```
{
'keypoints': tensor(136x1),
'kp_score': tensor,
'proposal_score': tensor,
'idx': 1x1,
'box': 1x4
}
//26 body keypoints
{0, "Nose"},
{1, "LEye"},
{2, "REye"},
{3, "LEar"},
{4, "REar"},
{5, "LShoulder"},
{6, "RShoulder"},
{7, "LElbow"},
{8, "RElbow"},
{9, "LWrist"},
{10, "RWrist"},
{11, "LHip"},
{12, "RHip"},
{13, "LKnee"},
{14, "RKnee"},
{15, "LAnkle"},
{16, "RAnkle"},
{17, "Head"},
{18, "Neck"},
{19, "Hip"},
{20, "LBigToe"},
{21, "RBigToe"},
{22, "LSmallToe"},
{23, "RSmallToe"},
{24, "LHeel"},
{25, "RHeel"},
//face
{26-93, 68 Face Keypoints}
//left hand
{94-114, 21 Left Hand Keypoints}
//right hand
{115-135, 21 Right Hand Keypoints}
```
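Given the index layout above, the keypoints needed for the sight line could be picked out roughly like this (a sketch: it assumes the keypoints are stored as (x, y) pairs and that the right-hand block follows the standard 21-keypoint hand ordering, where slot 8 is the index fingertip, i.e. global index 115 + 8 = 123):

```python
import numpy as np

# Halpe-136 indices (see the list above): 1 = LEye, 2 = REye.
# 115 is the first right-hand keypoint; assuming slot 8 of the 21 hand
# keypoints is the index fingertip, the right index fingertip is 115 + 8.
L_EYE, R_EYE = 1, 2
R_INDEX_TIP = 115 + 8

def sight_line_pixels(person):
    """Eye-center and fingertip pixel coordinates from one AlphaPose result
    (the length-5 dict described above)."""
    # Assuming 'keypoints' holds (x, y) per joint; adjust the reshape if the
    # tensor layout differs in this AlphaPose version.
    kps = np.asarray(person['keypoints']).reshape(-1, 2)
    eye_center = (kps[L_EYE] + kps[R_EYE]) / 2.0
    fingertip = kps[R_INDEX_TIP]
    return eye_center, fingertip
```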