Skip to content

You can call Using Sapiens to get seg,normal,pose,depth,mask

License

Notifications You must be signed in to change notification settings

smthemex/ComfyUI_Sapiens

Repository files navigation

ComfyUI_Sapiens

You can call Using Sapiens to get seg,normal,pose,depth,mask maps.

Sapiens From: facebookresearch/sapiens

Update-2024/12/01

  • 基于COCOfullbody编码 ,单独使用pose模型时,可选pose的5种分离模式,分别是躯干,下肢,手,上肢,头部,对应选择seg_select 的编号分别是(21.torso,4.Left_foot,5.Left_Hand,6.Left_lower_arm,3.Face_Neck),这5种也可以自由组合,全选默认输出所有pose;
  • Based on COCOfullbody encoding, when using the pose model alone, five separation modes of pose can be selected, namely ' torso, lower limbs, hands,lower_arm, and head.' The corresponding selection numbers for 'seg_ select' are (21. Torso, 4. Left_Foot, 5. Left_Hand,6.Left_lower_arm, 3. Face_Neck). These five modes can also be freely combined, and selecting all will output all poses by default;

previous update

  • Fixed bug where SEG cannot be used to separate normals, poses, and depths, and added a button to save pose npy files;
  • try add MPS support or no cuda user..
  • Add model uninstallation code for easy connection to other nodes. Thanks to @lyxkilo's code, it is possible to convert the FP32 model to an FP16 model (smaller, around 1B SEG 2G),The first run will generate an fp16 model file with the same name, and there is no need to enable the fp16 generation button after generation.
  • 修复无法利用SEG分离法线,姿态和深度的bug,新增保存姿态npy文件按钮;
  • 加入模型卸载代码,便于连接其他节点,感谢@lyxkilo 的代码,通过他的代码可以将fp32的模型转为fp16模型(更小,1B seg 2G左右),首次运行会生成一个同名加fp16模型文件,生成后不需要再开启fp16生成按钮;
  • seg选择人体部位的方式是数字加“,”,例如 2,1,11,注意逗号是英文符号。

1.Installation

In the ./ComfyUI /custom_node directory, run the following:

git clone https://github.com/smthemex/ComfyUI_Sapiens

2.requirements

pip install -r requirements.txt

If some modules missing, please pip install #ultralytics yolov8

3.checkpoints

3.1 base :(choice repo_id or ckpt_name)

  • 3.1.1 #sapiens
    only support torchscript version now,you can choice 1b,0.3b,0.6b,2b,do not changge ckpt's name!!!
    只支持torchscript的版本,但是有多种模型可选,最好质量的是1b或者2B,如果模型选择全是none,会自动下载一个1B的seg,下载后不要改模型名字;
    seg
    pose
    depth
    normal
├── ComfyUI/models/sapiens/
|     ├── seg/sapiens_1b_goliath_best_goliath_mIoU_7994_epoch_151_torchscript.pt2
|     ├── pose/sapiens_1b_goliath_best_goliath_AP_639_torchscript.pt2
|     ├── normal/sapiens_0.3b_normal_render_people_epoch_66_torchscript.pt2
|     ├── depth/sapiens_1b_render_people_epoch_88_torchscript.pt2
  • 3.1.2 yolo # if using pose yolo是pose必须的,官方的太复杂,不如yolo好用;
    yolov8m
├── ComfyUI/models/sapiens/
|     ├── yolov8m.pt

4 Example

seg body

  • 可以选人体部位27种类型 (Latest version)
  • only pose 仅使用pose模型分离躯干;

Citation

facebookresearch/sapiens

@article{khirodkar2024sapiens,
  title={Sapiens: Foundation for Human Vision Models},
  author={Khirodkar, Rawal and Bagautdinov, Timur and Martinez, Julieta and Zhaoen, Su and James, Austin and Selednik, Peter and Anderson, Stuart and Saito, Shunsuke},
  journal={arXiv preprint arXiv:2408.12569},
  year={2024}
}

About

You can call Using Sapiens to get seg,normal,pose,depth,mask

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages