Embodied AI

Survey

  • Neural Fields in Robotics: A Survey, arXiv, 2410.20220, arxiv, pdf, citation: -1

    Muhammad Zubair Irshad, Mauro Comi, Yen-Chen Lin, ..., Zsolt Kira, Jonathan Tremblay · (robonerf.github)

Embodied AI

  • OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints, arXiv, 2501.03841, arxiv, pdf, citation: -1

    Mingjie Pan, Jiyao Zhang, Tianshu Wu, ..., Wenlong Gao, Hao Dong · (omnimanip.github)

  • Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding, arXiv, 2501.04693, arxiv, pdf, citation: -1

    Joshua Jones, Oier Mees, Carmelo Sferrazza, ..., Pieter Abbeel, Sergey Levine

  • 🌟 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation, arXiv, 2501.01895, arxiv, pdf, citation: -1

    Siyuan Huang, Liliang Chen, Pengfei Zhou, ..., Maoqing Yao, Guanghui Ren · (sites.google)

  • Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models, arXiv, 2412.14058, arxiv, pdf, citation: -1

    Xinghang Li, Peiyan Li, Minghuan Liu, ..., Hanbo Zhang, Huaping Liu

  • Code-as-Monitor: Constraint-aware Visual Programming for Reactive and Proactive Robotic Failure Detection, arXiv, 2412.04455, arxiv, pdf, citation: -1

    Enshen Zhou, Qi Su, Cheng Chi, ..., Lu Sheng, He Wang · (zhoues.github)

  • Moto: Latent Motion Token as the Bridging Language for Robot Manipulation, arXiv, 2412.04445, arxiv, pdf, citation: -1

    Yi Chen, Yuying Ge, Yizhuo Li, ..., Ying Shan, Xihui Liu · (chenyi99.github) · (Moto - TencentARC)

  • 🌟 Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation, arXiv, 2412.06531, arxiv, pdf, citation: -1

    Egor Cherepanov, Nikita Kachaev, Artem Zholus, ..., Alexey K. Kovalev, Aleksandr I. Panov

  • Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making, arXiv, 2410.07166, arxiv, pdf, citation: 1

    Manling Li, Shiyu Zhao, Qineng Wang, ..., Jiayuan Mao, Jiajun Wu · (embodied-agent-interface.github) · (embodied-agent-eval - embodied-agent-eval) · (𝕏)

  • foundation models for the physical world.

  • DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution, arXiv, 2411.02359, arxiv, pdf, citation: -1

    Yang Yue, Yulin Wang, Bingyi Kang, ..., Jiashi Feng, Gao Huang · (DeeR-VLA - yueyang130)

  • DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation, arXiv, 2411.04999, arxiv, pdf, citation: -1

    Peiqi Liu, Zhanqiu Guo, Mohit Warke, ..., Nur Muhammad Mahi Shafiullah, Lerrel Pinto · (dynamem.github)

  • A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks, arXiv, 2410.22391, arxiv, pdf, citation: -1

    Thomas Schmied, Thomas Adler, Vihang Patil, ..., Razvan Pascanu, Sepp Hochreiter · (arxiv) · (huggingface) · (LRAM - ml-jku)

  • VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions, arXiv, 2410.20927, arxiv, pdf, citation: -1

    Guanyan Chen, Meiling Wang, Te Cui, ..., Yi Yang, Yufeng Yue

  • VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI, arXiv, 2410.11623, arxiv, pdf, citation: -1

    Sijie Cheng, Kechen Fang, Yangyang Yu, ..., Lei Han, Yang Liu

Robotics

Humanoids

Projects

Misc