loading page

DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
  • +6
  • Wenhao Sun,
  • Sai Hou,
  • Zixuan Wang,
  • Bo Yu,
  • Shaoshan Liu,
  • Xu Yang,
  • Shuai Liang,
  • Yiming Gan,
  • Yinhe Han
Wenhao Sun
Institute of Computing Technology Chinese Academy of Sciences
Author Profile
Sai Hou
Beijing Institute of Technology
Author Profile
Zixuan Wang
University of the Chinese Academy of Sciences
Author Profile
Bo Yu
Shenzhen Institute of Artificial Intelligence and Robotics for Society
Author Profile
Shaoshan Liu
Shenzhen Institute of Artificial Intelligence and Robotics for Society
Author Profile
Xu Yang
Beijing Institute of Technology
Author Profile
Shuai Liang
Institute of Computing Technology Chinese Academy of Sciences
Author Profile
Yiming Gan
Institute of Computing Technology Chinese Academy of Sciences

Corresponding Author:ganyiming@ict.ac.cn

Author Profile
Yinhe Han
Institute of Computing Technology Chinese Academy of Sciences
Author Profile

Abstract

Performing complex tasks in open environments remains challenging for robots, even when using large language models (LLMs) as the core planner. Many LLM-based planners are inefficient due to their large number of parameters and prone to inaccuracies because they operate in open-loop systems. We think the reason is that only applying LLMs as planners is insufficient. In this work, we propose DaDu-E, a robust closed-loop planning framework for embodied AI robots. Specifically, DaDu-E is equipped with a relatively lightweight LLM, a set of encapsulated robot skill instructions, a robust feedback system, and memory augmentation. Together, these components enable DaDu-E to (i) actively perceive and adapt to dynamic environments, (ii) optimize computational costs while maintaining high performance, and (iii) recover from execution failures using its memory and feedback mechanisms. Extensive experiments on real-world and simulated tasks show that DaDu-E achieves task success rates comparable to embodied AI robots with larger models as planners like COME-Robot, while reducing computational requirements by 6 .6×. Users are encouraged to explore our system at: https://rlc-lab.github.io/dadu-e/.
01 Dec 2024Submitted to Journal of Field Robotics
05 Dec 2024Submission Checks Completed
05 Dec 2024Assigned to Editor
05 Dec 2024Review(s) Completed, Editorial Evaluation Pending
19 Dec 2024Reviewer(s) Assigned