From text to motion: grounding GPT-4 in a humanoid robot “Alter3”
This paper introduces Alter3, a humanoid robot that demonstrates spontaneous motion generation through the integration of GPT-4, a cutting-edge Large Language Model (LLM). This integration overcomes the challenge of applying LLMs to direct robot control, which typically struggles with the hardware-specific nuances of robotic operation. By translating linguistic descriptions of human actions into robotic movements via programming, Alter3 can autonomously perform a diverse range of actions, such as adopting a “selfie” pose or simulating a “ghost.” This approach not only shows Alter3’s few-shot learning capabilities but also its adaptability to verbal feedback for pose adjustments without manual fine-tuning. This research advances the field of humanoid robotics by bridging linguistic concepts with physical embodiment and opens new avenues for exploring spontaneity in humanoid robots.
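The abstract describes a text-to-motion pipeline in which GPT-4 translates a linguistic description of an action into robot movements “via programming”, with verbal feedback used to adjust the resulting pose. The following is a minimal, hypothetical sketch of such a pipeline, not the paper’s actual Alter3 interface: the OpenAI client call is real, but the system prompt, the JSON axis-command format, and `send_to_robot` are illustrative assumptions.

```python
# Hypothetical sketch only: GPT-4 maps an action description to joint-axis
# commands, which a hardware layer would then execute. The prompt format, axis
# numbering, and send_to_robot are assumptions, not the Alter3 API.
import json
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

SYSTEM_PROMPT = (
    "You control a humanoid robot with numbered joint axes. "
    "Given an action description, reply ONLY with a JSON list of "
    'objects of the form {"axis": <int>, "value": <float between 0 and 1>}.'
)

def action_to_commands(action: str, feedback: list[str] | None = None) -> list[dict]:
    """Translate an action description (plus optional verbal feedback) into axis commands."""
    messages = [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": f"Action: {action}"},
    ]
    # Verbal feedback ("raise the arm higher") is appended as extra user turns,
    # so the pose can be adjusted without manually fine-tuning the model.
    for note in feedback or []:
        messages.append({"role": "user", "content": f"Adjustment: {note}"})
    response = client.chat.completions.create(model="gpt-4", messages=messages)
    return json.loads(response.choices[0].message.content)

def send_to_robot(commands: list[dict]) -> None:
    """Placeholder for the hardware layer; a real controller would drive each axis."""
    for cmd in commands:
        print(f"set axis {cmd['axis']} -> {cmd['value']}")

if __name__ == "__main__":
    send_to_robot(action_to_commands("adopt a selfie pose"))
    send_to_robot(action_to_commands("adopt a selfie pose", feedback=["raise the arm higher"]))
```

Feeding corrections back in as additional messages is one plausible reading of the verbal-feedback loop mentioned in the abstract; the axis-command representation here is purely illustrative.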
| Main Authors: | Takahide Yoshida, Atsushi Masumori, Takashi Ikegami |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Frontiers Media S.A., 2025-05-01 |
| Series: | Frontiers in Robotics and AI |
| Subjects: | humanoid robot; large language models; motion generation; embodiment; agency |
| Online Access: | https://www.frontiersin.org/articles/10.3389/frobt.2025.1581110/full |
| author | Takahide Yoshida; Atsushi Masumori; Takashi Ikegami |
|---|---|
| collection | DOAJ |
| description | This paper introduces Alter3, a humanoid robot that demonstrates spontaneous motion generation through the integration of GPT-4, a cutting-edge Large Language Model (LLM). This integration overcomes the challenge of applying LLMs to direct robot control, which typically struggles with the hardware-specific nuances of robotic operation. By translating linguistic descriptions of human actions into robotic movements via programming, Alter3 can autonomously perform a diverse range of actions, such as adopting a “selfie” pose or simulating a “ghost.” This approach not only shows Alter3’s few-shot learning capabilities but also its adaptability to verbal feedback for pose adjustments without manual fine-tuning. This research advances the field of humanoid robotics by bridging linguistic concepts with physical embodiment and opens new avenues for exploring spontaneity in humanoid robots. |
| format | Article |
| id | doaj-art-212570ee8fc645b0be489e03221f967a |
| institution | Kabale University |
| issn | 2296-9144 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | Frontiers Media S.A. |
| record_format | Article |
| series | Frontiers in Robotics and AI |
| doi | 10.3389/frobt.2025.1581110 |
| volume | 12 |
| author affiliations | Takahide Yoshida: Graduate School of Arts and Sciences, The University of Tokyo, Tokyo, Japan; Atsushi Masumori: Graduate School of Arts and Sciences, The University of Tokyo, Tokyo, Japan and Alternative Machine Inc., Tokyo, Japan; Takashi Ikegami: Graduate School of Arts and Sciences, The University of Tokyo, Tokyo, Japan and Alternative Machine Inc., Tokyo, Japan |
| title | From text to motion: grounding GPT-4 in a humanoid robot “Alter3” |
| topic | humanoid robot; large language models; motion generation; embodiment; agency |
| url | https://www.frontiersin.org/articles/10.3389/frobt.2025.1581110/full |