#Humanoid Robotics

5개의 포스트

[논문리뷰] UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

본 논문은 시각적 앵커링을 통해 이질적인 동작들을 공통 잠재 공간으로 정렬하는 UniT를 제안합니다. UniT는 시각적, 동작적, 융합적 세 가지 브랜치로 구성된 트리 브랜치(tri-branch) 아키텍처를 가지며, 모든 브랜치는 Residual Quantization(RQ-VAE)을 통해 공유 코드북(shared codebook)으로 양자화됩니다 .

#Review #Humanoid Robotics #Vision-Language-Action Models #Cross-Embodiment Transfer #Latent Action Tokenizer #World Modeling #Visual Anchoring #Cross-Reconstruction

2026년 4월 23일

[논문리뷰] Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

본 연구는 인간형 로봇이 온보드 센서만을 사용하여 새로운 객체를 새로운 환경에서 자율적으로 로코-조작(loco-manipulate) 하는 능력을 개발하는 것을 목표로 합니다. 특히, 정확한 엔드-이펙터(EE) 제어 와 오픈-보케뷸러리 대규모 시각 모델 을 통한 장면 이해의 일반화라는 핵심 난제를 해결하고자 합니다.

#Review #Humanoid Robotics #End-Effector Control #Loco-Manipulation #Open-Vocabulary Perception #Visual Generalization #Sim2Real Transfer #Residual Learning #Robot Grasping

2026년 2월 18일

[논문리뷰] EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration

본 논문은 로봇 텔레오퍼레이션의 한계(높은 비용, 복잡성, 환경 제약)로 인해 부족한 휴머노이드 로코-조작(loco-manipulation) 데이터 문제를 해결하고자 합니다.

#Review #Humanoid Robotics #Loco-Manipulation #Egocentric Demonstration #Robot-Free Learning #Cross-Embodiment Transfer #View Alignment #Action Alignment #VLA Co-training

2026년 2월 12일

[논문리뷰] Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

본 논문은 로봇 학습의 고질적인 문제인 데이터의 이질성, 낮은 품질, 그리고 행동 모방 (Behavior Cloning, BC)의 장기 태스크 한계를 해결하고자 합니다.

#Review #Vision-Language-Action #Generalist Robots #Staged Training #Reinforcement Learning #Multi-embodiment #Data Quality #Humanoid Robotics #Flow Matching

2026년 2월 2일

[논문리뷰] TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System

휴머노이드 로봇 분야에서 대규모 데이터 수집의 비효율성 과 기존 텔레오퍼레이션 시스템의 한계 를 극복하는 것입니다.

#Review #Humanoid Robotics #Data Collection #Teleoperation #Full-Body Control #Visuomotor Policy Learning #VR #Portable MoCap-Free

2025년 11월 9일