#Latent Actions

6개의 포스트

[논문리뷰] ABot-M0.5: Unified Mobility-and-Manipulation World Action Model

본 논문은 모바일 매니퓰레이션(mobile manipulation) 환경에서 기존의 Embodied Learning 방식들이 겪는 구조적 한계를 해결하고자 합니다.

#Review #Mobile Manipulation #World Action Model #Conditional Flow Matching #Latent Actions #Mixture-of-Transformers #Dream Forcing

2026년 7월 1일

[논문리뷰] Olaf-World: Orienting Latent Actions for Video World Modeling

본 논문은 액션 레이블의 희소성으로 인해 액션-제어 가능한 월드 모델의 확장이 제한되는 문제를 해결하고자 합니다.

#Review #Video World Models #Latent Actions #Cross-context Transfer #Zero-shot Action Transfer #Data-efficient Adaptation #Self-supervised Learning #Representation Alignment

2026년 2월 10일

[논문리뷰] Self-Improving World Modelling with Latent Actions

본 논문은 액션이 레이턴트 변수로 취급되는 상태-온리 시퀀스 로부터 LLM(Large Language Models) 및 VLM(Vision-Language Models)의 내재적 월드 모델링 능력을 향상시키는 것을 목표로 합니다.

#Review #World Modeling #Latent Actions #Self-Improvement #Reinforcement Learning #LLMs #VLMs #Inverse Dynamics Model #Forward World Modelling

2026년 2월 8일

[논문리뷰] Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight

본 논문은 기존 Vision-Language-Action (VLA) 모델의 한계인 희소한 행동 감독 신호, 과도한 시각 상태 예측 비용, 정보 병목 현상, 그리고 언어 감독 부족으로 인한 이해 및 추론 능력 저하를 해결하고자 합니다.

#Review #Vision-Language-Action (VLA) Models #Visual Foresight #Diffusion Transformer (DiT)#Robotics #Multimodal Learning #Adaptive Temporal Ensemble #Latent Actions

2025년 11월 23일

[논문리뷰] iFlyBot-VLA Technical Report

iFlyBot-VLA는 장기적인 로봇 조작 작업을 위한 대규모 Vision-Language-Action (VLA) 모델 을 개발하는 것을 목표로 합니다.

#Review #Vision-Language-Action Models #Robotics #Imitation Learning #Latent Actions #Diffusion Models #Dual-Arm Manipulation #Pretraining #Flow-Matching

2025년 11월 9일

[논문리뷰] villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

본 논문은 Vision-Language-Action (VLA) 모델에서 로봇 조작 정책 학습을 위한 잠재 행동(latent actions) 모델링을 개선하는 새로운 프레임워크인 villa-X 를 제안합니다.

#Review #Vision-Language-Action Models #Latent Actions #Robot Manipulation #Pre-training #Diffusion Models #Proprioceptive Feedback #Foundation Models

2025년 8월 2일