#Imitation Learning

36개의 포스트

[논문리뷰] SIEVE: Structure-Aware Data Selection for Imitation Learning with VLA Models

본 논문은 대규모 로봇 데모 데이터셋에 존재하는 중복성, 노이즈, 불균일한 작업 커버리지 문제를 해결하기 위해 구조적 데이터 선택 프레임워크인 SIEVE를 제안한다 .

#Review #Vision-Language-Action Models #Imitation Learning #Data Selection #Primitive Discovery #Structural Exposure #Behavior Cloning

2026년 7월 7일

[논문리뷰] RynnWorld-Teleop: An Action-Conditioned World Model for Digital Teleoperation

본 논문은 로봇 학습을 위한 대규모 데이터 수집이 물리적 teleoperation의 물리적 제약과 자원 한계로 인해 병목 현상을 겪고 있다는 문제를 해결하고자 합니다.

#Review #Digital Teleoperation #World Model #Robotic Learning #Video Diffusion Transformer #Action-Conditioned Generation #Sim2Real Transfer #Imitation Learning

2026년 7월 7일

[논문리뷰] Translation as a Bridging Action: Transferring Manipulation Skills from Humans to Robots

본 논문은 인간의 풍부한 동작 데이터를 로봇 학습에 활용할 때 발생하는 데이터 간 불일치 및 품질 문제를 해결하고자 한다.

#Review #Robotics #Imitation Learning #Vision-Language-Action Models #Cross-Embodiment #Manipulation Skills #Bridging Action

2026년 6월 28일

[논문리뷰] LIMMT: Less is More for Motion Tracking

본 논문은 휴머노이드 모션 트래킹 학습에서 무분별한 데이터 확장(Data Scaling)이 오히려 성능 저하를 초래한다는 문제점을 지적합니다.

#Review #Motion Tracking #Humanoid Robot #Data-Centric AI #Physics-based Simulation #Imitation Learning #Data Curation

2026년 6월 7일

[논문리뷰] DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo

본 논문은 기존의 로봇 조작 벤치마크가 단순한 그리퍼 중심의 환경에 치중되어 있어, 진정한 의미의 인간 수준(Human-level) 조작 능력을 평가하는 데 한계가 있다는 문제 의식에서 출발합니다.

#Review #Dexterous Manipulation #Robotics Benchmark #Teleoperation #Imitation Learning #Vision-Language-Action Models #MuJoCo #Domain Randomization

2026년 5월 17일

[논문리뷰] Learning to Communicate Locally for Large-Scale Multi-Agent Pathfinding

본 연구는 대규모 다중 에이전트 시스템에서 충돌 없는 경로 탐색을 효율적으로 수행하기 위한 탈중앙화 MAPF 솔루션의 한계를 극복하고자 합니다.

#Review #Multi-Agent Pathfinding #Imitation Learning #Transformer #Decentralized Coordination #Local Communication #Scalability

2026년 5월 14일

[논문리뷰] IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation

본 논문은 프레임 단위로만 조건을 부여하는 기존 VLA 모델들이 부분 관측성(Partial Observability) 하에서 발생하는 짧은 기간의 의도 모호성 문제를 해결하지 못한다는 점을 지적합니다.

#Review #Vision-Language-Action (VLA)#Robot Manipulation #AliasBench #Short-Horizon Intent #Imitation Learning #Inter-chunk Consistency #Partial Observability

2026년 5월 14일

[논문리뷰] Revisiting DAgger in the Era of LLM-Agents

본 논문은 장기 상호작용을 수행하는 LLM 에이전트의 사후 학습(Post-training) 단계에서 발생하는 고질적인 분포 불일치 문제를 해결하고자 합니다.

#Review #LLM-Agents #DAgger #Covariate Shift #Multi-Turn Interaction #Post-Training #Imitation Learning

2026년 5월 13일

[논문리뷰] Agentic Critical Training

본 논문은 LLM 에이전트가 단순한 모방을 넘어, 행동의 품질에 대한 자율적인 비판적 추론 및 진정한 자기 성찰 능력 을 개발하도록 훈련시키는 것을 목표로 합니다. 기존 모방 학습(IL)이 '무엇을 할지'만 가르치고 '왜 그 행동이 더 나은지'에 대한 이해가 부족하다는 한계를 해결하고자 합니다.

#Review #LLM Agents #Reinforcement Learning #Imitation Learning #Self-Reflection #Action Quality #Out-of-Distribution Generalization #Critical Reasoning #GRPO

2026년 3월 9일

[논문리뷰] Demystifying Action Space Design for Robotic Manipulation Policies

로봇 조작 정책 학습에서 액션 공간 설계가 주로 경험적 휴리스틱에 의해 이루어져 최적화 및 안정성에 대한 체계적인 이해가 부족한 문제를 해결하는 것입니다. 이 연구는 시간적(절대값 vs. 델타) 및 공간적(조인트-공간 vs.

#Review #Robotic Manipulation #Action Space Design #Imitation Learning #Delta Actions #Joint Space Control #Task Space Control #Generalization #Control Stability

2026년 3월 8일

[논문리뷰] RoboPocket: Improve Robot Policies Instantly with Your Phone

이 논문은 로봇 모방 학습의 핵심 제약인 비효율적인 데이터 수집과 느린 정책 반복 과정을 해결하고자 합니다.

#Review #Robot Learning #Imitation Learning #Policy Iteration #Augmented Reality #Visual Foresight #Data Collection #Human-in-the-Loop #Smartphone

2026년 3월 5일

[논문리뷰] World Guidance: World Modeling in Condition Space for Action Generation

본 논문은 Vision-Language-Action (VLA) 모델이 효율적이고 예측 가능한 미래 표현을 유지하면서 정밀한 액션 생성을 위한 충분한 세분화된 정보를 보존하는 데 어려움을 겪는 문제를 해결합니다.

#Review #World Model #Action Generation #Vision-Language-Action Models (VLA)#Condition Space #Imitation Learning #Robotics #Generalization #Human Manipulation

2026년 2월 25일

[논문리뷰] VLANeXt: Recipes for Building Strong VLA Models

파편화되어 있는 Vision-Language-Action (VLA) 모델 연구 분야에 구조를 제공하고, 일관된 프레임워크와 평가 환경에서 VLA 모델의 설계 공간을 체계적으로 재검토하는 것을 목표로 합니다.

#Review #Vision-Language-Action Models #Robotics #Imitation Learning #Foundation Models #Ablation Study #Generalization #LIBERO Benchmark #Time-Series Forecasting

2026년 2월 23일

[논문리뷰] RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning

로봇 학습을 위한 비디오 생성 모델 기반 합성 데이터는 액션 품질의 일관성 부족과 물리적 정확성 검증의 어려움으로 인해 제한적인 성능을 보입니다.

#Review #Robot Learning #Synthetic Data Generation #Action Verification #Neural Trajectory #Video Generative Models #Imitation Learning #Data Diversity

2026년 2월 23일

[논문리뷰] χ_{0}: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies

본 논문은 장시간 로봇 조작 태스크에서 발생하는 분포 불일치(distributional inconsistencies) 문제를 해결하여 생산 수준의 로봇 강건성(robustness)을 달성하는 것을 목표로 합니다.

#Review #Robotic Manipulation #Distributional Shift #Imitation Learning #Model Arithmetic #Stage Advantage #Train-Deploy Alignment #Resource-Efficient AI #Long-Horizon Tasks

2026년 2월 12일

[논문리뷰] InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions

논문은 물리 기반 휴머노이드 로봇이 고수준의 목표만으로도 다양한 객체와 상호작용하는 복잡한 로코-조작(loco-manipulation) 행동을 생성하고 일반화하는 데 있어 기존 방법론의 확장성 및 견고성 한계를 해결하고자 합니다.

#Review #Human-Object Interaction #Physics-Based Simulation #Generative Control #Reinforcement Learning #Imitation Learning #Variational Policy #Failure Recovery #Loco-Manipulation

2026년 2월 5일

[논문리뷰] Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning

본 논문은 대규모 사전 훈련된 비디오 생성 모델 의 시공간적 사전 지식을 로봇 정책 학습에 활용하는 것을 목표로 합니다.

#Review #Video Models #Visuomotor Control #Robot Policy #Fine-tuning #Diffusion Models #World Models #Model-based Planning #Imitation Learning

2026년 1월 22일

[논문리뷰] SOP: A Scalable Online Post-Training System for Vision-Language-Action Models

본 논문은 대규모 사전 훈련을 통해 일반화 능력을 갖춘 Vision-Language-Action (VLA) 모델 이 실세계에서 전문가 수준의 숙련도와 확장 가능한 온라인 적응 능력을 확보하지 못하는 문제를 해결하고자 합니다.

#Review #Vision-Language-Action Models #Online Post-training #Scalable Robot Learning #Distributed Systems #Multi-task Learning #Imitation Learning #Reinforcement Learning

2026년 1월 6일

[논문리뷰] SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

본 논문은 수술 로봇 학습의 주요 병목인 시각 관측 및 정확한 로봇 움직임 데이터의 부족 문제 를 해결하고자 합니다. 대량의 수술 비디오가 존재하지만 로봇 액션 레이블이 없어 모방 학습에 직접 활용하기 어렵습니다. 따라서, 세계 모델을 통해 일반화 가능하고 데이터 효율적인 수술 로봇 정책 을 학습하는 것을 목표로 합니다.

#Review #Surgical Robotics #World Models #Video Generation #Imitation Learning #Inverse Dynamics Model #Synthetic Data #Vision-Language-Action Models #Data Scarcity

2025년 12월 29일

[논문리뷰] Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

논문은 복잡한 3D 오픈 월드 환경 에서 인간 수준의 효율성으로 수 시간 길이의 미션을 실시간으로 완수할 수 있는 제너럴리스트 에이전트 를 구축하기 위한 '오픈 레시피'인 Lumine을 제시합니다.

#Review #Generalist Agent #3D Open World #Vision-Language Model #Imitation Learning #Real-time Inference #Hybrid Thinking #Action Chunking #Genshin Impact

2025년 11월 12일

[논문리뷰] iFlyBot-VLA Technical Report

iFlyBot-VLA는 장기적인 로봇 조작 작업을 위한 대규모 Vision-Language-Action (VLA) 모델 을 개발하는 것을 목표로 합니다.

#Review #Vision-Language-Action Models #Robotics #Imitation Learning #Latent Actions #Diffusion Models #Dual-Arm Manipulation #Pretraining #Flow-Matching

2025년 11월 9일

[논문리뷰] Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving

본 논문은 자율주행 시스템에서 기존 모방 학습 기반 VLA(Vision-Language-Action) 모델 이 물리적 규칙 및 안전 제약 조건을 내재적으로 인코딩하는 데 어려움을 겪는 문제를 해결하고자 합니다.

#Review #Autonomous Driving #Vision-Language-Action Models #Discrete Diffusion #Reflection Mechanism #Trajectory Generation #Safety Constraints #Imitation Learning

2025년 9월 26일

[논문리뷰] Do You Need Proprioceptive States in Visuomotor Policies?

본 연구는 로봇의 시각-운동 정책(visuomotor policies)에서 고유 수용성 상태(proprioceptive states)의 필요성을 재평가하고, 기존 상태 기반 정책이 학습 궤적에 과적합되어 공간 일반화 능력이 저해되는 문제를 해결하고자 합니다.

#Review #Visuomotor Policies #Spatial Generalization #Imitation Learning #Proprioception #State-free Policies #Robot Manipulation #End-Effector Control #Data Efficiency

2025년 9월 24일

[논문리뷰] FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies

본 논문은 현재 Vision-Language-Action (VLA) 정책의 높은 계산 비용과 자원 요구사항 문제를 해결하고자 합니다. 특히, 수십억 개의 파라미터를 가진 대규모 모델 없이도 강력한 성능을 달성하는 효율적인 일반화 로봇 정책을 개발하는 것을 목표로 합니다.

#Review #Generalist Robot Policies #Vision-Language-Action Models #Efficient AI #Imitation Learning #Diffusion Models #Intermediate Fusion #Robotics

2025년 9월 15일

[논문리뷰] Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

이 논문은 대규모 언어 모델(LLM) 정렬(alignment) 방법론의 한계를 해결하고자 합니다. 기존 방법론들( SFT, DPO, PPO, GRPO )은 특정 정렬 방식에 고정되거나 정량적 지표만을 최적화하여 일반화 및 견고성 측면에서 부족함을 보였습니다.

#Review #LLM Alignment #Reinforcement Learning from Human Feedback #Preference Learning #Group Relative Alignment Optimization #Self-Optimization #Mixture-of-Experts #Imitation Learning

2025년 8월 14일

[논문리뷰] AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance

대규모 언어 모델(LLM)이 추론 태스크에서 겪는 catastrophic forgetting 및 모방(imitation) 과 탐색(exploration) 간의 최적화되지 않은 트레이드오프 문제를 해결하는 것이 목표입니다.

#Review #Large Language Models #Fine-tuning #Reinforcement Learning #Meta-learning #Adaptive Control #Imitation Learning #Exploration #Reasoning

2025년 8월 14일

[논문리뷰] Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

본 논문은 일반 로봇 정책의 제한된 일반화 능력의 근본 원인을 규명하고자 합니다. 특히, 태스크와 관련 없는 특징에 의존하는 숏컷 학습(shortcut learning)이 일반화의 주요 장애물인지 조사합니다.

#Review #Robot Learning #Generalization #Shortcut Learning #Dataset Diversity #Dataset Fragmentation #Data Augmentation #Imitation Learning

2025년 8월 12일

[논문리뷰] Exploring Conditions for Diffusion models in Robotic Control

본 논문은 사전 훈련된 텍스트-투-이미지 diffusion 모델 을 로봇 제어에 활용하여 태스크 적응형 시각 표현 을 얻는 것을 목표로 합니다.

#Review #Diffusion Models #Robotic Control #Imitation Learning #Task-Adaptive Representations #Visual Prompts #Text-to-Image #Conditioning #Behavior Cloning

2025년 10월 31일

[논문리뷰] ACG: Action Coherence Guidance for Flow-based VLA models

본 논문은 모방 학습을 통해 훈련된 Vision-Language-Action (VLA) 모델, 특히 Diffusion 및 Flow Matching 모델 에서 발생하는 액션 불일치(jerks, pauses, jitter) 문제를 해결하여 안정성과 궤적 드리프트로 인한 정밀 조작 실패를 방지하는 것을 목표로 합니다.

#Review #Action Coherence #Flow Matching #VLA Models #Guidance #Robotics #Imitation Learning #Transformer #Self-Attention

2025년 10월 28일

[논문리뷰] Unified Reinforcement and Imitation Learning for Vision-Language Models

본 논문은 대규모 Vision-Language Models (VLMs) 의 비효율성을 해결하기 위해, 리소스가 제한된 환경에서도 강력하고 경량화된 VLM을 구축하는 효율적인 훈련 알고리즘 Unified Reinforcement and Imitation Learning (RIL) 을 제안합니다.

#Review #Vision-Language Models #Reinforcement Learning #Imitation Learning #Model Distillation #Lightweight VLMs #LLM-as-a-Judge #Multimodal Learning

2025년 10월 23일

[논문리뷰] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

이 논문은 기존 few-step 확산 및 흐름 기반 생성 모델의 증류 과정에서 발생하는 품질-다양성 트레이드오프 와 복잡한 훈련 절차 문제를 해결하고자 합니다.

#Review #Diffusion Models #Flow Matching #Generative Models #Model Distillation #Imitation Learning #Few-Step Generation #Policy-Based AI #Text-to-Image

2025년 10월 17일

[논문리뷰] CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving

본 논문은 모방 학습(IL)에만 의존하는 자율주행 모델이 겪는 일반화 성능 저하 및 롱테일 시나리오 대응 문제 를 해결하고자 합니다. 또한, 강화 학습(RL)의 샘플 비효율성 및 불안정한 수렴 문제를 극복하기 위해, IL과 RL을 효과적으로 통합 하여 보다 견고하고 일반화된 자율주행 정책을 개발하는 것을 목표로 합니다.

#Review #Autonomous Driving #Imitation Learning #Reinforcement Learning #World Models #Latent Space #Dual-Policy #Competitive Learning

2025년 10월 16일

[논문리뷰] Robot Learning: A Tutorial

이 튜토리얼은 현대 로봇 학습의 발전 과정을 종합적으로 안내하여, 연구자와 실무자가 로봇 학습 분야의 개념적 이해와 실제 도구를 습득하도록 돕는 것을 목표로 합니다.

#Review #Robot Learning #Reinforcement Learning #Imitation Learning #Behavioral Cloning #Vision-Language-Action Models #Diffusion Models #Transformers #LeRobot

2025년 10월 15일

[논문리뷰] R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation

본 연구는 로봇 매니퓰레이션에서 공간적 일반화 를 위한 방대한 인간 시연 데이터 의 필요성을 해결하고자 합니다.

#Review #Robotic Manipulation #Data Augmentation #Spatial Generalization #3D Data Generation #Imitation Learning #Point Cloud #Real-to-Real #Mobile Manipulation

2025년 10월 10일

[논문리뷰] Agent Learning via Early Experience

본 논문은 보상이 없거나 불명확한 환경에서 언어 에이전트 가 스스로 경험을 통해 학습하고 개선하는 데 따르는 어려움을 해결하고자 합니다.

#Review #Language Agents #Early Experience #Reward-Free Learning #World Modeling #Self-Reflection #Imitation Learning #Reinforcement Learning #Out-of-Domain Generalization

2025년 10월 10일

[논문리뷰] Boolean Satisfiability via Imitation Learning

본 논문은 CDCL(Conflict-Driven Clause Learning) SAT solver 의 핵심 구성 요소인 브랜칭 정책의 비효율성을 개선하는 것을 목표로 합니다.

#Review #Boolean Satisfiability #Imitation Learning #CDCL Solvers #Branching Policy #KeyTrace #Transformer Architecture #Perceiver AR

2025년 10월 2일