#Vision-Language-Action Models

46 posts

[Paper Review] Silent Failures in Physical AI: A Literature Review of Runtime Action Authorization for Autonomous Systems

[Paper Review] RoboSemanticBench: Diagnosing Semantic Grounding in Action Prediction for VLA Models

[Paper Review] Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

[Paper Review] StableVLA: Towards Robust Vision-Language-Action Models without Extra Data

[Paper Review] DexJoCo: A Benchmark and Toolkit for Task-Oriented Dexterous Manipulation on MuJoCo

[Paper Review] Overcoming Dynamics-Blindness: Training-Free Pace-and-Path Correction for VLA Models

[Paper Review] UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling

[Paper Review] Cortex 2.0: Grounding World Models in Real-World Industrial Deployment

[Paper Review] UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving

[Paper Review] RoboMME: Benchmarking and Understanding Memory for Robotic Generalist Policies

[Paper Review] Chain of World: World Model Thinking in Latent Motion

[Paper Review] SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models

[Paper Review] Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning

[Paper Review] BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

[Paper Review] CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion

[Paper Review] SOP: A Scalable Online Post-Training System for Vision-Language-Action Models

[Paper Review] SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

[Paper Review] Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

[Paper Review] VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation

[Paper Review] SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

[Paper Review] 10 Open Challenges Steering the Future of Vision-Language-Action Models

[Paper Review] RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

[Paper Review] Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving

[Paper Review] FLOWER: Democratizing Generalist Robot Policies with Efficient Vision-Language-Action Flow Policies

[Paper Review] VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

[Paper Review] EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

[Paper Review] Do What? Teaching Vision-Language-Action Models to Reject the Impossible

[Paper Review] VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation

[Paper Review] VLA-0: Building State-of-the-Art VLAs with Zero Modification

[Paper Review] LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

[Paper Review] Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model
