본문으로 건너뛰기

#Vision-Language-Action (VLA)

22개의 포스트

[논문리뷰] Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

댓글 수 로딩 중

[논문리뷰] IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation

댓글 수 로딩 중

[논문리뷰] MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation

댓글 수 로딩 중

[논문리뷰] Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

댓글 수 로딩 중

[논문리뷰] ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

댓글 수 로딩 중

[논문리뷰] RISE: Self-Improving Robot Policy with Compositional World Model

댓글 수 로딩 중

[논문리뷰] VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

댓글 수 로딩 중

[논문리뷰] BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation

댓글 수 로딩 중

[논문리뷰] TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

댓글 수 로딩 중

[논문리뷰] VideoVLA: Video Generators Can Be Generalizable Robot Manipulators

댓글 수 로딩 중

[논문리뷰] SwiftVLA: Unlocking Spatiotemporal Dynamics for Lightweight VLA Models at Minimal Overhead

댓글 수 로딩 중

[논문리뷰] Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

댓글 수 로딩 중

[논문리뷰] Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning

댓글 수 로딩 중

[논문리뷰] InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

댓글 수 로딩 중