[논문리뷰] From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation PriorsarXiv에 게시된 'From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors' 논문에 대한 자세한 리뷰입니다.#Review#Vision-Language-Action (VLA)#3D Spatial Reasoning#Embodied AI#Foundation Models#Multimodal Fusion#Robot Manipulation#Modality Transferability#Action Grounding2025년 10월 29일댓글 수 로딩 중
[논문리뷰] OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive SimulationJiaqi Yang이 arXiv에 게시한 'OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive Simulation' 논문에 대한 자세한 리뷰입니다.#Review#Video Avatar Generation#Cognitive Simulation#Multimodal Large Language Models (MLLMs)#Diffusion Transformers (DiT)#Multimodal Fusion#Human Motion Synthesis#Contextual Animation2025년 8월 27일댓글 수 로딩 중