#Attention Mask

2 posts

[Paper Review] RynnVLA-002: A Unified Vision-Language-Action and World Model

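This paper aims to unify a VLA model and a world model in a single framework, in order to overcome the limitations of existing VLA models (insufficient understanding of action dynamics, lack of imagination and physical knowledge) and of world models (inability to generate actions directly).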

#Review #Vision-Language-Action (VLA) Model #World Model #Robotics #Unified Framework #Multi-modal Learning #Action Generation #Attention Mask #Continuous Control

November 23, 2025

[Paper Review] VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

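This paper seeks to address two fundamental limitations of existing visual effects (VFX) generation models: the resource-intensive "one LoRA per effect" paradigm and poor generalization to unseen effects.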

#Review #VFX Generation #In-Context Learning #Diffusion Models #Video Generation #Generalization #Attention Mask #One-Shot Adaptation

October 30, 2025