#Masked Modeling

3개의 포스트

[논문리뷰] AnyMo: Scaling Any-Modality Conditional Motion Generation with Masked Modeling

본 논문은 범용적인 인간 모션 생성(Human Motion Generation) 모델이 직면한 데이터 부족 및 제어 유연성 문제를 해결하는 것을 목표로 한다. 기존 연구들은 특정 모달리티에 국한된 태스크(예: Text-to-Motion)나 고비용의 MoCap 데이터에 의존하여 확장성과 범용성이 떨어진다는 한계가 있다.

#Review #Any-Modality Conditional Motion Generation #Masked Modeling #OmniHuMo #Residual FSQ #Multimodal Motion Synthesis

2026년 5월 31일

[논문리뷰] Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

본 논문은 기존 Vision-Language-Action (VLA) 모델 디코더의 한계(고정된 순서의 autoregressive 생성 또는 continuous diffusion /flow matching 헤드의 백본 분리)를 해결하고자 합니다.

#Review #Vision-Language-Action (VLA)#Discrete Diffusion #Action Decoding #Transformer #Robot Control #Masked Modeling #Adaptive Decoding #Reinforcement Learning

2025년 8월 28일

[논문리뷰] From Masks to Worlds: A Hitchhiker's Guide to World Models

이 논문은 '진정한 월드 모델'을 구축하기 위한 명확한 로드맵을 제시하며, 단순한 모델 목록을 나열하는 것을 넘어선다.

#Review #World Models #Generative AI #Multimodal Learning #Masked Modeling #Interactive AI #Memory Systems #Autonomous Agents #AI Roadmap

2025년 10월 24일