본문으로 건너뛰기

#Unified Multimodal Models

16개의 포스트

[논문리뷰] Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs

댓글 수 로딩 중

[논문리뷰] Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

댓글 수 로딩 중

[논문리뷰] Steering Visual Generation in Unified Multimodal Models with Understanding Supervision

댓글 수 로딩 중

[논문리뷰] InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

댓글 수 로딩 중

[논문리뷰] UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

댓글 수 로딩 중

[논문리뷰] Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models

댓글 수 로딩 중

[논문리뷰] UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

댓글 수 로딩 중

[논문리뷰] TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

댓글 수 로딩 중

[논문리뷰] Architecture Decoupling Is Not All You Need For Unified Multimodal Model

댓글 수 로딩 중

[논문리뷰] Reconstruction Alignment Improves Unified Multimodal Models

댓글 수 로딩 중

[논문리뷰] SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

댓글 수 로딩 중