본문으로 건너뛰기

#Text-to-Image Generation

57개의 포스트

[논문리뷰] UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models

댓글 수 로딩 중

[논문리뷰] Personalizing Text-to-Image Generation to Individual Taste

댓글 수 로딩 중

[논문리뷰] CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

댓글 수 로딩 중

[논문리뷰] LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model

댓글 수 로딩 중

[논문리뷰] TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

댓글 수 로딩 중

[논문리뷰] Understanding vs. Generation: Navigating Optimization Dilemma in Multimodal Models

댓글 수 로딩 중

[논문리뷰] Rethinking Global Text Conditioning in Diffusion Transformers

댓글 수 로딩 중

[논문리뷰] Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

댓글 수 로딩 중

[논문리뷰] CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

댓글 수 로딩 중

[논문리뷰] VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

댓글 수 로딩 중

[논문리뷰] GARDO: Reinforcing Diffusion Models without Reward Hacking

댓글 수 로딩 중

[논문리뷰] Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

댓글 수 로딩 중

[논문리뷰] SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

댓글 수 로딩 중

[논문리뷰] RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

댓글 수 로딩 중

[논문리뷰] DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation

댓글 수 로딩 중

[논문리뷰] Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield

댓글 수 로딩 중

[논문리뷰] UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

댓글 수 로딩 중

[논문리뷰] Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions

댓글 수 로딩 중

[논문리뷰] Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation

댓글 수 로딩 중

[논문리뷰] MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

댓글 수 로딩 중

[논문리뷰] FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

댓글 수 로딩 중

[논문리뷰] Interleaving Reasoning for Better Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

댓글 수 로딩 중

[논문리뷰] Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models

댓글 수 로딩 중

[논문리뷰] HPSv3: Towards Wide-Spectrum Human Preference Score

댓글 수 로딩 중

[논문리뷰] MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

댓글 수 로딩 중

[논문리뷰] PairUni: Pairwise Training for Unified Multimodal Language Models

댓글 수 로딩 중

[논문리뷰] UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset

댓글 수 로딩 중

[논문리뷰] EchoDistill: Bidirectional Concept Distillation for One-Step Diffusion Personalization

댓글 수 로딩 중

[논문리뷰] SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

댓글 수 로딩 중

[논문리뷰] Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation

댓글 수 로딩 중

[논문리뷰] Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation

댓글 수 로딩 중

[논문리뷰] UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation

댓글 수 로딩 중