#Text-to-Image Synthesis

6 posts

[Paper Review] StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models

Visual Autoregressive (VAR) models enable high-quality image generation, but they suffer from substantial computational complexity and long runtimes, particularly at the large-scale steps.

#Review #Visual Autoregressive Models #Image Generation #Model Acceleration #Low-Rank Approximation #Semantic Irrelevance #Stage-Aware Optimization #Text-to-Image Synthesis

December 21, 2025

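[Paper Review] DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation

Existing pixel diffusion models use a single Diffusion Transformer (DiT) to model high-frequency signals and low-frequency semantics at the same time, which leads to slow training and inference and to low image quality. This paper aims to solve those problems.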

#Review #Pixel Diffusion #Image Generation #Frequency Decoupling #Diffusion Transformer (DiT) #Flow Matching #AdaLN #Text-to-Image Synthesis

November 24, 2025

[Paper Review] Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification

The paper aims to address the overfitting and reduced diversity that arise when generating synthetic data with text-to-image (T2I) models, and to maximize fine-grained classification performance, particularly in few-shot settings.

#Review #Text-to-Image Synthesis #Synthetic Data Generation #Fine-Grained Classification #Few-Shot Learning #Diffusion Models #Contextual Conditioning #Causal Intervention

November 9, 2025

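[Paper Review] Symbolic Graphics Programming with Large Language Models

This paper explores the ability of large language models (LLMs) to generate symbolic graphics programs (SGPs), specifically Scalable Vector Graphics (SVGs), that render accurate visual content from natural-language descriptions.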

#Review #Symbolic Graphics Programming #Large Language Models #Reinforcement Learning #SVG Generation #Text-to-Image Synthesis #Cross-Modal Alignment #Program Synthesis

September 8, 2025

[Paper Review] OneFlow: Concurrent Mixed-Modal and Interleaved Generation with Edit Flows

This paper aims to overcome two fundamental limitations: the strictly sequential generation of autoregressive (AR) models and the fixed-length generation of diffusion models.

#Review #Non-Autoregressive #Multimodal Generation #Edit Flows #Flow Matching #Interleaved Generation #Text-to-Image Synthesis #Unified Models

October 8, 2025

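[Paper Review] Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling

This study addresses the problem that inference-time scaling (search) strategies, which have been successful in large language models (LLMs), show only limited benefit in diffusion models, which operate in continuous latent spaces.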

#Review #Visual Autoregressive Models #Diffusion Models #Inference Time Scaling #Beam Search #Image Generation #Text-to-Image Synthesis #Discrete Latent Space

October 21, 2025