#Direct Preference Optimization

22 posts

[Paper Review] Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

[Paper Review] JavisDiT++: Unified Modeling and Optimization for Joint Audio-Video Generation

[Paper Review] Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

[Paper Review] EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

[Paper Review] PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

[Paper Review] MPJudge: Towards Perceptual Assessment of Music-Induced Paintings

[Paper Review] Reg-DPO: SFT-Regularized Direct Preference Optimization with GT-Pair for Improving Video Generation

[Paper Review] MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning

[Paper Review] Phi-Ground Tech Report: Advancing Perception in GUI Grounding

[Paper Review] PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

[Paper Review] Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization
