본문으로 건너뛰기

#Post-Training

16개의 포스트

[논문리뷰] WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training

댓글 수 로딩 중

[논문리뷰] Watch Before You Answer: Learning from Visually Grounded Post-Training

댓글 수 로딩 중

[논문리뷰] DARE: Diffusion Large Language Models Alignment and Reinforcement Executor

댓글 수 로딩 중

[논문리뷰] Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

댓글 수 로딩 중

[논문리뷰] PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards

댓글 수 로딩 중

[논문리뷰] Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models

댓글 수 로딩 중

[논문리뷰] What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards

댓글 수 로딩 중

[논문리뷰] P1: Mastering Physics Olympiads with Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Towards a Unified View of Large Language Model Post-Training

댓글 수 로딩 중

[논문리뷰] SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

댓글 수 로딩 중

[논문리뷰] Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

댓글 수 로딩 중