본문으로 건너뛰기

#Reasoning Tasks

14개의 포스트

[논문리뷰] KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

댓글 수 로딩 중

[논문리뷰] CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

댓글 수 로딩 중

[논문리뷰] Surgical Post-Training: Cutting Errors, Keeping Knowledge

댓글 수 로딩 중

[논문리뷰] From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

댓글 수 로딩 중

[논문리뷰] Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

댓글 수 로딩 중

[논문리뷰] MEENA (PersianMMMU): Multimodal-Multilingual Educational Exams for N-level Assessment

댓글 수 로딩 중

[논문리뷰] Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

댓글 수 로딩 중

[논문리뷰] Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting

댓글 수 로딩 중