본문으로 건너뛰기

secrett2633's blog

카테고리

Python

PEP (650)

AI/ML

Review (3569)

OpenSource

PR Analysis (761)
vLLM (71)
SGLang (130)
llm-compressor (45)

Python

PEP (650)

AI/ML

Review (3569)

OpenSource

PR Analysis (761)
vLLM (71)
SGLang (130)
llm-compressor (45)

홈
#On-Policy Learning

#On-Policy Learning

2개의 포스트

[논문리뷰] Online Experiential Learning for Language Models

arXiv에 게시된 'Online Experiential Learning for Language Models' 논문에 대한 자세한 리뷰입니다.

#Review #Online Experiential Learning (OEL)#Context Distillation #Language Models #Reward-Free Learning #Catastrophic Forgetting #Token Efficiency #On-Policy Learning

2026년 3월 17일댓글 수 로딩 중

[논문리뷰] On-Policy Self-Distillation for Reasoning Compression

Zhipeng Wang이 arXiv에 게시한 'On-Policy Self-Distillation for Reasoning Compression' 논문에 대한 자세한 리뷰입니다.

#Review #Reasoning Compression #Self-Distillation #On-Policy Learning #Large Language Models #Mathematical Reasoning #Knowledge Distillation #Efficient Inference

2026년 3월 5일댓글 수 로딩 중

AI Review Python PEP PR Analysis RSS GitHub

© 2026 secrett2633. All rights reserved.