#LLM Rollout

1 post

[Paper Review] EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

This paper sets out to solve the chronic latency problem of rollout generation that arises during RL training of LLMs.

#Review #Reinforcement Learning #Speculative Decoding #Self-Speculative Decoding #LLM Rollout #System-Aware #Quantization

June 17, 2026