#Self-Evolving LLMs

2개의 포스트

[논문리뷰] TTCS: Test-Time Curriculum Synthesis for Self-Evolving

TTCS는 대규모 언어 모델(LLM)이 테스트 질문만 사용하여 추론 능력을 향상시키는 기존 Test-Time Training(TTT) 방법론의 한계를 극복하고자 합니다.

#Review #Test-Time Training #Self-Evolving LLMs #Curriculum Learning #Reinforcement Learning #Question Synthesis #Mathematical Reasoning #GRPO

2026년 2월 1일

[논문리뷰] Guided Self-Evolving LLMs with Minimal Human Supervision

본 논문은 기존의 자율 진화(self-evolving) 언어 모델(LLM)이 겪는 불안정성, 성능 정체, 개념 표류(concept drift) 및 다양성 붕괴(diversity collapse) 문제를 해결하고자 합니다.

#Review #Self-Evolving LLMs #Self-Play #Reinforcement Learning #Curriculum Learning #Few-shot Learning #Human Supervision #Concept Drift #Diversity Collapse

2025년 12월 2일