#Meta-learning

4개의 포스트

[논문리뷰] Hyperagents

기존 Self-improving AI 시스템은 대부분 고정된 Meta agent에 의존하여 Self-improvement 메커니즘의 개선에 근본적인 한계가 있었습니다.

#Review #Hyperagents #Metacognitive Self-modification #Self-improving AI #Open-ended Exploration #Darwin Gödel Machine #Meta-learning #Robotics Reward Design #Olympiad-level Math Grading

2026년 3월 22일

[논문리뷰] GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent

최근 Large Language Model(LLM) 애플리케이션들은 방대한 외부 컨텍스트에 의존하며, 이는 KV-cache 와 같은 방식으로 처리될 경우 상당한 메모리 오버헤드를 발생시킵니다.

#Review #GradMem #Test-Time Optimization #Context Removal #Compressive Memory #Meta-learning #Gradient Descent #LLM #KV-retrieval

2026년 3월 17일

[논문리뷰] AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance

대규모 언어 모델(LLM)이 추론 태스크에서 겪는 catastrophic forgetting 및 모방(imitation) 과 탐색(exploration) 간의 최적화되지 않은 트레이드오프 문제를 해결하는 것이 목표입니다.

#Review #Large Language Models #Fine-tuning #Reinforcement Learning #Meta-learning #Adaptive Control #Imitation Learning #Exploration #Reasoning

2025년 8월 14일

[논문리뷰] Estimating Time Series Foundation Model Transferability via In-Context Learning

이 논문은 증가하는 시계열 파운데이션 모델(TSFM) 중에서 특정 하위 태스크에 가장 적합한 모델을 효율적으로 식별하는 문제를 해결하고자 합니다.

#Review #Time Series Foundation Models #Transferability Estimation #In-Context Learning #Tabular Foundation Models #Model Selection #Entropy Profile #Meta-learning #Forecasting

2025년 10월 1일