#In-context Learning

5 posts

[Paper Review] Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

This paper proposes a new training framework for extreme low-resource language translation in which the model acquires a language-independent meta-skill instead of memorizing a specific language.

#Review #Low-resource Translation #Reinforcement Learning #In-context Learning #Meta-skill #Language-independent Learning #Meta-linguistic Reasoning

June 4, 2026

[Paper Review] Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

This paper aims to address the high configuration cost and static capabilities of existing LLM agent frameworks.

#Review #LLM Agents #Automated Agent Generation #Reinforcement Learning #Hybrid Policy Optimization #Tool Synthesis #In-context Learning #Agent Framework #Scalability

January 4, 2026

[Paper Review] Nested Learning: The Illusion of Deep Learning Architectures

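This paper seeks to overcome the limitations that existing deep learning models, especially large language models (LLMs), face in continual learning, self-improvement, and effective problem solving. To this end, it proposes Nested Learning (NL), a new learning paradigm that interprets a machine learning model as a set of nested, multi-level optimization problems.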

#Review #Nested Learning #Continual Learning #In-context Learning #Associative Memory #Multi-Timescale Memory #Self-Modifying Models #Optimizers

January 4, 2026

[Paper Review] Meta-RL Induces Exploration in Language Agents

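This paper addresses the difficulty that existing reinforcement learning (RL)-based large language model (LLM) agents have with actively exploring their environment and efficiently adapting their policy from trial-and-error experience.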

#Review #Meta-RL #LLM Agents #Exploration #Reinforcement Learning #Policy Adaptation #In-context Learning #Self-reflection #Multi-episode tasks

December 21, 2025

[Paper Review] DeContext as Defense: Safe Image Editing in Diffusion Transformers

This paper aims to address serious privacy concerns in large-scale Diffusion Transformer (DiT)-based image editing models.

#Review #Diffusion Transformers #Image Editing #Privacy Protection #Adversarial Attack #Attention Mechanism #Identity Preservation #Deepfake Defense #In-context Learning

December 18, 2025