#Inference-time Scaling

2개의 포스트

[논문리뷰] Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

본 논문은 Agentic Search 환경에서 기존 Retriever의 경직성이 전체 시스템 성능의 병목 현상(bottleneck)을 유발한다는 점을 해결하고자 합니다 . 기존 연구들은 주로 Reasoning Agent만을 최적화하거나, Retriever를 고정된 블랙박스로 간주하는 한계를 보입니다.

#Review #Agentic Search #Retrieval-Augmented Generation #Instruction-tuned Retriever #Inference-time Scaling #Contrastive Learning #Introspective Feedback

2026년 6월 7일

[논문리뷰] The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs

본 연구는 고정된 컴퓨팅 자원 환경에서 LLM의 추론 성능을 극대화하기 위한 효율적인 예산 배분 문제를 해결합니다. 기존의 Uniform 정책은 모든 쿼리에 동일한 토큰 제한을 부여함으로써, 쉬운 문제에는 자원을 낭비하고 어려운 문제에는 성능 발휘에 필요한 충분한 자원을 제공하지 못하는 한계가 있습니다.

#Review #Inference-time Scaling #Budget Allocation #Shadow Price #Lambert W Function #Rational Abandonment #LLM Reasoning #Compute-Utility Equilibrium

2026년 6월 4일