#LLM Orchestration

4 posts

[Paper Review] When Does Combining Language Models Help? A Co-Failure Ceiling on Routing, Voting, and Mixture-of-Agents Across 67 Frontier Models

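This paper argues that the potential accuracy gain from combining LLMs (Routing, Voting, Mixture-of-Agents) is far lower than commonly assumed. In practice, the error correlation between models, $\rho$, has served as the guiding metric: when $\rho$ is low, combining diverse models has been judged effective.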

#Review #LLM Orchestration #Model Routing #Co-failure Ceiling #Error Correlation #Mixture-of-Agents #Inference Economics

June 25, 2026

[Paper Review] SciOrch: Learning to Orchestrate Expert LLMs for Solving Frontier Multimodal Scientific Reasoning Tasks

This paper aims to overcome the limitation that a single commercial LLM system fails to reach expert-level performance in frontier multimodal scientific reasoning.

#Review #Multimodal Scientific Reasoning #LLM Orchestration #MCTS #Reinforcement Learning #Expert Model Delegation #Agentic Workflow

June 17, 2026

[Paper Review] Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

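This paper identifies the root cause of the instability that arises during reinforcement learning (RL) post-training of multi-agent LLM systems and proposes a new method that resolves it, enabling stable training.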

#Review #Multi-Agent LLM #Reinforcement Learning #Training Stability #GRPO #Agent-wise Normalization #Gradient Explosion #LLM Orchestration

February 10, 2026

[Paper Review] A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification

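This paper addresses the increased inference latency, VRAM usage, and operational complexity that result from running separate models for safety moderation and other classification tasks in production LLM systems.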

#Review #LLM Orchestration #Lightweight Probes #Token-Layer Aggregation #Hidden States #Single-Pass Classification #Safety Moderation #Sentiment Analysis

January 20, 2026