Latest Posts

[sglang] SGLang MoE All-Reduce Optimization: Cutting Latency by 50% with NCCL Symmetric Memory

Optimizes all-reduce performance in SGLang MoE layers with NCCL symmetric memory, improving TPOT by 6.55%.

#SGLang #MoE #AllReduce #NCCL #SymmetricMemory #DeepSeek-V4 #PerformanceOptimization #GPU

July 15, 2026

[Paper Review] Vinci2: Providing Proactive Assistance in Continuous Egocentric Videos

This paper points out that existing egocentric assistants remain limited to a passive reactive mode or a semi-proactive mode that responds only when specific events occur.

#Review #Egocentric Video #Proactive Assistance #Retrieval-Augmented Reasoning #Streaming Memory #Video-LLM #Benchmarking

July 15, 2026

[Paper Review] Tracing Agentic Failure from the Flow of Success

To address the cost and inefficiency of automatically diagnosing failures in LLM-based agent systems, this paper proposes Oat.

#Review #LLM Agents #Failure Attribution #Unsupervised Learning #Neural CDE #One-Class Learning #Anomaly Detection #Agentic Systems

July 15, 2026

[Paper Review] ShortOPD: Recovering Pruned LLMs with Short-to-Long On-Policy Distillation

This paper addresses a phenomenon in which LLMs with structured pruning applied maintain performance on multiple-choice benchmarks but collapse severely on the free-form generation required in real deployment.

#Review #Structured Pruning #On-Policy Distillation #LLM Compression #Model Recovery #Repetition Control #Token-level Supervision

July 15, 2026

[Paper Review] Self-Improvements in Modern Agentic Systems: A Survey

This paper provides a systematic analysis of how modern agentic systems can expand their own capabilities through experience while minimizing human intervention. Prior work has focused on individual improvement techniques, but a unified framework covering them has been lacking.

#Review #Agentic Systems #Self-Improvement #Foundation Model #Scaffolding #Meta-Learning #Autonomous Agents

July 15, 2026

[Paper Review] Ring-Zero: Scaling Zero RL to a Trillion Parameters for Emergent Reasoning

This paper highlights the scalability of Zero RL as a key driver for large language models to move beyond merely memorizing information and acquire advanced logical reasoning ability.

#Review #Zero RL #Trillion Parameters #Emergent Reasoning #Reinforcement Learning #Scalability #LLM

July 15, 2026

[Paper Review] Registers Matter for Pixel-Space Diffusion Transformers

This paper notes that, while register tokens resolve the long-standing problem of high-norm patch-token outliers in ViTs, their specific role and effect in DiTs remain underexplored.

#Review #Diffusion Transformers #Register Tokens #Pixel-Space #Feature Norms #Attention Sinks #Dual-Stream Architecture

July 15, 2026

[Paper Review] PolicyShiftGuard: Benchmarking and Improving Policy-Adaptive Image Guardrails

This paper addresses the problem that existing image guardrails operate only under a fixed safety policy and lack the policy flexibility required in real industrial settings.

#Review #Image Guardrail #Policy-Adaptive #PolicyShiftBench #PolicyShiftGuard #Boundary-Pair Policy Adaptation #Multimodal Safety

July 15, 2026

[Paper Review] PalmClaw: A Native On-Device Agent Framework for Mobile Phones

This paper aims to overcome the limitations of the GUI-based operation that existing mobile agents mainly rely on, and to build a more efficient and controllable agent framework for mobile devices.

#Review #Mobile Agent #On-Device #LLM Agent #Device Tools #Execution Boundary #Agent Framework

July 15, 2026

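[Paper Review] OvisOCR2 Technical Report

This paper proposes OvisOCR2 to address the complex deployment structure and stage-by-stage error accumulation of pipeline models, the conventional approach to document parsing. Conventional pipelines separate stages such as layout analysis, content recognition, and page merging, which lowers efficiency and lets an error in one stage propagate to later stages.

#Review #End-to-End Document Parsing #Markdown Serialization #Multimodal Large Language Model #Reinforcement Learning #On-policy Distillation #OvisOCR2

July 15, 2026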

[Paper Review] MetaView: Monocular Novel View Synthesis with Scale-Aware Implicit Geometry Priors

This paper addresses the structural inconsistency and scale drift that affect existing NVS methods. Existing approaches based on explicit reconstruction guarantee local consistency, but their complex reconstruction pipelines limit generalization under large viewpoint changes.

#Review #Monocular Novel View Synthesis #Diffusion Models #Implicit Geometry Priors #Scale-Awareness #Camera Control #MM-DiT

July 15, 2026

[Paper Review] KnowAct-GUIClaw: Know Deeply, Act Perfectly, Personal GUI Assistant with Self-Evolving Memory and Skill

This paper addresses the structural limitations that existing OpenClaw-family agents face when automating complex tasks in GUI environments. Existing approaches lack cross-platform compatibility and have no mechanism for improving performance through continual learning, which makes it hard for them to adapt to diverse device environments.

#Review #GUI Agents #Personal Assistant #Self-Evolving Memory #Skill Library #Cross-Platform Interaction #POMDP #Task Decomposition

July 15, 2026

[Paper Review] Harness Handbook: Making Evolving Agent Harnesses Readable, Navigable, and Editable

This paper aims to address the difficulty of Behavior Localization caused by the structural complexity of large-scale Agent Harnesses.

#Review #Agent Harness #Behavior Localization #Static Program Analysis #LLM-assisted Behavioral Structuring #Behavior-Guided Progressive Disclosure #Software Engineering

July 15, 2026

[Paper Review] Hallo4D: Multi-Modal Hallucination Mitigation for Consistent Spatio-Temporal Generation

This paper aims to address the spatial and temporal inconsistency (hallucination) that arises in 3D and 4D content generation.

#Review #3D Generation #4D Generation #Spatio-temporal Consistency #Multi-Modal Reasoning #Diffusion Models #Hallucination Mitigation

July 15, 2026

[Paper Review] GigaWorld-Policy-0.5: A Faster and Stronger WAM Empowered by AutoResearch

This paper aims to address the high computational overhead and the limits on real-time control that arise because existing WAM approaches require explicit future video generation at inference time.

#Review #World Action Models #Robot Control #Mixture-of-Transformers #AutoResearch #Inference Latency #Flow Matching #Visual Dynamics

July 15, 2026

[Paper Review] From Noisy Traces to Root Causes: Structural Trajectory Analysis and Causal Extraction for Agent Optimization

This paper addresses the context-noise problem that arises in long-horizon agent optimization.

#Review #Agent Optimization #Causal Localization #Execution Dependency Graph #Failure Pattern Mining #Structural Trajectory Analysis #Context-Noise Trade-off

July 15, 2026

[Paper Review] From Controlled to the Wild: Evaluation of Pentesting Agents for the Real-World

This paper addresses the limitation that existing cybersecurity benchmarks are confined to overly restricted settings (e.g., Capture-the-Flag) and therefore fail to evaluate complex attack surfaces and strategic exploration ability in real-world environments.

#Review #AI Pentesting Agents #Vulnerability Discovery #Evaluation Protocol #Ground-Truth Matching #Stochasticity #Agentic Workflow

July 15, 2026

[Paper Review] Boogu-Image-0.1: Boosting Open-Source Unified Multimodal Understanding and Generation

This work addresses the gap in which existing open-source generative models, compared with commercial frontier models, lack the Understanding ability needed to interpret complex intent.

#Review #Unified Multimodal #Text-to-Image #Agentic Inference #Data Curation #Diffusion Transformer #Instruction-Driven Generation

July 15, 2026

[Paper Review] AgentCompass: A Unified Evaluation Infrastructure for Agent Capabilities

This paper addresses the problem that the infrastructure for evaluating LLM-based agents is extremely fragmented and tangled. Existing evaluation approaches are locked into specific domains, or tightly couple the execution environment with the evaluation protocol, which undermines reproducibility and incurs repeated engineering costs.

#Review #LLM-based Agents #Evaluation Infrastructure #Benchmarking #Trajectory Analysis #Agentic Capabilities #Reproducibility

July 15, 2026

[Paper Review] AffectFlow-DINO: Uncertainty-Aware Multi-Task Affect Estimation via Conditional Rectified Flow

To address the inherent ambiguity of the data and the uncertainty of expression in in-the-wild affect analysis, this paper proposes AffectFlow-DINO.

#Review #Affective Computing #Conditional Rectified Flow #Multi-Task Learning #Uncertainty-Aware #DINOv3 #Facial Affect Estimation #ABAW Challenge

July 15, 2026