Review

[논문리뷰] OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

본 논문은 실시간 환경에서 활동하는 멀티모달 에이전트가 단편적인 현재 시점의 정보가 아닌, 시간 흐름에 따른 공간적 구조를 지속적으로 유지하고 추론해야 한다는 도전 과제를 해결하고자 합니다.

#Review #Multimodal LLMs #Streaming Spatial Intelligence #Egocentric Video #Hierarchical Benchmark #Spatiotemporal Reasoning #Allocentric Mapping

2026년 6월 3일

[논문리뷰] MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh Generation

본 논문은 기존의 autoregressive 메시 생성 모델들이 겪고 있는 낮은 토큰화 효율성과 기하학적 정보 부족 문제를 해결하고자 한다.

#Review #Autoregressive Mesh Generation #Sparse-Voxel Encoder #Surface Weaving #Vertex-Level Tokenization #Geometry-Aware Guidance #3D Deep Learning

2026년 6월 3일

[논문리뷰] MemTrain: Self-Supervised Context Memory Training

본 논문은 장기적인 컨텍스트를 처리해야 하는 LLM 에이전트에서 Memory 병목 현상을 해결하는 것을 핵심 문제로 다룹니다 . 기존의 연구들은 전체 입력 기록을 컨텍스트에 모두 포함시키는 방식을 사용했으나, 이는 계산 비용의 급격한 증가를 초래합니다.

#Review #LLM Agents #Context Memory #Self-Supervised Learning #Reinforcement Learning #GRPO #Long-Horizon Reasoning

2026년 6월 3일

[논문리뷰] MapAgent: An Industrial-Grade Agentic Framework for City-scale Lane-level Map Generation

본 연구는 고정밀 Lane-level Map의 자동 생성 및 유지보수 과정에서 발생하는 사양 준수 오류를 해결하기 위한 Agentic Framework를 제안한다.

#Review #Map Generation #Map Agent #Vision-Language Models #Lane-level Mapping #Agentic Framework #Vectorization #Constraint Verification

2026년 6월 3일

[논문리뷰] MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

본 논문은 웹상의 방대한 절차적 지식을 에이전트가 실행 가능한 Skill로 활용하지 못하는 근본적인 문제(procedural grounding 부족)를 해결한다.

#Review #Guide-to-Skill Learning #Vision-Language Model (VLM) Agents #Closed-Loop Framework #Procedural Grounding #In-the-Wild Guides #Trajectory-Driven Revision

2026년 6월 3일

[논문리뷰] M^3Eval: Multi-Modal Memory Evaluation through Cognitively-Grounded Video Tasks

본 논문은 현존하는 많은 멀티모달 모델이 짧은 컨텍스트 내의 정보 이해에는 능숙하지만, 복잡한 비디오 시퀀스에서 장기적인 기억을 유지하는 데는 심각한 한계가 있다는 점을 지적합니다. 기존 벤치마크들은 주로 단기적 정보 인식에 치중되어 있어, 인간처럼 긴 시간 동안 사건을 축적하고 재구성하는 능력을 측정하기 어렵습니다.

#Review #Multi-Modal Memory #Video Understanding #Benchmark #Cognitive Science #Long-term Memory

2026년 6월 3일

[논문리뷰] KletterMix: Climbing Toward High-Quality German Pretraining Data

본 논문은 독일어 프리트레이닝 데이터의 낮은 품질과 부족한 문서화 문제를 해결하기 위해 KletterMix를 제안합니다. 기존의 독일어 데이터는 노이즈가 많은 웹 크롤링 기반이거나, 데이터 구조와 문서화 수준이 영어권 데이터에 비해 현저히 낮아 모델 성능 개선에 한계가 있었습니다.

#Review #Pretraining Data #German Corpus #Machine Translation #Dataset Construction #Quality Estimation #COMETKiwi #Language Modeling

2026년 6월 3일

[논문리뷰] GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors

본 논문은 휴머노이드 로봇의 loco-manipulation 정책을 학습시키기 위한 데이터 수집의 높은 비용과 확장성 문제를 해결하고자 한다.

#Review #Humanoid #Loco-Manipulation #Video Foundation Models #3D Assets #Sim-to-Real #HOI Reconstruction

2026년 6월 3일

[논문리뷰] Functional Attention: From Pairwise Affinities to Functional Correspondences

본 논문은 기존의 Transformer 기반 연산자 학습 모델이 가진 비효율성과 구조적 한계를 극복하기 위해 제안되었다.

#Review #Operator Learning #Functional Attention #Functional Maps #PDE Solving #Spectral Domain #Adaptive Basis #Resolution Invariance

2026년 6월 3일

[논문리뷰] Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation

본 논문은 기존 OPD가 가진 불균일한 학습 가치 문제를 해결하기 위해 최적화 Granularity를 재설계하고자 합니다. 기존 연구들은 단순히 전체 trajectory를 사용하거나, 개별 토큰을 선별하는 Hard selection 방식에 의존하여 정보 손실과 최적화의 불안정성을 초래했습니다 .

#Review #On-Policy Distillation #Knowledge Distillation #Optimization Granularity #Trajectory Filtering #Token Reweighting #Large Language Models

2026년 6월 3일

[논문리뷰] Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

본 연구는 기존의 정적인 단일 턴(Single-turn) 의학 벤치마크가 복잡하고 역동적인 실제 임상 환경을 충분히 반영하지 못한다는 한계를 해결하고자 한다.

#Review #MedSP1000 #Clinical Agents #Standardized Patients #ACGME Competencies #Dynamic Clinical Decision-Making #LLM Evaluation

2026년 6월 3일

[논문리뷰] Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching

본 논문은 MLLM이 물리적 환경에서 복잡한 공간 추론을 수행하기 위해 필수적인 Wide-Baseline Matching 능력을 체계적으로 학습하고 평가할 프레임워크가 부족하다는 점을 문제로 지적합니다.

#Review #Multimodal Large Language Models #Spatial Reasoning #Wide-Baseline Matching #Reinforcement Learning #Curriculum Learning #Vision-Language Benchmarks

2026년 6월 3일

[논문리뷰] Economy of Minds: Emerging Multi-Agent Intelligence with Economic Interactions

본 논문은 중앙 집중식 제어 없이도 다중 에이전트 시스템이 자율적으로 협력하고 고도의 지능을 갖출 수 있는 방법을 탐구합니다. 기존의 중앙 집중식 오케스트레이션은 모든 정보를 단일 게이트웨이로 처리해야 하므로 성능 병목 현상이 발생하고, 시스템 규모가 커짐에 따라 좌표화 복잡도가 기하급수적으로 증가하는 한계가 있습니다.

#Review #Multi-Agent System #Economic Interaction #Decentralized Coordination #Credit Assignment #Large Language Models #Agentic Intelligence #Self-Organization

2026년 6월 3일

[논문리뷰] Echo-Infinity: Learning Evolving Memory for Real-Time Infinite Video Generation

본 논문은 autoregressive 비디오 생성 모델에서 발생하는 무제한적인 KV-cache 성장과 시간적 위치 임베딩 오버플로우 문제를 해결하는 것을 목적으로 한다.

#Review #Autoregressive Video Generation #KV-cache #Memory Queries #RoPE #Long-term Consistency

2026년 6월 3일

[논문리뷰] Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs

본 논문은 UMMs에서 수행된 텍스트 기반 지식 편집(Knowledge Editing)이 이미지 생성 과정으로 적절히 전이되는지 검증하고자 합니다 . 기존의 텍스트 도메인 지식 편집 기법들은 텍스트 출력에서는 높은 성공률을 보이지만, 동일한 수정이 시각적 생성 결과로까지 일관되게 이어지는지는 불명확합니다.

#Review #Unified Multimodal Models #Knowledge Editing #Cross-Modal Transfer #Visual Generation #UniKE #Reasoning-Augmented

2026년 6월 3일

[논문리뷰] Deep Embedded Multiplicative DMD for Algebra-Preserving Koopman Learning

본 논문은 Koopman 연산자 학습 시 고차원 시스템에서의 표현력 문제와 대수적 구조 보존 사이의 상충 관계를 해결하고자 합니다.

#Review #Koopman Operator #Dynamic Mode Decomposition #Deep Learning #Algebra-Preserving #Autoencoder #Manifold Learning

2026년 6월 3일

[논문리뷰] DAR: Deontic Reasoning with Agentic Harnesses

본 논문은 LLM 기반의 Deontic Reasoning에서 발생하는 긴 법령의 구조적 복잡성과 정보 검색의 비효율성 문제를 해결하고자 합니다. 기존 Direct Reasoning 방식은 대규모 규정집을 한 번에 컨텍스트로 제공해야 하므로, 모델이 정작 필요한 규칙을 놓치거나 잘못 참조하는 한계가 있습니다 .

#Review #Deontic Reasoning #Agentic Harness #LLM #DeonticBench #Statutory Reasoning #Tool Use #Inference Efficiency

2026년 6월 3일

[논문리뷰] Cosmos 3: Omnimodal World Models for Physical AI

Physical AI 에이전트 학습을 위한 기존의 파편화된 파이프라인은 이해(Understanding)와 생성(Generation) 모듈이 분리되어 있어 데이터 효율성과 확장성이 낮습니다.

#Review #World Model #Physical AI #Mixture-of-Transformers #Omnimodal #Data-Driven Specialization #Synthetic Data #Action-Conditioned Generation

2026년 6월 3일

[논문리뷰] BraveGuard: From Open-World Threats to Safer Computer-Use Agents

본 논문은 컴퓨터 사용 에이전트가 직면한 고유한 보안 취약점인 '다단계 실행 궤적의 불투명성' 문제를 해결하는 것을 목적으로 합니다.

#Review #Computer-use Agents #Safety Guardrails #Trajectory-level Supervision #Open-world Threat Discovery #Self-evolving Defense #Agent Security

2026년 6월 3일

[논문리뷰] Benchmarks are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems

본 논문은 기존의 LLM 에이전트 평가 방식이 정적이고 단기적인 작업에 치중되어 있어, 실제 프로덕션 환경에서 요구되는 복잡한 장기 워크플로우를 반영하지 못하는 문제를 해결하고자 합니다.

#Review #Agentic Models #Runtime Assessment #Software Engineering #Long-horizon Workloads #Compiler Construction #Resurrection Protocol #Production Systems

2026년 6월 3일