Review

[논문리뷰] D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing

본 논문은 D-LLM의 안전성 monitoring 연구가 미흡하며, D-LLM의 오용 가능성이 증대함에 따라 효과적인 방어 메커니즘이 필요하다고 주장합니다.

#Review #Diffusion LLMs #Safety Monitoring #Hesitation-Aware Routing #Probe-based Monitors #Multi-step Trajectory #Sample Difficulty #Efficiency-effectiveness Tradeoff #Adversarial Inputs

2026년 5월 26일

[논문리뷰] Your Embedding Model is SMARTer Than You Think

본 논문은 single-vector multimodal retriever가 rich하고 sequential한 token sequence를 단일 global representation으로 압축하면서 발생하는 근본적인 information bottleneck 문제를 해결하고자 합니다.

#Review #Multimodal Retrieval #Single-Vector Embeddings #Multi-Vector Embeddings #Late Interaction #Information Bottleneck #Hidden States #Contrastive Learning #Plug-and-Play

2026년 5월 25일

[논문리뷰] WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

최근 Interactive World Models의 발전에도 불구하고, 기존의 평가 방식은 단편적이며 체계적인 평가를 위한 통합된 표준이 부재하다.

#Review #Interactive World Models #Video Generation #Benchmark #Multi-turn Interaction #Evaluation Metrics

2026년 5월 25일

[논문리뷰] TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

I was unable to access the content of the provided URL: https://arxiv.org/html/2605.26115.

2026년 5월 25일

[논문리뷰] Toward Native Multimodal Modeling: A Roadmap

본 논문은 기존 Large Language Models (LLMs)이 텍스트 전용 인터페이스에 근본적으로 제한되어 실제 세계의 풍부한 센서리 신호(sensory signals)를 통한 그라운딩(grounding)이 부족하다는 문제의식에서 출발합니다.

#Review #Native Multimodal Modeling #Cross-modal Fusion #Transformer Architectures #Multimodal LLMs #M2M Symmetric Modeling #Mid-Fusion #Early-Fusion

2026년 5월 25일

[논문리뷰] ThriftAttention: Selective Mixed Precision for Long-Context FP4 Attention

I am unable to access the content of the provided URL: https://arxiv.org/html/2605.23081. The browsing tool encountered an error while trying to fetch the page.

2026년 5월 25일

[논문리뷰] SkillEvolBench: Benchmarking the Evolution from Episodic Experience to Procedural Skills

본 논문은 LLM Agents가 실제 작업을 해결하면서 축적하는 풍부한 Episodic Experience가 재사용 가능한 Procedural Skills로 증류될 수 있는지 여부가 불분명하다는 핵심 문제를 제기한다.

#Review #LLM Agents #Procedural Skills #Skill Formation #Episodic Experience #Benchmarking #Skill Evolution #Abstraction Bottleneck #Deployment Transfer

2026년 5월 25일

[논문리뷰] QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

본 논문은 Deep Research Agents의 광범위한 역량을 갖춘 훈련 방식의 불투명성과 기존 Open-weight 모델들의 한계점을 해결하고자 한다.

#Review #Deep Research Agents #Synthetic Data #Rubric Tree #Context Management #Reinforcement Learning #Fact Seeking #Citation Grounding #Report Synthesis

2026년 5월 25일

[논문리뷰] ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

I am sorry, but I was unable to fetch the content of the provided URL: https://arxiv.org/html/2605.20342.

2026년 5월 25일

[논문리뷰] Pantheon360: Taming Digital Twin Generation via 3D-Aware 360° Video Diffusion

I was unable to access the content of the provided URL: https://arxiv.org/html/2605.25449. The browsing tool encountered an error when trying to fetch the page.

2026년 5월 25일

[논문리뷰] On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

제공된 URL https://arxiv.org/html/2605.26105 에서 논문 내용을 가져오는 데 실패했습니다. 현재로서는 해당 논문의 내용을 분석할 수 없어 요청하신 요약 및 Figure 정보 추출 작업을 완료할 수 없습니다. URL 접근에 지속적인 문제가 발생하고 있습니다.

2026년 5월 25일

[논문리뷰] MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing

I am unable to browse the provided URL https://arxiv.org/html/2605.23986. The browsing tool reported an error when trying to fetch the content.

2026년 5월 25일

[논문리뷰] Macaron-A2UI: A Model for Generative UI in Personal Agents

본 논문은 Personal Agent가 복잡하고 사용자 중심적인 Task를 처리함에 따라, 기존의 Static Plain-Text Chat이 병목 현상으로 작용하는 문제를 해결하고자 한다.

#Review #Generative UI #Personal Agents #A2UI #Reinforcement Learning #Supervised Fine-tuning #Dialogue Systems

2026년 5월 25일

[논문리뷰] InstructSAM: Segment Any Instance with Any Instructions

죄송합니다. 제공해주신 논문 URL https://arxiv.org/html/2605.26102에서 내용을 가져오는 데 실패했습니다. 논문을 분석하고 요약하려면 해당 콘텐츠에 접근할 수 있어야 합니다. URL을 다시 확인해 주시거나 다른 접근 가능한 URL을 제공해 주시면 감사하겠습니다.

2026년 5월 25일

[논문리뷰] Helix4D: Complex 4D Mesh Generation

I apologize, but I was unable to access the content of the provided URL: https://arxiv.org/html/2605.26109. The browsing tool encountered an error while trying to fetch the page.

2026년 5월 25일

[논문리뷰] Geometry-Aware Image Flow Matching

기존의 Continuous Normalizing Flows (CNF), Diffusion models (DM), Flow Matching (FM)과 같은 발전된 생성 모델들은 이미지 데이터를 고차원 Euclidean space의 벡터로 간주하는 Euclidean geometry 가정을 기반으로 합니다.

#Review #Flow Matching #Spherical Geometry #Image Generation #Riemannian Manifold #Optimal Transport #Hyperspherical Projection #Generative Models

2026년 5월 25일

[논문리뷰] Foundation Protocol: A Coordination Layer for Agentic Society

I was unable to fetch the content from the provided URL: https://arxiv.org/html/2605.23218. The browsing tool reported an error.

2026년 5월 25일

[논문리뷰] DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning

I am sorry, but I was unable to fetch the content from the provided URL: https://arxiv.org/html/2605.25604. The browsing tool encountered an error when trying to access the page.

2026년 5월 25일

[논문리뷰] ControlLight: Towards Controllable, Consistent, and Generalizable Low-Light Enhancement

I am sorry, but I was unable to fetch the content of the provided URL: https://arxiv.org/html/2605.25569.

2026년 5월 25일

[논문리뷰] Claw-Anything: Benchmarking Always-On Personal Assistants with Broader Access to User's Digital World

현재 Large Language Model(LLM) 기반 agent 시스템은 user의 digital world 중 매우 제한적인 부분에만 접근하여 context-sensitive reasoning과 효과적인 assistance 제공에 심각한 한계를 보입니다.

#Review #Personal Assistant Agents #Benchmark #Context-Aware Reasoning #Multi-device Interaction #Proactive Assistance #Long-horizon Event Streams #LLM Agents #Digital World

2026년 5월 25일