Review

[논문리뷰] Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback

본 논문은 현대의 T2I 모델이 생성하는 이미지의 국소적이고 미묘한 결함을 효과적으로 진단하고 해결하지 못하는 기존 scalar 기반 평가 방식의 한계를 해결하고자 합니다.

#Review #Text-to-Image #Structured Defect Grounding #VLM #Diffusion Model Alignment #Reinforcement Learning #BoxFlow-GRPO #Dataset

2026년 6월 11일

[논문리뷰] WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces

기존의 컴퓨터 에이전트 평가 벤치마크는 주로 단일 브라우저 기반 작업에 국한되어 있어, 실제 데스크톱 환경의 복잡한 Long-Horizon 작업 수행 능력을 평가하는 데 한계가 있습니다.

#Review #Computer-Use Agent #Long-Horizon #Real-World Benchmark #Hybrid Interface #Human-Computer Interaction #Agent Evaluation

2026년 6월 11일

[논문리뷰] WEAVER, Better, Faster, Longer: An Effective World Model for Robotic Manipulation

본 논문은 기존 월드 모델들이 복잡한 매니퓰레이션 태스크를 수행할 때 겪는 High Latency와 Context Length의 제한 문제를 해결하고자 한다.

#Review #World Model #Robotic Manipulation #Autoregressive Inference #Transformer #Efficiency #Generative Modeling

2026년 6월 11일

[논문리뷰] Visual Para-Thinker++: A Single-Policy Multi-Agent Framework for Visual Reasoning

본 논문은 기존의 단일 체인 추론(Single-chain Reasoning) 방식이 시각적 추론 과정에서 범하는 조기 지각적 확신(Early Perceptual Commitment)과 환각(Hallucination) 문제를 해결하기 위해 고안되었습니다.

#Review #Multimodal Large Language Models #Multi-Agent Framework #Visual Reasoning #Role-Decoupled Optimization #Inference Efficiency

2026년 6월 11일

[논문리뷰] VideoMDM: Towards 3D Human Motion Generation From 2D Supervision

본 연구는 3D Human Motion 데이터셋의 희소성과 구축 비용 문제를 극복하기 위해 2D 영상으로부터 3D 모션을 생성하는 새로운 접근 방식을 제안합니다.

#Review #3D Human Motion Generation #Diffusion Models #2D Supervision #Motion Synthesis #Video Analysis

2026년 6월 11일

[논문리뷰] VIA-SD: Verification via Intra-Model Routing for Speculative Decoding

본 논문은 기존의 Speculative Decoding이 가진 이분법적(accept 또는 full recompute) 검증 구조의 한계를 극복하고자 합니다.

#Review #Speculative Decoding #Hierarchical Verification #Intra-Model Routing #KL Divergence #LLM Inference #Efficiency #Slim-Verifier

2026년 6월 11일

[논문리뷰] TreeSeeker: Tree-Structured Trial, Error, and Return in Deep Search

본 논문은 복잡한 Deep Search 과정에서 에이전트가 단일 선형 궤적을 맹목적으로 따르거나, 체계적인 규칙 없이 분기를 탐색하여 예산을 낭비하는 문제를 해결합니다.

#Review #Deep Search #Tree-Structured Search #Tree-Search #TreeMem #Textual UCB #Branch-and-Return #Agentic Workflow

2026년 6월 11일

[논문리뷰] ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs

본 논문은 LLM의 도구 사용 능력을 평가할 때 기존의 End-to-End 방식이 모델의 내부 지식(Parametric Knowledge)과 추론 능력을 명확히 구분하지 못하는 한계를 해결하기 위해 제안되었습니다.

#Review #LLM #Tool Learning #Parametric Knowledge #Diagnostic Framework #Tool Auditing #Evaluation

2026년 6월 11일

[논문리뷰] Surflo: Consistent 3D Surface Flow Model with Global State

본 연구는 기존의 3D Scene Flow 추정 방식이 가지는 프레임 간의 기하학적 불일치 문제를 해결하는 것을 목표로 합니다. 기존 모델들은 주로 독립적인 프레임 페어 간의 대응 관계를 찾는 데 집중하여, 연속적인 시간 흐름 속에서 누적 오차가 발생하거나 장면의 표면 구조를 왜곡시키는 한계가 있습니다.

#Review #3D Scene Flow #Surface Flow #Global State #Point Cloud #Temporal Consistency

2026년 6월 11일

[논문리뷰] SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning

본 요청에 대해 제공된 논문 URL(https://arxiv.org/html/2606.13673)을 분석하려 시도하였으나, 현재 시스템 환경에서 해당 페이지에 대한 접근이 제한되어 있습니다. 따라서 논문 내용을 직접 확인하여 요약하는 것이 불가능합니다.

2026년 6월 11일

[논문리뷰] SG-OPD: Sign-Gated On-Policy Distillation via Sign-Consistency Gating and Phased Teacher Sampling

본 연구는 기존의 Off-policy Distillation이 지닌 데이터 고립성 문제와 Teacher-Student 간의 Distribution Mismatch를 해결하는 데 초점을 맞춥니다.

#Review #Knowledge Distillation #On-Policy Learning #Sign-Consistency #Phased Teacher Sampling #Large Language Models #Model Alignment

2026년 6월 11일

[논문리뷰] Robust-U1: Can MLLMs Self-Recover Corrupted Visual Content for Robust Understanding?

죄송합니다. 현재 요청하신 논문 링크(https://arxiv.org/html/2606.08063)에 대해 직접적인 접근이 제한되어 있습니다. 해당 URL은 최신 논문이거나 일시적인 접근 불가 상태일 수 있습니다.

2026년 6월 11일

[논문리뷰] Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

본 논문은 대규모 언어 모델(LLM)의 안전성 평가가 고정된 쿼리 예산(fixed query budget)에 의존함에 따라 발생하는 심각한 정보 왜곡 문제를 해결하고자 합니다.

#Review #Adversarial Robustness #Compute-Aware Evaluation #FLOPs #Jailbreak Attacks #Risk-Compute Curves #Safety Alignment

2026년 6월 11일

[논문리뷰] Revisiting Articulated Parts Perception in Robot Manipulation

본 연구는 기존의 로봇 조작 연구들이 정적인 객체 인식에 편중되어, 관절형 객체의 복잡한 기구학적 특성을 충분히 반영하지 못하고 있다는 점을 해결하고자 한다.

#Review #Articulated Parts #Robot Manipulation #Part Segmentation #Motion Estimation #Geometric Reasoning

2026년 6월 11일

[논문리뷰] PianoKontext: Expressive Performance Rendering from Deadpan Context

본 논문은 기존의 음악 생성 모델이 표현적 타이밍(Expressive timing)과 다성 음악(Polyphonic music)의 복잡성을 제대로 모델링하지 못하는 문제를 해결하기 위해 PianoKontext를 제안한다.

#Review #Expressive Performance Rendering #Flow Matching #Latent Diffusion #Dynamic Time Warping #Music2Latent #DiT #RoPE

2026년 6월 11일

[논문리뷰] N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization

본 연구는 LLM의 강화학습 과정 중 Rollout 단계에서 발생하는 효과적인 탐색(Exploration)의 부족과 기존 방법론의 한계점을 해결하고자 합니다.

#Review #Reinforcement Learning #Large Language Models #GRPO #Semantic Neighbor Mixing #Policy Optimization #Embedding Space #Latent Reasoning

2026년 6월 11일

[논문리뷰] MuJoCo-Drones-Gym: A GPU-Accelerated Multi-Drone Simulator for Control and Reinforcement Learning

본 논문은 기존 쿼드콥터 시뮬레이터들이 가진 물리적 정확성, Multi-agent 지원, 그리고 현대적인 Deep RL 파이프라인에 필요한 처리량(Throughput) 간의 Trade-off 문제를 해결하고자 합니다.

#Review #Multi-drone Simulator #MuJoCo #Reinforcement Learning #GPU Acceleration #MJX #Aerial Robotics #Gymnasium

2026년 6월 11일

[논문리뷰] MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold

본 논문은 단일 NFOV 이미지로부터 사용자가 자유롭게 이동하며 탐색할 수 있는 spatially persistent한 3D 환경을 생성하는 것을 목표로 합니다.

#Review #World Model #3D Gaussian Splatting #Panoramic Generation #Video Rendering #Real-Time Interaction

2026년 6월 11일

[논문리뷰] MiniMax Sparse Attention

죄송합니다. 요청하신 논문 링크(https://arxiv.org/html/2606.13392)는 현재 외부에서 접근이 제한되어 있거나 유효하지 않은 것으로 확인됩니다. arXiv 서버의 일시적인 접근 제한 혹은 논문 게시 상태 변경 등의 이유로 해당 페이지의 콘텐츠를 읽어올 수 없습니다.

2026년 6월 11일

[논문리뷰] MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

본 논문은 대규모 언어 모델이 수학적 증명 문제에서 겪는 Hallucination과 Logical Inconsistency 문제를 해결하는 것을 핵심 목표로 합니다.

#Review #Mathematical Reasoning #Reinforcement Learning #Test-Time Scaling #Generative-Verifier #Formal Verification #Scalable Alignment

2026년 6월 11일