Review

[논문리뷰] Where Rectified Flows Leak: Characterising Membership Signals Along the Interpolation Path

생성 모델의 대규모 배포가 확산됨에 따라 데이터 프라이버시 및 저작권 문제가 심화되고 있으며, 모델이 학습 데이터를 직접적으로 복제하지 않더라도 내재된 정보를 기억(Memorisation)하고 있을 가능성이 제기됩니다.

#Review #Rectified Flows #Membership Inference Attack #Generative Models #Interpolation Path #Memorisation #Deep Learning Security

2026년 6월 8일

[논문리뷰] Trajectory-Refined Distillation

본 논문은 현대 LLM의 후행 학습에서 널리 사용되는 OPD가 구조적으로 직면한 Prefix Failure 문제를 해결하고자 합니다. 기존 연구들은 토큰 단위의 손실 함수 수정이나 특정 토큰의 가중치 조정을 통해 이 문제를 해결하려 했으나, 이는 실패한 궤적의 근본 원인을 수정하지 못하는 한계가 있었습니다 .

#Review #On-policy Distillation #Prefix Failure #Trajectory-Refined Distillation #Large Language Models #Self-distillation #Policy Gradient #Alignment

2026년 6월 8일

[논문리뷰] Text-to-Image Models Need Less from Text Encoders Than You Think

본 논문은 최신 Text-to-Image(TTI) 모델들이 복잡한 Text Encoder에 크게 의존하는 현상을 분석하고, 과연 이러한 모델들이 실제로 풍부한 문맥 정보를 활용하는지 의문을 제기합니다.

#Review #Text-to-Image #Diffusion Transformer #Text Encoder #Contextless Embedding #Bag-of-Position-Tagged-Words #Compositionality

2026년 6월 8일

[논문리뷰] SwiftVR: Real-Time One-Step Generative Video Restoration

본 논문은 실시간 비디오 스트리밍 환경에서 고해상도 복원을 수행하기 위한 제너레이티브 VR 모델의 배포 문제를 해결합니다.

#Review #Generative Video Restoration #Real-time #Diffusion Transformer #Shifted-Window Attention #Streaming Inference

2026년 6월 8일

[논문리뷰] SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

본 논문은 기존의 정적인 VQA나 시뮬레이터 종속적 벤치마크가 멀티모달 에이전트의 실제 환경에서의 동적 공간 추론 능력을 평가하는 데 한계가 있다는 점을 지적합니다. 대부분의 기존 연구는 privileged state 정보에 의존하거나 특정 환경에 고착화된 인터페이스를 사용하여 일반적인 공간 지능을 측정하기 어렵습니다 .

#Review #Spatial Reasoning #Multimodal Agents #Interactive Benchmark #Egocentric Vision #POMDP #Spatial Intelligence

2026년 6월 8일

[논문리뷰] SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

본 논문은 현대의 Deep Research Agents가 성공률만을 극대화하려는 brute-force 학습 전략으로 인해 심각한 연산 비효율성에 직면했다는 문제를 제기합니다.

#Review #Web Agents #Training Efficiency #Reward Gating #Adaptive Efficiency Anchoring #Pareto Frontier #Reinforcement Learning #Supervised Fine-Tuning

2026년 6월 8일

[논문리뷰] Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

본 논문은 LLM post-training에서 활용되는 기존의 reward evaluation 방식이 이질적인 평가 기준을 통합하는 데 한계를 보이고 있다는 점을 지적한다.

#Review #Reward Modeling #Agent Skills #LLM-as-a-Judge #Reinforcement Learning #Instruction Following #Evidence-based Evaluation

2026년 6월 8일

[논문리뷰] Skill-3D: Evolving Scene-Aware Skills for Agentic 3D Spatial Reasoning

본 논문은 기존 MLLM 기반 에이전트들이 3D 공간 추론 작업에서 장면의 특성을 무시하고 획일적인 툴 사용 전략을 취함으로써 성능이 저하되는 문제를 해결하고자 한다.

#Review #Agentic AI #3D Spatial Reasoning #Scene-Aware Skills #Tool Learning #Skill Evolution

2026년 6월 8일

[논문리뷰] SigmaScale: LLM Compression with SVD-based Low-Rank Decomposition and Learned Scaling Matrices

본 논문은 기존의 LLM 압축 기법들이 가중치 분해 시 발생하는 정밀도 저하와 그로 인한 성능 손실을 효과적으로 극복하지 못한다는 문제를 해결하고자 한다.

#Review #LLM Compression #SVD #Low-Rank Decomposition #Weight Quantization #Model Pruning #Parameter Efficiency

2026년 6월 8일

[논문리뷰] Self-Evaluation Is Already There: Eliciting Latent Judge Calibration in Base LLMs with Minimal Data

본 논문은 LLM이 외부 Judge의 평가를 사전에 예측하여 스스로 자신의 답변을 재평가하거나 선별할 수 있는지에 대한 근본적인 의문을 탐구합니다.

#Review #Large Language Models #Self-Evaluation #Calibration #Reinforcement Learning #Elicitation #Distillation #Alignment

2026년 6월 8일

[논문리뷰] SWE-Explore: Benchmarking How Coding Agents Explore Repositories

본 논문은 기존의 SWE-bench와 같은 벤치마크가 이슈 해결 여부를 이분법적(성공/실패)으로만 판단하여, 에이전트의 내부적인 탐색, 위치 파악, 맥락 추출 능력을 측정하지 못하는 문제를 해결하고자 합니다 .

#Review #Repository-level Coding #Coding Agents #Repository Exploration #Line-level Benchmark #Context Retrieval #Software Engineering

2026년 6월 8일

[논문리뷰] Reasoning over Grammar: Can Synthetic Linguistic Reasoning Traces Enhance Low-Resource Machine Translation?

본 연구는 LLM을 활용한 극저자원 언어(extremely low-resource language)의 Machine Translation (MT) 성능을 향상시키는 것을 목표로 합니다.

#Review #Low-Resource Machine Translation #Large Language Models #Linguistic Reasoning Traces #In-Context Learning #Universal Dependencies #Chain-of-Thought

2026년 6월 8일

[논문리뷰] Reasoning Arena: Trace Tournaments When Verifiable Rewards Fall Short

본 논문은 RLVR의 고질적인 문제인 Non-diverse reward group으로 인한 학습 신호 소실과 컴퓨팅 자원 낭비를 해결하는 것을 목표로 합니다.

#Review #Reasoning Arena #RLVR #Trace Tournament #Bradley-Terry Model #Non-diverse Reward Group #Adaptive Routing

2026년 6월 8일

[논문리뷰] Pruning and Distilling Mixture-of-Experts into Dense Language Models

본 연구는 MoE 모델의 높은 메모리 요구량으로 인해 발생하는 배포 제약 문제를 해결하기 위해, 전문가 기반 구조를 효율적인 Dense 모델로 변환하는 체계적인 프레임워크를 제안한다.

#Review #Mixture-of-Experts #Knowledge Distillation #Model Pruning #D-Optimal Selection #Dense Language Models #Expert Scoring #Submodularity

2026년 6월 8일

[논문리뷰] Phase Marginalization for Patch-Grid Instability in Vision Transformers

본 논문은 Vision Transformers(ViT)의 패치화(patchification) 과정에서 발생하는 patch-grid phase instability 문제를 해결하고자 한다.

#Review #Vision Transformers #Patch-Grid Phase #Dense Prediction #Phase Marginalization #Test-Time Augmentation #Aliasing

2026년 6월 8일

[논문리뷰] PIPE-Cypher: Automatic Enterprise Benchmark Generation for Text-to-Cypher Systems

본 논문은 기업 내 Property Graph 환경에서 신뢰할 수 있는 Text2Cypher 평가를 위한 반복 가능하고 자동화된 벤치마크 생성 파이프라인의 부재 문제를 해결합니다.

#Review #Text2Cypher #Benchmark Generation #Property Graph #Execution Validation #Local LLM #Governed Generation

2026년 6월 8일

[논문리뷰] PBSD: Privileged Bayesian Self-Distillation for Long-Horizon Credit Assignment

본 논문은 long-horizon agentic task에서 발생하는 sparse reward로 인한 credit assignment의 근본적인 한계를 해결하고자 한다.

#Review #Reinforcement Learning #Long-Horizon Credit Assignment #Bayesian Inference #Self-Distillation #Search Agents #Agentic RL

2026년 6월 8일

[논문리뷰] Optical Reasoning: Rethinking Images as an Expressive Reasoning Medium Beyond Text

본 연구는 기존의 텍스트 기반 CoT(Chain-of-Thought)가 가지는 비효율성과 multimodal 태스크에서의 표현력 한계를 해결하고자 한다.

#Review #Optical Reasoning #Multimodal Large Language Models #Chain-of-Thought #Context Compression #Interleaved-modal Reasoning #Visual Reasoning

2026년 6월 8일

[논문리뷰] On the Geometry of On-Policy Distillation

본 논문은 OPD가 SFT와 RLVR의 특성을 모두 공유함에도 불구하고, 파라미터 공간에서의 구체적인 학습 동역학(training dynamics)은 제대로 규명되지 않았다는 점을 핵심 문제로 정의합니다.

#Review #On-policy Distillation #Parameter-space Geometry #Subspace Locking #SFT #RLVR #Large Language Models

2026년 6월 8일

[논문리뷰] OmniGameArena: A Unified UE5 Benchmark for VLM Game Agents with Improvement Dynamics

본 논문은 기존 VLM Agent 벤치마크가 단일 시도(First-attempt) 점수만을 보고하고, Solo 플레이 위주로 구성되어 있어 에이전트의 학습 및 개선 능력을 측정하지 못한다는 문제를 지적한다.

#Review #VLM Agents #Benchmark #Unreal Engine 5 #Improvement Dynamics #Agentic Reflection #Cold-start #Generalization

2026년 6월 8일