Review

[논문리뷰] RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

본 논문은 Verifiable Reward(검증 가능한 보상)가 부재한 Open-ended 환경에서 Deep Research 에이전트를 효율적으로 학습시키는 문제를 해결하고자 한다.

#Review #Meta-RL #Deep Research #Reinforcement Learning #Policy Decomposition #Rubric-guided #Stagewise Credit Assignment #Reflection Meta-Policy

2026년 5월 12일

[논문리뷰] MoCam: Unified Novel View Synthesis via Structured Denoising Dynamics

본 논문은 생성적 Novel View Synthesis에서 발생하는 기하학적 정보와 외형 정보 간의 근본적인 갈등 문제를 해결하고자 합니다.

#Review #Novel View Synthesis #Diffusion Models #Structured Denoising #Geometry-Appearance Disentanglement #4D Re-camera #Video Generative Models

2026년 5월 12일

[논문리뷰] Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction

본 논문은 현대의 Transformer 기반 3D reconstruction 파이프라인이 겪는 연산 효율성 및 저정밀도 실행 시의 불안정성 문제를 해결하고자 합니다.

#Review #3D Reconstruction #Transformer #Sparse Linear Attention #FP8-aware QAT #Model-Agnostic #Knowledge Distillation #Algorithm-System Co-design

2026년 5월 12일

[논문리뷰] Images in Sentences: Scaling Interleaved Instructions for Unified Visual Generation

본 논문은 기존 멀티모달 생성 모델들이 복잡한 다중 이미지 명령을 처리할 때 발생하는 성능 저하 문제를 해결하기 위해 제안되었습니다.

#Review #Multimodal Generation #Interleaved Instructions #Object Binding #Transformer #Multimodal Image Editing #Scalable Data Engine

2026년 5월 12일

[논문리뷰] From Web to Pixels: Bringing Agentic Search into Visual Perception

본 논문은 기존의 시각적 인지 모델이 이미지 내의 시각적 단서나 모델 내부의 Frozen Knowledge에만 의존하여 발생하는 한계점을 해결하고자 합니다.

#Review #Perception Deep Research #WebEyes #Pixel-Searcher #Multimodal Intelligence #Visual Grounding #Search-based Segmentation

2026년 5월 12일

[논문리뷰] Do not copy and paste! Rewriting strategies for code retrieval

본 연구는 코드 검색을 위한 기존의 임베딩 기반 기법들이 코드의 표면적인 문법적 특징에 과도하게 의존(Overfit)하여, 실제 의미론적 행동(Program behavior)을 파악하는 데 한계가 있다는 문제에서 출발합니다.

#Review #Code Information Retrieval #Large Language Models #Rewriting #Embedding #PseudoCode #Token Entropy #Representational Analysis

2026년 5월 12일

[논문리뷰] Continual Harness: Online Adaptation for Self-Improving Foundation Agents

본 논문은 embodied agent가 복잡하고 긴 호흡의 환경에서 명확한 도메인 스캐폴딩 없이도 자율적으로 학습하고 진화할 수 있는 체계를 구축하고자 합니다 .

#Review #Foundation Agents #Continual Harness #Online Adaptation #Embodied AI #In-Context Learning #Reset-Free Training #Process Reward Models

2026년 5월 12일

[논문리뷰] Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

본 연구는 Language-Model post-training 시 제한된 labeled training data가 비효율적으로 사용되는 문제를 해결하고자 한다.

2026년 5월 12일

[논문리뷰] Agent-ValueBench: A Comprehensive Benchmark for Evaluating Agent Values

본 연구는 autonomous agents의 가치 체계가 기반이 되는 LLM의 가치와는 본질적으로 다르며, 이를 체계적으로 평가할 수 있는 도구가 부재하다는 문제 의식에서 출발합니다. 기존의 ValueBench나 ValueCompass와 같은 연구들은 주로 정적인 텍스트 생성 모델의 가치 평가에만 국한되어 있습니다.

#Review #Autonomous Agents #Value Alignment #Benchmark #Agentic Modality #Harness Alignment #Skill Steering

2026년 5월 12일

[논문리뷰] A Causal Language Modeling Detour Improves Encoder Continued Pretraining

본 논문은 도메인 적응을 위한 인코더의 Continued Pretraining에서 기존의 MLM 단독 학습 방식이 갖는 한계를 극복하고자 합니다. 저자들은 특정 도메인(특히 Biomedical) 데이터에서 모델이 충분한 성능을 내지 못하는 이유가 학습 목적 함수 자체의 경직성 때문임을 지적합니다.

#Review #Continued Pretraining #Causal Language Modeling #Masked Language Modeling #Domain Adaptation #Biomedical Encoders #CKA #Freeze Interventions #ModernBERT

2026년 5월 12일

[논문리뷰] Who Prices Cognitive Labor in the Age of Agents? Compute-Anchored Wages

본 논문은 AI 에이전트가 인지 노동 시장의 임금을 결정하는 방식에 대한 기존의 경제학적 오해를 바로잡고, 새로운 가격 결정 프레임워크를 제안한다.

#Review #AI Agents #Factor Pricing #Compute-Anchored Wage #Labor Market #Capital-to-Labor Conversion

2026년 5월 10일

[논문리뷰] What if AI systems weren't chatbots?

본 논문은 인공지능이 대화형 챗봇 인터페이스로 지나치게 빠르게 수렴하고 있다는 점을 지적하며, 이 패러다임이 가져오는 구조적인 사회적, 경제적, 환경적 폐해를 분석한다.

#Review #Conversational AI #Chatbots #User Agency #Sociotechnical Systems #Human-Computer Interaction #AI Governance #Environmental Justice

2026년 5월 10일

[논문리뷰] What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

본 논문은 기존 Latent Diffusion Models(LDMs)의 tokenizer들이 주로 reconstruction fidelity에만 초점을 맞추어 설계되어, 정작 확산 생성 모델의 학습에 적합한 latent space를 형성하지 못한다는 문제를 제기합니다 .

#Review #Latent Diffusion Models #Tokenizer #Latent Manifold #Prior Alignment #Autoencoder #Generative Modeling #Representation Learning

2026년 5월 10일

[논문리뷰] UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

본 논문은 기존 LLM의 post-training 과정이 외부 모델에 지나치게 의존함으로써 발생하는 비용 문제와 보안 위험을 해결하기 위해 UniSD라는 통일된 Self-Distillation 프레임워크를 제안합니다.

#Review #Self-Distillation #Large Language Models #On-Policy Learning #Supervision Reliability #Representation Alignment #Training Stability

2026년 5월 10일

[논문리뷰] UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification

본 논문은 기존의 prefill 가속 기법들이 최신 하이브리드 LLM 아키텍처와 연속 배치(continuous batching) 환경에 부적합하다는 문제를 해결합니다.

#Review #Long-Context LLM #Prefill Acceleration #Dynamic Sparsification #Hybrid Architectures #Continuous Batching #vLLM

2026년 5월 10일

[논문리뷰] Steering Visual Generation in Unified Multimodal Models with Understanding Supervision

본 논문은 최신 UMM이 이해와 생성 기능을 한 모델 내에 통합했음에도 불구하고, 실제로는 두 구성 요소가 상호작용 없이 분리된(Decoupled) 구조로 설계되어 성능 극대화에 한계가 있다는 문제를 지적합니다.

#Review #Unified Multimodal Models #Understanding-Oriented Post-Training #Generation Synergy #Flow Matching #Semantic Supervision #MetaQuery

2026년 5월 10일

[논문리뷰] SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting

본 논문은 기존 Speculative Decoding의 Drafter들이 가진 상반된 한계점을 극복하기 위해 제안되었다.

#Review #LLM Inference #Speculative Decoding #Tree-based Verification #Block-Iterative Drafting #Rank-guided Expansion #Serving-time Adaptation

2026년 5월 10일

[논문리뷰] Sparse Autoencoders as Plug-and-Play Firewalls for Adversarial Attack Detection in VLMs

본 논문은 최신 Vision-Language Models(VLMs)가 Adversarial 공격에 극도로 취약하며, 기존의 탐지 방식들은 실질적인 배포 환경에서의 강력한 공격이나 데이터 분포 변화에 대응하지 못한다는 문제를 해결하고자 합니다.

#Review #Vision-Language Models #Adversarial Attack Detection #Sparse Autoencoders #Plug-and-Play #Robustness #Out-of-Domain Generalization

2026년 5월 10일

[논문리뷰] SkCC: Portable and Secure Skill Compilation for Cross-Framework LLM Agents

본 연구는 서로 다른 에이전트 프레임워크가 스킬의 프롬프트 포맷에 대해 높은 민감도를 보이며, 이로 인해 발생하는 성능 차이와 기존 스킬의 보안 취약점을 해결하고자 한다.

#Review #LLM-Agents #Skill compilation #Prompt engineering #Format adaptation #Security hardening #Intermediate representation

2026년 5월 10일

[논문리뷰] Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility

본 논문은 decoder-only 모델에서 long-context 추론 시 발생하는 Prefill 단계의 높은 계산 비용과 Decode 단계의 KV-cache 메모리 대역폭 한계를 해결하고자 합니다.

#Review #Long-Context Inference #KV-Cache #Phase-Asymmetric #Prefill #Decode #Transformer

2026년 5월 10일