Review

[Paper Review] Distilling Feedback into Memory-as-a-Tool

This paper aims to address the high inference-time compute cost of LLMs and the inefficiency of repeated self-correction. In particular, it targets the knowledge loss and wasted computation that arise because existing 'System 2' scaling methods repeat the reasoning process from scratch for every new query.

#Review #LLM #Continual Learning #Memory-Augmented Agents #Self-Correction #Feedback Distillation #Tool Use #Inference Cost Amortization #Rubric-based Learning

January 11, 2026

[Paper Review] CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature

This paper aims to overcome the limitations of existing mesh-based methods in generating controllable, photorealistic 3D facial caricature avatars.

#Review #3D Gaussian Splatting #Facial Caricaturization #Gaussian Curvature #Mesh Deformation #Photorealistic Rendering #Human Avatars #Local Affine Transformations

January 11, 2026

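[Paper Review] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

The paper re-evaluates the necessity and benefits of Chain-of-Thought (CoT) reasoning in video understanding tasks and points out that existing CoT approaches are sometimes less accurate and less efficient than answering directly. Building on this, the goal is to develop an adaptive video reasoning framework that reasons only when necessary, improving efficiency and accuracy at the same time.

#Review #Video Understanding #Chain-of-Thought (CoT) #Reinforcement Learning (RL) #Adaptive Reasoning #Early Exit #Multimodal LLM #Video QA #Temporal Grounding

January 8, 2026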

[Paper Review] VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

This paper aims to address the difficulty existing video world models have with unified, precise control over camera and multi-object motion.

#Review #Video World Model #4D Geometric Control #Gaussian Trajectories #Video Generation #Diffusion Models #Camera Control #Object Motion Control #Data Engine

January 8, 2026

[Paper Review] Towards Open-Vocabulary Industrial Defect Understanding with a Large-Scale Multimodal Dataset

The goal is to overcome the limitations of existing industrial defect inspection systems: high false-positive rates, low adaptability, poor generalization, and the uninterpretability of black-box models.

#Review #Industrial Defect Detection #Multimodal Dataset #Vision-Language Model #Diffusion Model #Open-Vocabulary Learning #Quality Inspection #Data Efficiency #Foundation Model

January 8, 2026

[Paper Review] Token-Level LLM Collaboration via FusionRoute

The paper aims to generate higher-quality responses than a single model through effective token-level collaboration among multiple specialized LLMs.

#Review #LLM Collaboration #Token-level Routing #Mixture-of-Experts #Complementary Logits #Preference Optimization #FusionRoute #Domain Adaptation

January 8, 2026

[Paper Review] The Illusion of Specialization: Unveiling the Domain-Invariant 'Standing Committee' in Mixture-of-Experts Models

This study questions the common assumption that Mixture-of-Experts (MoE) models achieve domain specialization through sparse routing.

#Review #Mixture-of-Experts (MoE) #Sparse Routing #Domain Specialization #Load Balancing #Interpretability #Standing Committee #LLM

January 8, 2026

[Paper Review] RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

The goal is to generate the high-quality augmented data needed to train robot policies. To do so, the paper tackles the data scarcity and limited diversity caused by the difficulty of collecting robot manipulation data, as well as the multi-view and temporal coherence issues that existing generative models have overlooked.

#Review #Robot Manipulation #Data Augmentation #Video Generation #Diffusion Models #Multi-View #Visual Identity Prompting #Action-Guided Segmentation #Visuomotor Policy

January 8, 2026

[Paper Review] RelayLLM: Efficient Reasoning via Collaborative Decoding

This paper proposes an efficient method that addresses the high compute cost and latency of large language models (LLMs) on complex reasoning tasks while compensating for the limited reasoning ability of small language models (SLMs).

#Review #LLM #SLM #Collaborative Decoding #Token-level Intervention #Reinforcement Learning #GRPO #Efficient Reasoning #Resource Efficiency

January 8, 2026

[Paper Review] Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

This paper aims to close the gap between understanding and generation capabilities in In-Context Image Generation and Editing (ICGE), a task that requires accurately understanding the user's intent and executing it faithfully.

#Review #In-Context Image Generation #Image Editing #Multimodal Models #Chain-of-Thought #Structured Reasoning #Reinforcement Learning #Alignment #Diffusion Models

January 8, 2026

[Paper Review] RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

This paper aims to address the reliability and generalization problems of auto white balance (AWB) correction in low-light night-time scenes.

#Review #Auto White Balance (AWB) #Deep Reinforcement Learning (DRL) #Low-Light Imaging #Night-time Scenes #Color Constancy #Cross-Sensor Generalization #Statistical Methods #Curriculum Learning

January 8, 2026

[Paper Review] Plenoptic Video Generation

This paper aims to address the failure of existing camera-controlled video re-rendering methods to maintain spatio-temporal consistency in multi-view scenarios.

#Review #Generative Video #Camera Control #Plenoptic Function #Autoregressive Model #Diffusion Transformer #3D FOV Retrieval #Spatio-Temporal Consistency

January 8, 2026

[Paper Review] Memorization in 3D Shape Generation: An Empirical Study

Memorization of training data by 3D generative models can lead to data leakage and reduced diversity in generated outputs, yet it has not been studied systematically.

#Review #3D Shape Generation #Memorization #Generative Models #Diffusion Models #Evaluation Framework #Generalization #Data Augmentation

January 8, 2026

[Paper Review] Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers

The goal is to address the problem that, when training large language models (LLMs), weight decay (WD) pins the scale of weight matrices to a 'noise-WD equilibrium', which prevents the model from learning a scale suited to the data.

#Review #Large Language Models #Weight Decay #Learnable Multipliers #Scale Adaptation #Optimization #µP Parametrization #Adam #Muon

January 8, 2026

[Paper Review] GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

This paper aims to solve the reward signal collapse that existing Group Relative Policy Optimization (GRPO) suffers from in multi-reward settings.

#Review #Multi-reward RL #Policy Optimization #Reward Normalization #GRPO #GDPO #LLMs #Training Stability

January 8, 2026

[Paper Review] Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models

This paper challenges the existing assumption that all tokens contribute equally to model instability during the autoregressive generation process of Vision-Language Models (VLMs).

#Review #Vision-Language Models #Adversarial Attacks #Entropy-Guided Attacks #Token Vulnerability #Harmful Content #Cross-Model Transferability #Autoregressive Generation

January 8, 2026

[Paper Review] Enhancing Object Detection with Privileged Information: A Model-Agnostic Teacher-Student Approach

This paper aims to improve object detection performance by integrating the Learning Under Privileged Information (LUPI) paradigm, which exploits privileged information (PI) that is accessible only during training.

#Review #Object Detection #Privileged Information #Teacher-Student Learning #Knowledge Distillation #Model-Agnostic #Bounding Box Masks #UAV-based Detection

January 8, 2026

[Paper Review] DocDancer: Towards Agentic Document-Grounded Information Seeking

This study aims to address the inefficient tool use of existing Document Question Answering (DocQA) agents and their dependence on closed-source models.

#Review #Agentic AI #Document Question Answering #Tool-use #Information Seeking #Synthetic Data Generation #Long-context Understanding #Multimodal Documents

January 8, 2026

[Paper Review] DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

The paper aims to address exposure bias and error accumulation in the Chain-of-Thought (CoT) reasoning of large language models (LLMs).

#Review #Chain-of-Thought #Diffusion Models #Large Language Models #Reasoning #Error Correction #Preference Optimization #Denoising

January 8, 2026

[Paper Review] AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering

This paper points out that the self-improvement of LLM agents is often unstable and hard to audit.

#Review #LLM Agents #Release Engineering #Self-Improvement #Regression Testing #Continuous Integration #Flip-Centered Gating #Auditable Development #Software Engineering

January 8, 2026