Review

[논문리뷰] BenchPreS: A Benchmark for Context-Aware Personalized Preference Selectivity of Persistent-Memory LLMs

Large Language Models (LLMs)는 User Preferences를 Persistent Memory에 저장하여 여러 Interaction에서 Personalization을 지원하고 있습니다.

#Review #Large Language Models #Personalization #Persistent Memory #Context-Awareness #Preference Selectivity #Benchmark #Misapplication Rate #Appropriate Application Rate

2026년 3월 18일

[논문리뷰] Alignment Makes Language Models Normative, Not Descriptive

최근 Large Language Models (LLMs)는 homo silicus로서 인간 행동을 시뮬레이션하고 예측하는 데 광범위하게 활용되고 있습니다. 그러나 이러한 연구의 대부분은 aligned models 를 사용하며, alignment가 행동 예측에 중립적이거나 유익하다고 암묵적으로 가정합니다.

2026년 3월 18일

[논문리뷰] AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

최근 LLM 기반 에이전트들은 장기적인 상호작용, 개인화된 지원 및 다단계 추론을 지원하기 위해 외부 메모리에 점점 더 의존하고 있습니다. 그러나 기존 메모리 시스템은 세 가지 핵심 문제에 직면합니다.

#Review #LLM Agents #Dialogue Memory #User-Centric #Adaptive Retrieval #Graph Memory #Multi-Agent Systems #Long-Horizon Reasoning

2026년 3월 18일

[논문리뷰] ACE-LoRA: Graph-Attentive Context Enhancement for Parameter-Efficient Adaptation of Medical Vision-Language Models

CLIP과 같은 VLMs 는 Natural image 분야에서 탁월한 성능을 보였고, 이는 Medical domain에도 큰 영감을 주었다. 그러나 기존 Medical VLMs 는 Specialist model과 Generalist model이라는 두 가지 극단적 형태로 나뉘어져 있었다.

2026년 3월 18일

[논문리뷰] WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

최근 Video Diffusion Transformers (DiTs)의 발전으로 Interactive Gaming World Models은 생성된 환경을 Long-Horizon으로 탐색하는 능력을 보여주었지만, 여전히 정밀한 Action Control 및 3D Consistency 확보에 어려움을 겪고 있습니다.

#Review #Interactive Gaming World Models #Camera Pose #Geometric Representation #Diffusion Transformers #Action Control #3D Consistency #Lie Algebra

2026년 3월 17일

[논문리뷰] WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation

최근 Flow Matching 모델은 Latent Autoencoder의 재구성 병목 현상을 피하기 위해 픽셀 공간에서 직접 작동합니다. 그러나 픽셀 매니폴드(manifold)의 의미론적 연속성이 부족하여 최적 운송 경로가 심하게 얽히게 됩니다.

#Review #Image Generation #Flow Matching #Trajectory Conflict #Diffusion Transformers #Waypoint Diffusion Transformers #Just-Pixel AdaLN

2026년 3월 17일

[논문리뷰] Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

최근 MLRMs는 Visual Question Answering 등 다양한 태스크에서 뛰어난 성능을 보였지만, 여전히 심각한 Hallucinations 문제에 직면해 있습니다.

#Review #Multimodal Large Reasoning Models (MLRMs)#Hallucinations #Entropy-Aware Decoding #Latent Reasoning #Visual Grounding #Decoding Strategies #Uncertainty Mitigation

2026년 3월 17일

[논문리뷰] TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas

기존 Text-to-SQL 파싱 방법론들은 Full Schema Assumption 하에서 Large Language Models (LLMs) 의 발전과 함께 remarkable progress를 이루었습니다.

#Review #Text-to-SQL #Unknown Schema #Multi-Turn Reinforcement Learning #Tool Integration #POMDP #Dual-Track GRPO #Schema Grounding #Hallucination

2026년 3월 17일

[논문리뷰] SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

기존의 OLM Benchmark들은 주로 static, accuracy-centric 한 이해(understanding) Task에 초점을 맞추고 있어, 자연스러운 대화에서 Dynamic한 Cues를 탐색하는 Social Interactivity의 핵심 역량을 충분히 평가하지 못하고 있습니다.

#Review #Omni-modal LLMs #Social Interactivity #Benchmark #Speaker Identification #Turn-taking #Interruption Generation #Audio-Visual Integration

2026년 3월 17일

[논문리뷰] SegviGen: Repurposing 3D Generative Model for Part Segmentation

기존 3D Part Segmentation 방법론들은 몇 가지 본질적인 한계에 직면해 있습니다.

#Review #3D Part Segmentation #Generative Models #Diffusion Models #Latent Space #Limited Supervision #Multi-Task Learning

2026년 3월 17일

[논문리뷰] SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?

LLM 기반 에이전트들은 자동화된 코드 생성부터 CI/CD 파이프라인 구성 및 인프라 관리 등 광범위한 software engineering (SWE) 태스크에 점차 활용되고 있습니다.

2026년 3월 17일

[논문리뷰] Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training

기존 UMM visual generation component의 pre-training은 비효율적인 paradigm과 희소하며 높은 품질의 text-image paired data에 대한 의존성이라는 중대한 제약에 직면해 있습니다.

2026년 3월 17일

[논문리뷰] Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

Large Language Models (LLMs)는 extended context windows에도 불구하고 long context 내 정보를 신뢰성 있게 추출하고, 추론하며, 활용하는 데 어려움을 겪고 있다.

2026년 3월 17일

[논문리뷰] Qianfan-OCR: A Unified End-to-End Model for Document Intelligence

기존의 OCR 시스템은 비용, 정확도, 기능성 측면에서 Trade-off를 겪고 있습니다.

2026년 3월 17일

[논문리뷰] Online Experiential Learning for Language Models

현재 대규모 언어 모델(LLM) 개선 패러다임은 주로 offline training에 의존하며, 인간 annotation 또는 simulated environment를 활용합니다. 그러나 이러한 방식은 실제 배포 환경에서 축적되는 풍부한 경험을 전혀 활용하지 못한다는 근본적인 한계가 있습니다.

#Review #Online Experiential Learning (OEL)#Context Distillation #Language Models #Reward-Free Learning #Catastrophic Forgetting #Token Efficiency #On-Policy Learning

2026년 3월 17일

[논문리뷰] One-Eval: An Agentic System for Automated and Traceable LLM Evaluation

Large Language Models (LLMs)의 산업 시스템 도입이 가속화됨에 따라 reliable evaluation 은 model lifecycle 전반에 걸쳐 critical한 구성 요소가 되었습니다. 그러나 현재의 evaluation practices는 상당한 manual effort 를 요구합니다.

2026년 3월 17일

[논문리뷰] Mixture of Style Experts for Diverse Image Stylization

기존의 Diffusion-based Stylization 방법론들은 이미지의 Color Transfer 에 주로 집중하여 Complex Semantics 및 Material Details 를 효과적으로 처리하지 못하는 한계가 있었습니다.

#Review #Image Stylization #Mixture of Experts (MoE)#Diffusion Models #Semantic-aware Stylization #Style Transfer #LoRA

2026년 3월 17일

[논문리뷰] MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

최근 Large Language Models (LLMs)는 유창한 텍스트 생성 및 광범위한 질문 답변 능력에서 상당한 발전을 이루었지만, scientific analysis, financial reasoning, open-ended research와 같은 많은 real-world 문제들은 단순한 conversational ability를 넘어선다.

#Review #Research Agents #Long-Horizon Reasoning #Verification #Agentic LLM #Multi-Step Problem Solving #Reinforcement Learning

2026년 3월 17일

[논문리뷰] M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM

uncalibrated monocular video로부터 streaming reconstruction을 수행하는 것은 고정밀 pose estimation과 동적 환경에서의 계산적으로 효율적인 online refinement가 모두 필요하므로 여전히 challenging합니다.

#Review #SLAM #Streaming Reconstruction #3D Gaussian Splatting #Monocular Video #Foundation Models #Dense Matching

2026년 3월 17일

[논문리뷰] Kinema4D: Kinematic 4D World Modeling for Spatiotemporal Embodied Simulation

Embodied AI 분야에서 로봇 trajectory를 세계 환경 내에서 roll out하는 능력은 demonstration 확장, policy evaluation 및 reinforcement learning에 매우 중요합니다.

#Review #Embodied AI #4D Generative World Model #Spatiotemporal-aware #Kinematic Control #Robotic Simulation #Diffusion Transformer #Pointmap

2026년 3월 17일