Review

[논문리뷰] NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation

본 논문은 기존의 Reconstruction-based 자율주행 시뮬레이터가 가진 제약 사항인 데이터 의존성과 새로운 장면(Novel scene)에 대한 일반화 부족 문제를 해결하기 위해 OmniDreams를 제안한다. 기존 방식은 캡처된 데이터 환경 내부에서만 가상 시나리오를 구성할 수 있어 확장성이 매우 제한적이다.

#Review #Generative World Model #Autonomous Vehicle Simulation #Closed-Loop #Autoregressive Diffusion #World-Action Model #Vision-Language-Action

2026년 6월 2일

[논문리뷰] Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling

본 연구는 MLLM이 평가자(Judge)로 활용될 때 발생하는 Perceptual Judgment Bias를 해결하여 평가의 신뢰성을 제고하고자 합니다. 기존 MLLM 평가자들은 시각적으로 잘못된 응답임에도 불구하고 논리적으로 그럴듯한 텍스트가 포함되어 있으면 높은 점수를 부여하는 경향이 있습니다 .

#Review #Multimodal LLM-as-a-Judge #Perceptual Judgment Bias #Reward Modeling #Perceptual Perturbation #GRPO #Visual Grounding

2026년 6월 2일

[논문리뷰] MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

본 논문은 이질적인(Heterogeneous) Mid-training 데이터 혼합물에서 효과적인 데이터 선택이 어렵다는 문제를 해결하고자 합니다.

#Review #Mid-training #Data Selection #Rubric Discovery #LLM #Distillation #Source-Aware #Scalability

2026년 6월 2일

[논문리뷰] MERIT: Learning Disentangled Music Representations for Audio Similarity

본 논문은 기존 음악 유사도 모델이 여러 음악적 요소를 하나의 Monolithic 점수로 융합하여 표현함에 따라 발생하는 해석 가능성 및 세밀한 쿼리 제어의 한계를 해결하고자 합니다 .

#Review #Music Representation Learning #Disentanglement #Audio Similarity #Representation Learning #Contrastive Learning #Self-Supervised Learning

2026년 6월 2일

[논문리뷰] Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

본 논문은 현대의 LLM이 배포 이후 새로운 정보를 지속적으로 학습하지 못하는 '정적(Static)'인 한계와, 업데이트 시 발생하는 Catastrophic Forgetting (CF) 문제를 해결하고자 합니다.

#Review #Continual Learning #Language Models #Memory Consolidation #Knowledge Seeding #Self-Improvement #Dreaming #Catastrophic Forgetting

2026년 6월 2일

[논문리뷰] KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

본 논문은 test-time scaling 환경에서 발생하는 KV-Cache 양자화의 오류 누적 문제를 해결하는 데 집중합니다. 기존의 양자화 방식은 주로 고정된 긴 컨텍스트를 다루는 prefill 설정에서 평가되었으나, 실제 디코딩 과정에서는 토큰 생성마다 오류가 반복적으로 누적되어 추론 품질이 급격히 저하됩니다 .

#Review #KV-Cache Quantization #Variance Normalization #Error Accumulation #Reasoning Tasks #Hadamard Rotation #Dual-Scaling

2026년 6월 2일

[논문리뷰] Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

본 논문은 기존 휴머노이드 모션 트래킹 연구가 겪고 있는 데이터 및 모델 규모의 한계와 그로 인한 일반화 성능 저하 문제를 해결하고자 합니다. 기존의 연구들은 주로 소규모 MLP 기반 정책에 의존해왔으며, 이는 정교한 모션 추적과 범용적인 일반화 사이의 고질적인 트레이드오프(trade-off)를 유발했습니다 .

#Review #Humanoid Motion Tracking #Transformer #Zero-Shot Generalization #Large-scale Motion Data #Harmonic Motion Embedding #DAgger Distillation

2026년 6월 2일

[논문리뷰] From Activation to Causality: Discovery of Causal Visual Representations in the Human Brain

본 논문은 뇌의 시각적 개념 표상을 결정하는 데 있어 기존의 Activation-based 방법론이 갖는 근본적인 한계를 해결하고자 합니다.

#Review #fMRI #Causal Representation Discovery #Visual Concept Localization #Generative Models #Counterfactual Stimuli #BrainCause

2026년 6월 2일

[논문리뷰] Domain-Specific Data Synthesis for LLMs via Minimal Sufficient Representation Learning

본 논문은 LLM의 도메인 특화 적응(Domain-Specific Adaptation) 과정에서 발생하는 데이터 확보 문제를 해결하고자 한다.

#Review #Domain-Specific Data Synthesis #LLMs #Minimal Sufficient Representation Learning #Prompt Tuning #Contrastive Disentanglement #Domain Adaptation

2026년 6월 2일

[논문리뷰] Diagnosing Harmful Continuation in Answer-Correct Long-CoT Training Traces

본 연구는 답변이 정확한 Long-CoT 데이터라도 그 내부의 추론 궤적에 따라 모델 학습의 유효성이 크게 달라질 수 있다는 점을 문제로 제기합니다. 기존 연구들은 데이터 선택이나 단순한 길이 절삭(truncation)에 의존하여 추론 단계의 품질을 근본적으로 규명하지 못했습니다.

#Review #Long-CoT #Supervised Fine-Tuning #Harmful Continuation #Uncertainty–Geometry Mismatch #Reasoning Trace #Boundary Proxy

2026년 6월 2일

[논문리뷰] Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

본 논문은 기존 coupled diffusion models가 unified I2I translation 과제에서 겪는 성능 한계를 해결하고자 합니다.

#Review #Diffusion Models #Image-to-Image Translation #Domain Harmonization #Data Efficiency #Residual Learning #Manifold Lifting

2026년 6월 2일

[논문리뷰] Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging

본 연구는 대규모 instruction tuning에서 발생하는 Gradient Interference와 시스템 통신 병목이라는 두 가지 핵심 문제를 동시에 해결하고자 한다.

#Review #Instruction Tuning #Model Merging #Decentralized Optimization #Gradient Interference #Vision-Language Models #PCA Decomposition

2026년 6월 2일

[논문리뷰] ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

본 논문은 LLM 에이전트 생태계에서 핵심 소프트웨어 단위인 Agent Skills의 보안 문제를 다루며, 서로 다른 보안 스캐너(VirusTotal, Static Analysis, SkillSpector)들이 동일한 스킬에 대해 불일치하는 결과를 보일 때 이를 어떻게 해석하고 대응할 것인지에 대한 문제를 제기한다.

#Review #Agent Skills #LLM Agents #Software Supply Chain #Security Scanning #Scanner Disagreement #Trust Artifacts #OpenClaw

2026년 6월 2일

[논문리뷰] Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching

본 논문은 대규모 paired dataset 없이도 instruction-based visual editing이 가능한 범용 프레임워크인 ByG (Bootstrap Your Generator)를 제안합니다 .

#Review #Flow Matching #Unpaired Editing #Cycle Consistency #Straight-Through Estimation #Gradient Routing #Bootstrap #Visual Editing

2026년 6월 2일

[논문리뷰] Benchmarking Visual State Tracking in Multimodal Video Understanding

본 논문은 최신 Multimodal Large Language Models (MLLMs)가 비디오의 지속적인 역동성을 이해하고 상태를 추적하는 능력, 즉 Visual State Tracking 능력이 결여되어 있다는 점을 지적한다 .

#Review #Multimodal Large Language Models #Video Understanding #Visual State Tracking #Benchmark #Visual Perception #Agentic Frameworks

2026년 6월 2일

[논문리뷰] BA-T: An Iterative Transformer for Two-View Bundle Adjustment

본 연구는 기존의 feed-forward 3D 재구성 모델들이 의존하는 heavy decoder stack의 비효율성과 기하학적 self-correction 메커니즘의 부재를 해결하고자 합니다.

#Review #Bundle Adjustment #Iterative Transformer #Implicit Latent Space #Two-View Reconstruction #Pose Estimation #Geometric Consistency

2026년 6월 2일

[논문리뷰] AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

본 논문은 기존 의료 AI 벤치마크가 End-to-End 연구 과정의 복잡성을 간과하고 최종 결과물 평가에만 치중하여, 에이전트의 행동 특성이나 실패 원인을 파악하기 어렵다는 문제점을 해결하고자 합니다 .

#Review #Medical-AI #Autonomous Agents #Benchmark #Research Automation #Workflow-Aware Evaluation #LLM

2026년 6월 2일

[논문리뷰] Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams

본 논문은 기존의 Auto-Harness 시스템들이 고정된 오프라인 벤치마크에서는 우수한 성능을 보이지만, 실제 Open-Ended Task Streams 환경에서는 성능 저하를 겪는다는 문제를 해결합니다 .

#Review #Agentic System #Auto-Harness #Open-Ended Task Streams #Multi-Agent Evolution #Solve-Time Adaptation #Non-Stationarity #Human-in-the-Loop

2026년 6월 2일

[논문리뷰] AURA: Action-Gated Memory for Robot Policies at Constant VRAM

로봇 에이전트가 끊김 없이 지속적으로 동작하는 환경에서 기존의 Transformer KV-cache 방식은 에피소드 길이에 따라 메모리 요구량이 선형적으로 증가하여 에지 하드웨어의 메모리 대역폭을 심각하게 제한합니다.

#Review #Robot Policies #VRAM #Action-Utility Gate #Fast-Weight Memory #Inference Efficiency #POMDP

2026년 6월 2일

[논문리뷰] A Multi-AI-agent Framework Enabling End-to-end Finite Element Analysis for Solid Mechanics Problems

FEA는 현대 공학의 필수 요소이나, 입문자에게 요구되는 높은 학습 곡선과 복잡한 시뮬레이션 설정 오류로 인해 진입 장벽이 매우 높습니다. 기존의 API 기반 자동화 방식은 고정된 스크립트와 템플릿에 의존하여 설계 변경 시 유연성이 부족하다는 한계를 가집니다.

#Review #AI agent #Finite Element Analysis (FEA)#Large Language Models (LLM)#Multi-agent framework #Retrieval-Augmented Generation (RAG)#Solid mechanics

2026년 6월 2일