Review

[논문리뷰] Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models

arXiv에 게시된 'Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models' 논문에 대한 자세한 리뷰입니다.

#Review #Multi-Agent Reinforcement Learning #Policy-Space Response Oracles #Large Language Models #Program Synthesis #Interpretable AI #Game Theory #Code Generation

2026년 3월 11일

[논문리뷰] Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

arXiv에 게시된 'Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams' 논문에 대한 자세한 리뷰입니다.

#Review #Online Adaptation #Continual Learning #Knowledge Streams #Large Language Models #Benchmarking #State Tracking #Retrieval Augmented Generation #Agentic Memory

2026년 3월 11일

[논문리뷰] CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

Jiajun Song이 arXiv에 게시한 'CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR' 논문에 대한 자세한 리뷰입니다.

#Review #Reinforcement Learning #Verifiable Rewards (RLVR)#Contrastive Learning (CL)#Policy Optimization #Large Language Models (LLMs)#Generalization #Robustness #Reasoning Tasks

2026년 3월 11일

[논문리뷰] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

arXiv에 게시된 'Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning' 논문에 대한 자세한 리뷰입니다.

#Review #Reinforcement Learning #Large Language Models #Natural Language Feedback #Exploration #Group-Level Feedback #Self-Refinement #Sample Efficiency

2026년 3월 11일

[논문리뷰] Any to Full: Prompting Depth Anything for Depth Completion in One Stage

Taichi Liu이 arXiv에 게시한 'Any to Full: Prompting Depth Anything for Depth Completion in One Stage' 논문에 대한 자세한 리뷰입니다.

#Review #Depth Completion #Monocular Depth Estimation (MDE)#Prompt Learning #Domain Generalization #Pattern Agnostic #One-stage Learning #Robotic Perception #Scale Consistency

2026년 3월 11일

[논문리뷰] VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?

arXiv에 게시된 'VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning?' 논문에 대한 자세한 리뷰입니다.

#Review #Vision-Language Models #Comparative Reasoning #Subtle Differences #Benchmark #Multi-modal AI #Image Comparison #VQA #Fine-grained Analysis

2026년 3월 10일

[논문리뷰] Towards a Neural Debugger for Python

arXiv에 게시된 'Towards a Neural Debugger for Python' 논문에 대한 자세한 리뷰입니다.

#Review #Neural Debuggers #Python Execution Traces #Large Language Models (LLMs)#Markov Decision Process (MDP)#Program Understanding #Code Generation #Inverse Execution #CruxEval

2026년 3월 10일

[논문리뷰] Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

arXiv에 게시된 'Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs' 논문에 대한 자세한 리뷰입니다.

#Review #LLMs #Reasoning #Parametric Knowledge #Factual Recall #Hallucination #Computational Buffer #Factual Priming #Chain-of-Thought

2026년 3월 10일

[논문리뷰] The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

Divya Chaudhary이 arXiv에 게시한 'The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness' 논문에 대한 자세한 리뷰입니다.

#Review #Logical Reasoning #Situational Awareness #LLMs #Deceptive Alignment #AI Safety #RAISE Framework #Self-Modeling #Deduction #Induction #Abduction

2026년 3월 10일

[논문리뷰] Streaming Autoregressive Video Generation via Diagonal Distillation

arXiv에 게시된 'Streaming Autoregressive Video Generation via Diagonal Distillation' 논문에 대한 자세한 리뷰입니다.

#Review #Video Generation #Autoregressive Models #Diffusion Models #Distillation #Real-time #Streaming #Temporal Coherence #Flow Matching

2026년 3월 10일

[논문리뷰] Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports

Yuqing Shao이 arXiv에 게시한 'Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports' 논문에 대한 자세한 리뷰입니다.

#Review #Spatial Intelligence #Vision-Language Models #Sports Analytics #3D Reconstruction #Dataset #Benchmark #Racket Sports #Human-Centric AI

2026년 3월 10일

[논문리뷰] SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement

Divya Chaudhary이 arXiv에 게시한 'SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement' 논문에 대한 자세한 리뷰입니다.

#Review #Recursive Self-Improvement #Alignment Drift #AI Safety #Goal Drift Index (GDI)#Constraint Preservation #Regression Risk #Capability Alignment Ratio (CAR)

2026년 3월 10일

[논문리뷰] Reward Prediction with Factorized World States

Hongbo Zhao이 arXiv에 게시한 'Reward Prediction with Factorized World States' 논문에 대한 자세한 리뷰입니다.

#Review #Reward Prediction #World Models #State Representation #Large Language Models #Zero-shot Learning #Reinforcement Learning #Planning #Factorization

2026년 3월 10일

[논문리뷰] Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

arXiv에 게시된 'Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs' 논문에 대한 자세한 리뷰입니다.

#Review #Multimodal LLMs #Modality Gap #Visual Text Understanding #Error Analysis #Self-Distillation #Text-to-Image Conversion #Reasoning Collapse

2026년 3월 10일

[논문리뷰] Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

arXiv에 게시된 'Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion' 논문에 대한 자세한 리뷰입니다.

#Review #Multimodal AI #Discrete Diffusion Models #Masked Language Modeling #Unified Generative Models #Any-to-Any #Speech-to-Image #Visual Question Answering

2026년 3월 10일

[논문리뷰] MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

Yuante Li이 arXiv에 게시한 'MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants' 논문에 대한 자세한 리뷰입니다.

#Review #Large Language Models (LLMs)#Code Generation #HTML #Interactive Applications #Benchmark #MINIAPPBENCH #Agentic Evaluation #MINIAPPEVAL #Real-World Principles #Human-AI Interaction

2026년 3월 10일

[논문리뷰] MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

arXiv에 게시된 'MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data' 논문에 대한 자세한 리뷰입니다.

#Review #Vision-Language Models #Self-Evolution #Reinforcement Learning #Zero-Data #Multi-Agent Systems #Code Generation #Synthetic Data

2026년 3월 10일

[논문리뷰] InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

ganlinyang이 arXiv에 게시한 'InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing' 논문에 대한 자세한 리뷰입니다.

#Review #Unified Multimodal Models #Multimodal Large Language Model #Image Generation #Image Editing #Chain-of-Thought #Data Synthesis #Low-parameter Models

2026년 3월 10일

[논문리뷰] Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing

arXiv에 게시된 'Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing' 논문에 대한 자세한 리뷰입니다.

#Review #3D Scene Editing #Reinforcement Learning #Multi-view Consistency #Diffusion Models #Reward Modeling #3D Gaussian Splatting #FLUX-Kontext #VGGT

2026년 3월 10일

[논문리뷰] Fish Audio S2 Technical Report

arXiv에 게시된 'Fish Audio S2 Technical Report' 논문에 대한 자세한 리뷰입니다.

#Review #Text-to-Speech (TTS)#Multi-speaker #Multi-turn #Instruction Following #Dual-Autoregressive #Reinforcement Learning (RL)#Data Pipeline #SGLang

2026년 3월 10일