Review

[논문리뷰] ESC-Skills: Discovering and Self-Evolving Skills for Emotional Support Conversations

본 논문은 기존 ESC 시스템들이 주로 end-to-end 방식에 의존하여 해석 가능성이 낮고 체계적인 기술 개선이 어렵다는 문제를 해결하고자 합니다.

#Review #Emotional Support Conversations #Skill-centric Framework #Intervention Units #Self-Evolutionary #Large Language Models #Simulation-based Verification

2026년 5월 27일

[논문리뷰] DenoiseRL: Bootstrapping Reasoning Models to Recover from Noisy Prefixes

본 논문은 LLM의 추론 성능 향상을 위해 외부의 강력한 teacher 모델이나 복잡하게 큐레이션된 학습 데이터에 의존해야 하는 기존 RL 패러다임의 한계를 해결하고자 합니다. 기존 방식들은 학습 데이터의 품질이나 교사의 지식 수준에 따라 성능이 제약되는 structural limitation을 가지고 있습니다.

#Review #Reinforcement Learning #Reasoning Models #Denoising Reasoning #Weak-to-Strong Generalization #Self-correction #Large Language Models

2026년 5월 27일

[논문리뷰] CubePart: An Open-Vocabulary Part-Controllable 3D Generator

기존의 3D 생성 모델은 모놀리식 메쉬(monolithic mesh)를 생성하거나, 사용자가 제어할 수 없는 임의의 파트 단위로만 분해하여 게임 엔진이나 물리 시뮬레이션 환경에 필요한 특정 구조와 정렬하기 어렵습니다.

#Review #3D Generation #Part-Controllable #Open-Vocabulary #Diffusion Transformer #Schema-driven #Game Asset

2026년 5월 27일

[논문리뷰] Clark Hash: Stateless Sparse Johnson-Lindenstrauss Quantization for Neural Embeddings

본 논문은 대규모 신경망 임베딩(neural embeddings)을 저장할 때 발생하는 과도한 메모리 및 스토리지 비용 문제를 해결하기 위해 Clark Hash를 제안합니다.

#Review #Neural Embeddings #Johnson-Lindenstrauss #Quantization #Sparse Projection #Stateless Codec #Dimensionality Reduction

2026년 5월 27일

[논문리뷰] Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models

본 논문은 기존의 Chart QA 벤치마크가 VLM의 진정한 시각적 추론 능력을 정확히 측정하지 못하고, 단순한 시각적 패턴 매칭이나 사전 학습된 파라메트릭 지식에 의한 '지름길(Shortcut)'을 활용하고 있다는 문제를 제기합니다.

#Review #Vision-Language Models #Chart QA #Counterfactual Generation #Visual Reasoning #Shortcut Learning #Generalization

2026년 5월 27일

[논문리뷰] AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

본 논문은 과학적 탐구의 장기적인 연구 과정에서 발생하는 비효율적인 실험 반복과 고립된 탐색 문제를 해결하기 위해 AutoScientists를 제안합니다 .

#Review #Multi-agent Systems #Scientific Experimentation #Self-Organization #Autonomous Discovery #LLM Agents #BioML-Bench

2026년 5월 27일

[논문리뷰] AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning

본 논문은 대규모 언어 모델(LLM) 기반 에이전트의 Long-Horizon Tasks 수행 능력 향상에 Scaling Out 전략이 기여할 수 있는지에 대한 연구를 수행한다.

#Review #Agent Scaling #Collective Reasoning #Long-Horizon Tasks #Shared Reasoning Hub #Multi-Agent Systems #Homogeneous Teams #Heterogeneous Teams #Reinforcement Learning

2026년 5월 27일

[논문리뷰] Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

본 논문은 vision-language models(VLMs)의 agentic reasoning 과정에서 발생하는 '도구 사용의 비효율성' 문제를 해결하고자 합니다.

#Review #Multimodal Agentic Reasoning #Reinforcement Learning #GRPO #AXPO #Tool-call Resampling #Thinking-Acting Gap #Vision-Language Models

2026년 5월 27일

[논문리뷰] AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems

본 논문은 LLM 기반의 다중 에이전트 시스템에서 발생하는 조율 불투명성과 고정된 파이프라인의 경직성 문제를 해결하고자 합니다.

#Review #Multi-Agent Systems #Online Policy Learning #Coordination Substrate #Large Language Models #Task Signatures #Relative Trajectory Evaluation

2026년 5월 27일

[논문리뷰] Advancing Creative Physical Intelligence in Large Multimodal Models

본 연구는 대규모 다중모달 모델(LMM)이 인식 및 추론 능력은 크게 발전했음에도 불구하고, 비일상적인 상황에서 사물을 창의적으로 재사용하는 물리적 지능이 여전히 부족하다는 문제의식에서 출발합니다.

#Review #Multimodal AI #Creative Tool Repurposing #Physical Affordance #Visual Grounding #Direct Preference Optimization (DPO)#Interactive Benchmark

2026년 5월 27일

[논문리뷰] AI Research Agents Narrow Scientific Exploration

본 연구는 AI 연구 에이전트가 과학적 발견의 범위를 실질적으로 확장하는지, 아니면 기존 연구의 주변부에 머무르는지를 규명하는 것을 목적으로 합니다.

#Review #AI Research Agents #Scientific Discovery #Ideation #Citation Analysis #Research Breadth #Bibliographic Coupling

2026년 5월 27일

[논문리뷰] The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

본 논문은 large language model (LLM)이 long-horizon agentic workflow로 전환됨에 따라 발생하는 efficiency 및 cost bottleneck 문제와 intrinsically complex, high-stakes task 해결의 어려움을 다룹니다.

#Review #Mixture-of-Experts (MoE)#Mini Activations #Agentic AI #Self-Evolution #Reinforcement Learning (RL)#Multi-Token Prediction (MTP)

2026년 5월 26일

[논문리뷰] SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

본 논문은 현재 Spatial Foundation Models (SFMs)이 standard dataset에서 인상적인 성능을 보여주지만, 다양한 downstream task, 임의의 viewpoint, 변화하는 scene domain, 다양한 input density, 그리고 특정 hardware constraint에 걸쳐 robust하게 generalizing할 수 있는 all-round player인지에 대한 근본적인…

#Review #Spatial Foundation Models #3D Reconstruction #Benchmark #Domain Generalization #Input Density #Embodied AI

2026년 5월 26일

[논문리뷰] Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

본 논문은 Long-horizon Video-to-Video Generation의 핵심 과제인 Long Cinematic Video Remaking 문제를 해결하고자 합니다.

#Review #Long-Video Remaking #Multi-Agent System #Dual-Bridge Consistency #Character Identity #Narrative Fidelity #Video-to-Video Generation

2026년 5월 26일

[논문리뷰] Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

기존 병렬 Test-Time Scaling (TTS) 방법론은 Information Isolation Bottleneck이라는 중요한 한계점을 가지고 있습니다.

#Review #Test-Time Scaling #Collaborative Parallel Thinking #Large Language Models #Information Sharing #Redundant Exploration #Accuracy-Latency Pareto Frontier #Mathematical Reasoning

2026년 5월 26일

[논문리뷰] MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

모바일 GUI Agent 연구는 빠른 발전을 보였지만, 현재 평가 및 훈련 환경은 근본적인 Trade-off 문제에 직면해 있다.

#Review #Mobile GUI Agent #Simulation Environment #Reinforcement Learning #Verifiable Outcome Signals #Interaction Fidelity #MobileGym-Bench #Sim-to-Real Transfer

2026년 5월 26일

[논문리뷰] LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

본 논문은 기존 Audio-Visual Generation 벤치마크가 Minute-Scale Content의 평가 요구사항을 충족하지 못하는 문제를 해결하고자 한다.

#Review #Audio-Visual Generation #Long Video Generation #Evaluation #Benchmark #T2AV #I2AV #V2AV #MLLM-assisted assessment

2026년 5월 26일

[논문리뷰] LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

The End of the content of the urls browsed.

2026년 5월 26일

[논문리뷰] Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

본 논문은 Degraded Input Condition 하에서 Multi-view 3D Reconstruction의 Robustness를 향상시키기 위해 Geometry-Aware Representation Denoising (GARD) 프레임워크를 제안한다.

#Review #Multi-view 3D Reconstruction #Image Restoration #Representation Denoising #Diffusion Models #Geometry-Aware Features #Feed-Forward Models #Camera Pose Estimation

2026년 5월 26일

[논문리뷰] EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

본 연구는 generative video foundation models의 빠른 발전으로 professional-grade cinematic synthesis에 대한 수요가 증가함에 따라, Reinforcement Learning (RL) 및 agentic workflows로의 전환에 필요한 신뢰할 수 있는 평가의 bottleneck 문제를 해결하고자 한다.

#Review #Video Generation #Benchmarking #Cinematic Quality #VLM #Chain-of-Thought #Human-Machine Alignment #Evaluation Framework #Reinforcement Learning

2026년 5월 26일