Latest Posts

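[Paper Review] TheoremGraph: Bridging Formal and Informal Mathematics

Modern mathematical research is vast and fragmented, which makes the dependency structure among mathematical results hard to see clearly. The authors point out a limitation: informal literature (such as arXiv) relies mainly on document-level citations, whereas formal libraries (such as Lean) track fine-grained dependencies only within a very limited scope.

#Review #Formal-Informal Mathematics #Dependency Graph #LeanGraph #Neural Theorem Proving #Cross-modal Retrieval #Autoformalization

June 29, 2026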

[Paper Review] The Surprising Effectiveness of Video Diffusion Models for Hand Motion Reconstruction

This paper aims to resolve a serious bottleneck facing existing egocentric 4D hand motion reconstruction methods. Existing approaches either rely on image-based detectors or use temporal modules trained on limited data, so their performance degrades under heavy occlusion.

#Review #Video Diffusion Models #Hand Motion Reconstruction #Egocentric Video #4D Reconstruction #Embodied AI #Occlusion Reasoning

June 29, 2026

[Paper Review] TUA-Bench: A Benchmark for General-Purpose Terminal-Use Agents

This paper starts from the observation that existing computer-use benchmarks are biased toward GUI environments or specific domains (mostly coding), which limits their ability to evaluate general-purpose agent capabilities in ordinary terminal environments.

#Review #Terminal-Use Agents #General-Purpose Benchmark #Command-Line Interface #Execution-Grounded Evaluation #Scientific Workflows

June 29, 2026

[Paper Review] TACO: Tool-Augmented Credit Optimization for Agentic Tool Use

To address unnecessary or misleading tool calls by agents, this paper proposes an optimization framework that precisely evaluates the contribution of the tool call itself.

#Review #Agentic Tool Use #Reinforcement Learning #Multimodal Models #Credit Assignment #Tool-Augmented Credit Optimization #GRPO #Differential Answer-Probe Reward

June 29, 2026

[Paper Review] Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent

This paper proposes extending the agent horizon to address the high cost and reproducibility problems of the conventional parameter-scaling strategy for improving agent model performance.

#Review #Agents-A1 #Long-Horizon #Knowledge-Action Graph #Mixture-of-Experts #On-Policy Distillation #Salient Vocabulary Alignment

June 29, 2026

[Paper Review] SafePyramid: A Hierarchical Benchmark for In-context Policy Guardrailing

This paper aims to address the problem that guardrails relying on a fixed risk taxonomy fail to meet the varying requirements of real applications.

#Review #In-context Policy Guardrailing #Safety Benchmark #Hierarchical Evaluation #LLM Safety #Rule Dependency #Policy Framework

June 29, 2026

[Paper Review] ReasoningLens: Hierarchical Visualization and Diagnostic Auditing for Large Reasoning Models

This work aims to address the "transparency burden" caused by the excessively long Chain-of-Thought reasoning traces that large reasoning models (LRMs) generate.

#Review #Large Reasoning Models #Chain-of-Thought #Diagnostic Auditing #Hierarchical Visualization #Agentic Diagnosis #Systemic Profiling

June 29, 2026

[Paper Review] ReFreeKV: Towards Threshold-Free KV Cache Compression

This paper points out that existing KV cache pruning work depends too heavily on budget threshold settings tied to specific datasets or domains, which limits its ability to handle the variable inputs of real-world settings.

#Review #KV Cache Compression #Threshold-Free #Large Language Models #Attention Sparsity #Inference Efficiency #Dynamic Budgeting

June 29, 2026

[Paper Review] RaysUp: Ultra-light Universal Feature Upsampling via Geometry-Aware Ray Representation

This paper proposes RaysUp to address the lack of high-resolution information in vision foundation models (VFMs), which are central to modern computer vision. Existing feature upsampling methods are limited in generality and efficiency: they either rely on a fixed 2D neighborhood or are tied to a specific model and require retraining.

#Review #Feature Upsampling #Vision Foundation Models #Ray Representation #Geometry-Aware #Cross-Attention #3D Geometric Priors

June 29, 2026

[Paper Review] PoseShield: Neural Collision Fields for Human Self-Collision Resolution

This paper aims to resolve the persistent self-collision problem in SMPL-based human pose estimation and motion generation models.

#Review #SMPL #Self-Collision #Eikonal Equation #Neural Collision Field #Constrained Optimization #Motion Synthesis #Pose Space

June 29, 2026

[Paper Review] PolicyGuard: A Dialogue-Grounded Sub-Agent Verifier for Policy Adherence in LLM Agents

This paper points out that existing safeguarding techniques focus mainly on blocking malicious content and jailbreaks, which limits their ability to address the complex procedural policy adherence required of agents.

#Review #LLM Agents #Policy Adherence #Dialogue-Grounded #Verifier #Tool-Calling #Safeguarding #Procedural Compliance

June 29, 2026

[Paper Review] One Scene, Two Depths: Probing Geometric Ambiguity in Monocular Foundation Models

This paper aims to address a fundamental limitation of modern monocular depth foundation models, the single-layer constraint: multi-layer geometric structures, such as transparent scenes, must be represented with a single scalar depth.

#Review #Monocular Depth Estimation #Geometric Ambiguity #Laplacian Visual Prompting #Foundation Models #Ordinal Benchmark #Layered Geometry

June 29, 2026

[Paper Review] One Forward Beats Two: InnerZoom for Accurate and Efficient GUI Grounding

This paper aims to address the inefficiency and accuracy degradation seen in MLLM-based GUI grounding. Existing ZoomIn-style methods improve accuracy by cropping the target region externally and running inference twice (two-pass), but this increases latency and computational cost.

#Review #GUI Grounding #MLLM #Cross-Layer Evidence #Coordinate Generation #InnerZoom #Efficient Inference #Region-to-Point Gap

June 29, 2026

[Paper Review] OSWorld2.0: Benchmarking Computer Use Agents on Long-Horizon Real-World Tasks

This paper points out that existing computer-use benchmarks consist mostly of overly short-horizon, simple tasks, which limits their ability to evaluate complex long-horizon work in real professional settings.

#Review #Computer-Use Agents #Long-Horizon Tasks #Benchmark #Multimodal Agents #Reasoning #Task-Level Planning #Autonomous Agents

June 29, 2026

[Paper Review] Nemotron-Labs-Diffusion-Image: Advancing Masked Discrete Diffusion for High-Resolution Image Synthesis

This paper proposes Nemotron-Labs-Diffusion-Image (NLD-Image) to address two weaknesses of existing masked discrete diffusion models: the lack of self-correction ability and the difficulty of training large codebooks.

#Review #Masked Discrete Diffusion #Text-to-Image Synthesis #Token Editing #Grouped Cross-Entropy #Codebook Sparsity #Self-Correction #High-Resolution Generation

June 29, 2026

[Paper Review] Monte Carlo Energy Aggregation for Mobile 3D Gaussian Splatting

This paper aims to address the high inference and storage costs that arise when deploying 3D Gaussian Splatting (3DGS) on mobile platforms.

#Review #3D Gaussian Splatting #Mobile Rendering #Monte Carlo Specular Energy Aggregator #Spherical Harmonics #Multi-view Densification #Real-time Rendering

June 29, 2026

[Paper Review] MIMFlow: Integrating Masked Image Modeling with Normalizing Flows for End-to-End Image Generation

This paper aims to address the problem that the strict invertibility of Normalizing Flows (NFs) makes them spend excessive model capacity on low-level pixel detail, which limits their ability to capture high-level semantic structure.

#Review #Normalizing Flows #Masked Image Modeling #End-to-End Generation #Variational Inference #Latent Representation #Token Bottleneck

June 29, 2026

[Paper Review] LiveEdit: Towards Real-Time Diffusion-Based Streaming Video Editing

This paper aims to address the attention distribution shift and spatial-temporal token redundancy that arise in real-time streaming video editing.

#Review #Streaming Video Editing #Diffusion Models #Distillation #Real-Time Inference #Attention Distribution #Mask Cache #Autoregressive Generation

June 29, 2026

[Paper Review] Learning Transferable Dynamics Priors from Action to World Modeling

This paper aims to learn general-purpose dynamics priors from large-scale robot data and use them to improve both the simulator and policy performance in robot learning.

#Review #Robot Learning #World Modeling #Diffusion Models #Dynamics Priors #Action-Conditioned #Policy Evaluation #Sim-to-Real

June 29, 2026

[Paper Review] Large-Scale Tunnel Air-Ground Collaboration With FLISP: Fast LiDAR-IMU Synchronized Path Planner

Inspection of large infrastructure, such as large-scale hydropower tunnels, currently relies on manual work, which is very dangerous and inefficient. Existing map-based multi-robot systems struggle to operate reliably in such long tunnel environments because of SLAM drift and computational load.

#Review #Path Planning #LiDAR-IMU #Air-Ground Collaboration #Tunnel Inspection #Mapless #Heterogeneous Multi-Robot #Obstacle Avoidance

June 29, 2026