최신 포스트

[논문리뷰] ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree

본 논문은 LLM 에이전트 생태계에서 핵심 소프트웨어 단위인 Agent Skills의 보안 문제를 다루며, 서로 다른 보안 스캐너(VirusTotal, Static Analysis, SkillSpector)들이 동일한 스킬에 대해 불일치하는 결과를 보일 때 이를 어떻게 해석하고 대응할 것인지에 대한 문제를 제기한다.

#Review #Agent Skills #LLM Agents #Software Supply Chain #Security Scanning #Scanner Disagreement #Trust Artifacts #OpenClaw

2026년 6월 2일

[논문리뷰] Bootstrap Your Generator: Unpaired Visual Editing with Flow Matching

본 논문은 대규모 paired dataset 없이도 instruction-based visual editing이 가능한 범용 프레임워크인 ByG (Bootstrap Your Generator)를 제안합니다 .

#Review #Flow Matching #Unpaired Editing #Cycle Consistency #Straight-Through Estimation #Gradient Routing #Bootstrap #Visual Editing

2026년 6월 2일

[논문리뷰] Benchmarking Visual State Tracking in Multimodal Video Understanding

본 논문은 최신 Multimodal Large Language Models (MLLMs)가 비디오의 지속적인 역동성을 이해하고 상태를 추적하는 능력, 즉 Visual State Tracking 능력이 결여되어 있다는 점을 지적한다 .

#Review #Multimodal Large Language Models #Video Understanding #Visual State Tracking #Benchmark #Visual Perception #Agentic Frameworks

2026년 6월 2일

[논문리뷰] BA-T: An Iterative Transformer for Two-View Bundle Adjustment

본 연구는 기존의 feed-forward 3D 재구성 모델들이 의존하는 heavy decoder stack의 비효율성과 기하학적 self-correction 메커니즘의 부재를 해결하고자 합니다.

#Review #Bundle Adjustment #Iterative Transformer #Implicit Latent Space #Two-View Reconstruction #Pose Estimation #Geometric Consistency

2026년 6월 2일

[논문리뷰] AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

본 논문은 기존 의료 AI 벤치마크가 End-to-End 연구 과정의 복잡성을 간과하고 최종 결과물 평가에만 치중하여, 에이전트의 행동 특성이나 실패 원인을 파악하기 어렵다는 문제점을 해결하고자 합니다 .

#Review #Medical-AI #Autonomous Agents #Benchmark #Research Automation #Workflow-Aware Evaluation #LLM

2026년 6월 2일

[논문리뷰] Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams

본 논문은 기존의 Auto-Harness 시스템들이 고정된 오프라인 벤치마크에서는 우수한 성능을 보이지만, 실제 Open-Ended Task Streams 환경에서는 성능 저하를 겪는다는 문제를 해결합니다 .

#Review #Agentic System #Auto-Harness #Open-Ended Task Streams #Multi-Agent Evolution #Solve-Time Adaptation #Non-Stationarity #Human-in-the-Loop

2026년 6월 2일

[논문리뷰] AURA: Action-Gated Memory for Robot Policies at Constant VRAM

로봇 에이전트가 끊김 없이 지속적으로 동작하는 환경에서 기존의 Transformer KV-cache 방식은 에피소드 길이에 따라 메모리 요구량이 선형적으로 증가하여 에지 하드웨어의 메모리 대역폭을 심각하게 제한합니다.

#Review #Robot Policies #VRAM #Action-Utility Gate #Fast-Weight Memory #Inference Efficiency #POMDP

2026년 6월 2일

[논문리뷰] A Multi-AI-agent Framework Enabling End-to-end Finite Element Analysis for Solid Mechanics Problems

FEA는 현대 공학의 필수 요소이나, 입문자에게 요구되는 높은 학습 곡선과 복잡한 시뮬레이션 설정 오류로 인해 진입 장벽이 매우 높습니다. 기존의 API 기반 자동화 방식은 고정된 스크립트와 템플릿에 의존하여 설계 변경 시 유연성이 부족하다는 한계를 가집니다.

#Review #AI agent #Finite Element Analysis (FEA)#Large Language Models (LLM)#Multi-agent framework #Retrieval-Augmented Generation (RAG)#Solid mechanics

2026년 6월 2일

[논문리뷰] A Local Perturbation Theory for Cross-Domain Interference and Recovery in Multi-Domain RL

본 논문은 순차적 Multi-Domain RL에서 발생하는 선택적 성능 저하 메커니즘을 규명하고 이를 해결하기 위한 이론적 토대를 구축한다. 기존 연구들은 이를 catastrophic forgetting 또는 global gradient conflict로 설명하려 했으나, 실제 실험 결과는 이러한 설명들과 불일치한다 .

#Review #Multi-Domain RL #Cross-Domain Interference #Local Perturbation Theory #Gradient Conflict #Domain Refresh #Second-Order Damage #Active Routes

2026년 6월 2일

[sglang] SGLang NIXL HiCache 리팩토링 및 O_DIRECT 지원 추가: 성능 향상과 안정성 강화

SGLang의 NIXL HiCache 커넥터 리팩토링 및 O_DIRECT 지원 추가로 I/O 성능 향상 및 안정성 개선.

#SGLang #NIXL #HiCache #O_DIRECT #성능 최적화 #KV Cache

2026년 6월 1일

[vllm] vLLM의 FP8 Scaled MM 최적화: Padding 제거를 통한 20% 성능 향상

vLLM에서 FP8 행렬 곱셈 시 불필요한 Padding을 제거하여 커널 성능을 약 20% 개선한 사례를 분석합니다.

#vLLM #CUDA #Optimization #FP8 #DeepLearning

2026년 6월 1일

[vllm] [vLLM 분석] DeepSeek V4의 Sparse FP8 Compressor 커널 최적화: CuteDSL을 통한 성능 극대화

vLLM에서 DeepSeek V4의 KV 캐시 압축 효율을 높이기 위해 CuteDSL 커널을 최적화하여 최대 1.67배의 성능 향상을 달성한 과정을 살펴봅니다.

#vLLM #DeepSeek-V4 #CUDA #CuteDSL #Kernel-Optimization #FP8

2026년 6월 1일

[uv] uv의 로컬 휠(Wheel) 압축 해제 성능 회귀 문제 해결: astral_async_zip 버전 업데이트

astral_async_zip 라이브러리의 버전을 rc4에서 정식 버전으로 업데이트하여 로컬 휠 압축 해제 성능 저하를 해결한 사례를 분석합니다.

#Rust #uv #Performance #Optimization #Packaging

2026년 6월 1일

[논문리뷰] X-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding

본 논문은 기존의 영상 이해 연구가 주로 단일 스트림 기반에 머물러 있어, 실제 환경에서 요구되는 멀티 스트림 간의 협업 및 이해 능력을 평가하지 못한다는 한계를 지적합니다 .

#Review #Multi-Stream Understanding #MLLMs #Multiplexing #Streaming Benchmark #Online Inference #Cross-Stream Reasoning

2026년 6월 1일

[논문리뷰] Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models

본 논문은 Spatial Intelligence를 구축하는 데 있어 VLM과 VGM 중 어느 사전 학습(Pre-training) 패러다임이 더 우수한 표현 체계(Representation substrate)를 제공하는지 분석한다 .

#Review #Spatial Intelligence #Vision-Language Models #Video Generation Models #Frozen-Feature Probing #Representation Learning #Semantic Tagging #3D Geometry Prediction

2026년 6월 1일

[논문리뷰] Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration?

본 논문은 Foundation Models가 수동적인 시각적 이해를 넘어, 능동적인 탐색을 통해 3D 공간에서 목표 시점을 정확히 재현할 수 있는지 질문합니다 . 기존 연구들은 주로 사전에 수집된 데이터에 의존하여 '무엇이 어디에 있는가'를 묻는 정적인 공간 지능에 집중해 왔습니다.

#Review #Target Viewpoint Reproduction #TVRBench #Active Exploration #Foundation Models #Spatial Intelligence #Embodied AI #GRPO #SFT

2026년 6월 1일

[논문리뷰] When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs

본 논문은 다중 에이전트 LLM 워크플로우의 end-to-end 강화학습 시 발생하는 성능 불안정성과 그 원인을 체계적으로 규명하는 것을 목표로 합니다. 기존 연구들은 개별 워크플로우에 특화된 알고리즘을 제안하는 데 그쳤으며, 왜 특정 환경에서 학습이 성공하거나 실패하는지에 대한 근본적인 메커니즘을 설명하지 못했습니다 .

#Review #Multi-Agent RL #LLM Workflows #Reinforcement Learning #Policy-Sharing #Gradient Dynamics #Role Drift

2026년 6월 1일

[논문리뷰] VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

본 논문은 autoregressive 영상 확산 모델에서 streaming 생성 시 발생하는 방대한 KV 캐시 메모리 비용 문제를 해결하고자 합니다.

#Review #Video Diffusion #Multi-Head Latent Attention #KV Cache #Autoregressive Generation #Low-Rank Latent #Streaming Video #3D-RoPE

2026년 6월 1일

[논문리뷰] VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization

본 연구는 기존의 'Reasoning with Video' 패러다임에서 VGM들이 높은 시각적 품질에도 불구하고 논리적 추론이나 특정 규칙 준수에서 시스템적인 한계를 보인다는 문제에 주목합니다 .

#Review #Video Generation Models #Video Reasoning #Vision-Language Models #Test-Time Optimization #LoRA #Differentiable Rewards

2026년 6월 1일

[논문리뷰] Unified Neural Scaling Laws

본 논문은 기존의 Neural Scaling Laws가 가진 예측 한계를 극복하고, 다차원적인 변수가 동시에 변화하는 복잡한 환경에서 모델 성능을 정확히 예측하는 문제를 해결합니다.

#Review #Neural Scaling Laws #Multivariate Scaling #Functional Form #Extrapolation #Deep Learning #Model Performance #Hyperparameter Optimization

2026년 6월 1일