Latest Posts

[Paper Review] Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

This paper aims to address inefficient tool use in the agentic reasoning of vision-language models (VLMs).

#Review #Multimodal Agentic Reasoning #Reinforcement Learning #GRPO #AXPO #Tool-call Resampling #Thinking-Acting Gap #Vision-Language Models

May 27, 2026

[Paper Review] AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems

This paper aims to address opaque coordination and the rigidity of fixed pipelines in LLM-based multi-agent systems.

#Review #Multi-Agent Systems #Online Policy Learning #Coordination Substrate #Large Language Models #Task Signatures #Relative Trajectory Evaluation

May 27, 2026

[Paper Review] Advancing Creative Physical Intelligence in Large Multimodal Models

This study starts from the observation that large multimodal models (LMMs), despite major advances in perception and reasoning, still lack the physical intelligence to creatively repurpose objects in unusual situations.

#Review #Multimodal AI #Creative Tool Repurposing #Physical Affordance #Visual Grounding #Direct Preference Optimization (DPO) #Interactive Benchmark

May 27, 2026

[Paper Review] AI Research Agents Narrow Scientific Exploration

This study aims to determine whether AI research agents meaningfully broaden the scope of scientific discovery or stay in the vicinity of existing work.

#Review #AI Research Agents #Scientific Discovery #Ideation #Citation Analysis #Research Breadth #Bibliographic Coupling

May 27, 2026

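[openclaw] Node.js Audio Codec Performance Optimization: Efficient PCM Processing with TypedArray

An analysis of a PR that optimizes PCM data processing by using TypedArray on the hot path of the Node.js audio codec.

#Node.js #Performance Optimization #Audio Codec #TypedArray #Buffer

May 26, 2026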

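[sglang] The Pitfalls of Performance Optimization: Analyzing SGLang's Emergency Rollback to Prevent a DeepSeek-V3.2 Accuracy Collapse

An analysis of why an optimization that skips Softmax in the EAGLE draft model cut the accuracy of the DeepSeek-V3.2 MTP model by as much as 96%, and how the issue was resolved.

#SGLang #Speculative Decoding #DeepSeek-V3 #Performance Optimization #LLM Inference

May 26, 2026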

[vllm] vLLM Optimizes the GDN Prefill Kernel with CuteDSL for Better Performance

vLLM introduces a new CuteDSL-based kernel for the GDN Prefill operation, substantially improving performance.

#vLLM #GDN #CuteDSL #Optimization #Performance #LLM

May 26, 2026

[Paper Review] The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

This paper addresses the efficiency and cost bottlenecks that arise as large language models (LLMs) shift toward long-horizon agentic workflows, along with the difficulty of solving intrinsically complex, high-stakes tasks.

#Review #Mixture-of-Experts (MoE) #Mini Activations #Agentic AI #Self-Evolution #Reinforcement Learning (RL) #Multi-Token Prediction (MTP)

May 26, 2026

[Paper Review] SpatialBench: Is Your Spatial Foundation Model an All-Round Player?

Current Spatial Foundation Models (SFMs) show impressive performance on standard datasets, but whether they are all-round players that can generalize robustly across diverse downstream tasks, arbitrary viewpoints, changing scene domains, varying input densities, and specific hardware constraints is a fundamental…

#Review #Spatial Foundation Models #3D Reconstruction #Benchmark #Domain Generalization #Input Density #Embodied AI

May 26, 2026

[Paper Review] Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

This paper aims to address Long Cinematic Video Remaking, a core challenge in long-horizon video-to-video generation.

#Review #Long-Video Remaking #Multi-Agent System #Dual-Bridge Consistency #Character Identity #Narrative Fidelity #Video-to-Video Generation

May 26, 2026

[Paper Review] Share More, Search Less: Collaborative Parallel Thinking for Efficient Test-Time Scaling

Existing parallel Test-Time Scaling (TTS) methods share a significant limitation: the Information Isolation Bottleneck.

#Review #Test-Time Scaling #Collaborative Parallel Thinking #Large Language Models #Information Sharing #Redundant Exploration #Accuracy-Latency Pareto Frontier #Mathematical Reasoning

May 26, 2026

[Paper Review] MobileGym: A Verifiable and Highly Parallel Simulation Platform for Mobile GUI Agent Research

Research on mobile GUI agents has advanced rapidly, but current evaluation and training environments face a fundamental trade-off.

#Review #Mobile GUI Agent #Simulation Environment #Reinforcement Learning #Verifiable Outcome Signals #Interaction Fidelity #MobileGym-Bench #Sim-to-Real Transfer

May 26, 2026

[Paper Review] LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

This paper aims to address the failure of existing audio-visual generation benchmarks to meet the evaluation requirements of minute-scale content.

#Review #Audio-Visual Generation #Long Video Generation #Evaluation #Benchmark #T2AV #I2AV #V2AV #MLLM-assisted assessment

May 26, 2026

[Paper Review] LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

May 26, 2026

[Paper Review] Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

This paper proposes the Geometry-Aware Representation Denoising (GARD) framework to improve the robustness of multi-view 3D reconstruction under degraded input conditions.

#Review #Multi-view 3D Reconstruction #Image Restoration #Representation Denoising #Diffusion Models #Geometry-Aware Features #Feed-Forward Models #Camera Pose Estimation

May 26, 2026

[Paper Review] EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

As rapid progress in generative video foundation models drives demand for professional-grade cinematic synthesis, this study aims to address the bottleneck of reliable evaluation needed for the shift to Reinforcement Learning (RL) and agentic workflows.

#Review #Video Generation #Benchmarking #Cinematic Quality #VLM #Chain-of-Thought #Human-Machine Alignment #Evaluation Framework #Reinforcement Learning

May 26, 2026

[Paper Review] D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing

This paper argues that safety monitoring for diffusion LLMs (D-LLMs) remains underexplored and that effective defense mechanisms are needed as the potential for misuse of D-LLMs grows.

#Review #Diffusion LLMs #Safety Monitoring #Hesitation-Aware Routing #Probe-based Monitors #Multi-step Trajectory #Sample Difficulty #Efficiency-effectiveness Tradeoff #Adversarial Inputs

May 26, 2026

[sglang] SGLang EAGLE Decoding Optimization: Better Performance by Removing Unnecessary Softmax Operations

SGLang EAGLE decoding was made faster by removing the unnecessary Softmax operation when topk=1.

#SGLang #EAGLE #Speculative Decoding #Performance Optimization #Softmax #Top-k Sampling

May 25, 2026

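[cpython] Resolving the Performance-Profiling Crash After os.fork in Python: Fix and Optimization Analysis

An analysis of the fix for CPython's performance-profiling crash after os.fork and of the accompanying optimization that improves code reuse.

#Python #CPython #Performance #Optimization #fork #Profiling

May 25, 2026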

[sglang] SGLang Diffusion Optimization: 20% Faster Inference with CFG Gating

CFG Gating, a technique that cuts unnecessary classifier-free guidance (CFG) computation, was introduced and improved denoising-stage performance by 25%.

#SGLang #Diffusion #Optimization #LLM #Inference

May 25, 2026