최신 포스트

[논문리뷰] CogOmniControl: Reasoning-Driven Controllable Video Generation via Creative Intent Cognition

본 연구는 기존 비디오 생성 모델들이 사용자의 창의적 의도를 정확히 해석하지 못하고, 제어 가능성(Controllability)이 제한적이라는 문제 해결을 목표로 합니다. 기존 모델들은 단순한 텍스트-비디오 매핑에 의존하여 복잡한 물리적 제약이나 구체적인 카메라 움직임을 구현하는 데 한계를 보입니다.

#Review #Video Generation #Controllable Generation #Reasoning-Driven #Cognitive Intent #Multimodal Understanding #Latent Diffusion Models

2026년 5월 19일

[논문리뷰] Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

본 연구는 SLMs의 제한된 추론 능력을 극복하기 위해 코드 실행 기반의 구조화된 추론 환경을 도입하는 것을 핵심 목표로 합니다. 기존의 Chain-of-Thought (CoT) 기법은 복잡한 다단계 추론 과정에서 Hallucination이나 논리적 비약이 발생하기 쉽다는 한계가 존재합니다.

#Review #Small Language Models #Chain-of-Thought #Executable Scaffolds #MCQA #Code-Guided Reasoning #Symbolic Execution

2026년 5월 19일

[논문리뷰] CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization

본 논문은 RLVR 환경에서 기존 정책 최적화 방식들이 겪는 불균일한 credit assignment 문제를 해결하기 위해 CEPO를 제안합니다. 기존의 GRPO와 같은 방식은 전체 시퀀스에 동일한 보상을 부여하여 결정적 추론 단계와 단순 서술 토큰을 구분하지 못하는 한계가 있습니다.

#Review #RLVR #Credit Assignment #Self-Distillation #Contrastive Learning #Policy Optimization #Information Leakage

2026년 5월 19일

[논문리뷰] AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

본 논문은 기존의 자동화된 과학 연구 시스템들이 연구의 반복적이고 비선형적인 특성을 제대로 모델링하지 못하는 한계를 해결하기 위해 제안되었습니다. 현재의 시스템들은 주로 단일 에이전트의 선형 파이프라인에 의존하며, 실험 실패 시 모든 진행 상황을 포기하고, 이전 실행으로부터 학습하지 못하는 치명적인 단점이 있습니다 .

#Review #Autonomous Research #Multi-Agent Debate #Self-Healing Execution #Human-in-the-Loop #Scientific Integrity #Cross-Run Evolution #ARC-Bench

2026년 5월 19일

[논문리뷰] Aurora: Unified Video Editing with a Tool-Using Agent

본 논문은 현대의 통합형 비디오 편집 모델들이 모델이 처리할 수 있는 형식의 입력(model-ready input)을 전제로 설계되어 있어, 실제 사용자의 불완전한 자연어 요청을 처리하는 데 한계가 있다는 문제에서 출발합니다.

#Review #Video Editing #Tool-Using Agent #Unified Diffusion Transformer #Visual Underspecification #Instruction Following

2026년 5월 19일

[논문리뷰] Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

본 연구는 빠르게 발전하는 Video Generation 모델들의 품질을 정밀하게 평가하기 위한 표준화된 도구가 부족하다는 점을 해결하고자 한다. 현재의 Video Generation 모델들은 뛰어난 시각적 결과물을 제공하지만, 여전히 고유한 형태의 시각적 오류인 아티팩트를 빈번하게 발생시킨다.

#Review #Multimodal Large Language Models #AI-Generated Videos #Artifact Detection #Video Quality Assessment #Benchmarking

2026년 5월 19일

[논문리뷰] Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

본 논문은 LLM의 추론 능력을 강화하기 위한 on-policy self-distillation 기법이 수학적 추론 과제에서 일관된 성능 향상을 보이지 못하는 문제를 해결합니다.

#Review #Reinforcement Learning #Self-Distillation #Reasoning #Pointwise Mutual Information #LLM #GRPO #Jensen-Shannon Divergence

2026년 5월 19일

[논문리뷰] Active Learners as Efficient PRP Rerankers

본 요청은 논문 분석을 위해 제공해주신 URL(https://arxiv.org/html/2605.14236)에 접근을 시도하였으나, 네트워크 오류로 인해 논문 본문 내용을 직접 추출할 수 없었습니다.

2026년 5월 19일

[cpython] CPython 성능 최적화: 임시 리스트를 튜플로 변환할 때의 '아이템 스틸' 기법

CPython 3.14에서 도입된 INTRINSIC_LIST_TO_TUPLE 최적화를 통해 불필요한 메모리 복사를 제거하고 성능을 8%까지 끌어올린 과정을 살펴봅니다.

#CPython #Python Internals #Optimization #Performance #C-API

2026년 5월 18일

[loki] Grafana Loki: Range Aggregation 성능 최적화와 메모리 할당 감소

overlapping window 시나리오에서 불필요한 메모리 할당을 제거하여 성능을 39% 향상시킨 사례 분석

#Golang #Grafana Loki #Performance #Optimization #Memory Management

2026년 5월 18일

[vllm] vLLM Qwen3.5 GDN 최적화: `einops.rearrange`를 `torch.flatten`으로 교체하여 20배 성능 향상!

vLLM에서 Qwen3.5 GDN 레이어의 `einops.rearrange`를 `torch.flatten`으로 교체하여 Python 오버헤드를 줄이고 최대 21배의 속도 향상을 달성한 최적화 사례.

#vLLM #PyTorch #Optimization #Performance #DeepLearning #Qwen3.5 #einops #flatten

2026년 5월 18일

[transformers] Hugging Face Transformers: Continuous Batching에 Tensor Parallelism 도입하기

Continuous Batching 환경에서 Tensor Parallelism을 지원하여 대규모 언어 모델의 추론 성능을 극대화하는 최적화 기법 분석.

#HuggingFace #Transformers #TensorParallelism #ContinuousBatching #LLM

2026년 5월 18일

[sglang] DeepSeekV4 Fused MoE Triton 커널 지원 추가: 성능 최적화 분석

DeepSeekV4 모델의 Fused MoE Triton 커널 지원을 추가하여 추론 성능을 향상시킨 PR 분석

#AI #LLM #Optimization #Triton #DeepSeekV4 #MoE

2026년 5월 18일

[논문리뷰] Where Should Diffusion Enter a Language Model? Geometry-Guided Hidden-State Replacement

본 논문은 기존의 연속적 diffusion 언어 모델이 오토레그레시브 Transformer보다 성능이 뒤처지는 문제를 해결하고자 한다. 기존 연구들은 주로 토큰 수준의 확산이나 복잡한 continuous-to-discrete recovery 과정에서 발생하는 오차를 한계로 지적한다.

#Review #Diffusion-Transformer Hybrid #Hidden-State Reconstruction #Geometry-Guided #Diffusion-Friendly #Representation Geometry #Locate-and-Replace

2026년 5월 18일

[논문리뷰] VideoSeeker: Incentivizing Instance-level Video Understanding via Native Agentic Tool Invocation

본 논문은 기존의 LLM 기반 비디오 이해 모델들이 겪는 공간적·시간적 참조의 모호성 문제를 해결하기 위해 VideoSeeker를 제안한다.

#Review #Large Vision-Language Models #Instance-level Video Understanding #Visual Prompts #Agentic Tool Invocation #Reinforcement Learning #Data Synthesis Pipeline

2026년 5월 18일

[논문리뷰] Targeted Neuron Modulation via Contrastive Pair Search

LLM이 유해한 요청을 거부하도록 Instruction-tuning되지만, 이러한 Safety behavior의 Mechanistic basis는 여전히 불분명하다.

#Review #Neuron Modulation #Contrastive Neuron Attribution #Refusal Mechanisms #Alignment Fine-tuning #Mechanistic Interpretability #Behavioral Steering #MLP Neurons

2026년 5월 18일

[논문리뷰] TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents

본 논문은 실세계의 복잡한 전문 워크플로우를 수행하는 Agent의 능력과 이를 평가하는 기존 벤치마크 사이의 격차를 해소하고자 합니다.

#Review #Agentic AI #Omni-modal #Tool-using Agents #Model Context Protocol #Closed-loop Verification #Benchmark

2026년 5월 18일

[논문리뷰] Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models

본 논문은 LRM이 복잡한 문제 해결 과정에서 정답을 찾은 후에도 반복적인 검증이나 재구성을 수행하며 자원을 낭비하는 Overthinking 문제를 해결하고자 합니다 .

#Review #Large Reasoning Models #Early Exit #Chain of Thought #Semantic Redundancy #Inference Efficiency #Answer Verification

2026년 5월 18일

[논문리뷰] StableVLA: Towards Robust Vision-Language-Action Models without Extra Data

본 논문은 기존 VLA 모델들이 훈련 데이터에 포함되지 않은 실세계의 다양한 시각적 노이즈(센서 노이즈, 모션 블러 등)에 매우 취약하다는 점을 지적합니다. 현재의 VLA 모델은 주로 깨끗한 환경에서만 평가되며, 실제 배포 시 시각적 왜곡이 발생하면 성능이 급격히 저하되는 'robustness gap'을 보입니다.

#Review #Vision-Language-Action Models #Information Bottleneck #Robustness #Modality Alignment #Embodied AI #Adapter Design

2026년 5월 18일

[논문리뷰] SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution

본 연구는 대규모 오픈소스 Skill 생태계의 비정형성, 중복성, 품질 불균형 문제를 해결하고 에이전트의 효율적인 경험 재사용을 가능하게 하는 체계적인 거버넌스 프레임워크를 제안합니다.

#Review #LLM Agents #Agent Skills #Lifecycle Governance #Skill Recommendation #Attribution #Skill Evolution

2026년 5월 18일