최신 포스트

[cpython] CPython의 PyCriticalSection2 최적화: 중복 락 획득 방지

CPython의 PyCriticalSection2에서 이미 획득한 락을 재귀적으로 다시 획득하지 않도록 최적화하여 성능을 개선했습니다.

#CPython #Concurrency #Optimization #Locking #Internals

2026년 6월 19일

[cpython] CPython 3.14: PyCriticalSection2의 동일 락 재획득 방지 최적화 분석

CPython 3.14에서 PyCriticalSection2의 동일 락 재획득 방지 최적화 분석 및 그 의미를 살펴봅니다.

#Python #CPython #Optimization #Concurrency #Critical Section

2026년 6월 19일

[triton] Triton Autotuner 최적화: Pruned Config가 하나일 때 불필요한 벤치마크 생략하기

Triton Autotuner에서 설정이 하나로 압축될 경우, 불필요한 벤치마킹 과정을 건너뛰어 성능을 개선한 사례를 분석합니다.

#Triton #Autotuner #Performance #Optimization #Compiler

2026년 6월 18일

[ray] Ray RLlib의 비동기 학습 성능 최적화: PULL 기반 EnvRunnerStateServer 도입

RLlib의 비동기 알고리즘(IMPALA, APPO)에서 가중치 동기화 방식을 PUSH에서 PULL 모델로 전환하여 오프폴리시 지연을 20% 개선했습니다.

#Ray #RLlib #ReinforcementLearning #DistributedSystems #PerformanceOptimization

2026년 6월 18일

[vllm] vLLM Mooncake KV 오프로딩 최적화: 불필요한 KV 조회 건너뛰기

vLLM의 Mooncake KV 오프로딩 성능 향상: 불필요한 KV 조회 건너뛰고 스토리지 오버헤드 감소

#vLLM #LLM #KV Cache #Optimization #Performance

2026년 6월 18일

[sglang] Mamba GDN의 컨볼루션 캐시 최적화: 메모리 사용량 절반으로 줄이기

Mamba 및 GDN 모델에서 컨볼루션 캐시 메모리 사용량을 절반으로 줄이는 최적화 기법을 소개합니다.

#Mamba #GDN #최적화 #메모리 관리 #SGLang

2026년 6월 18일

[sglang] SGLang의 Linear-Attention 성능 최적화: int8 체크포인트 풀 도입

Linear-attention 모델의 Radix 캐시 효율을 int8 양자화로 2배 높여, 메모리 제약 없이 더 많은 프리픽스를 재사용하는 최적화 기법.

#SGLang #Linear-Attention #Optimization #Quantization #LLM

2026년 6월 18일

[논문리뷰] Understanding the Behaviors of Environment-aware Information Retrieval

본 논문은 다양한 Retriever 환경에서 LLM이 범용적인 쿼리 방식만을 사용하는 것이 비효율적이라는 문제 의식에서 출발합니다.

#Review #Retrieval-Augmented Generation (RAG)#Reinforcement Learning (RL)#Query Formulation #Retriever-aware #Structural Drift #Branching Rollout #Group Relative Policy Optimization (GRPO)

2026년 6월 18일

[논문리뷰] Thinking with Visual Grounding

본 논문은 기존 VLM(Vision-Language Model)의 추론 과정이 언어적 논리에는 치중되어 있으나, 정작 그 논리의 근거가 되는 이미지 내 특정 영역을 명시하지 않아 검증이 어렵다는 문제를 해결하고자 합니다.

#Review #Visually Grounded Thinking #Vision-Language Models #Reinforcement Learning #Visual Grounding #SAM3 #Spatial Reasoning

2026년 6월 18일

[논문리뷰] Taylor-Calibrate: Principled Initialization for Hybrid Linear Attention Distillation

본 논문은 하이브리드 모델로의 전환 시 발생하는 부적절한 재귀적 파라미터 초기화 문제를 해결하고자 합니다. 기존 연구들은 Transformer의 가중치를 복사하는 데 집중하지만, 새롭게 도입되는 GDN의 동역학(decay, gate 등)을 고려하지 않아 초기 모델이 최적화되지 않은 상태에서 학습을 시작하게 됩니다 .

#Review #Hybrid Linear Attention #Gated DeltaNet #Model Distillation #Initialization #Softmax Attention #Knowledge Distillation #Recurrent Dynamics

2026년 6월 18일

[논문리뷰] Selective Synergistic Learning for Video Object-Centric Learning

본 논문은 기존 VOCL 연구에서 encoder와 decoder 사이의 구조적 비대칭성으로 인해 발생하는 학습 불안정성과 정보 정렬의 비효율성을 해결합니다.

#Review #Video Object-Centric Learning #Selective Distillation #Pseudo-labeling #Transitive Merging #Slot Attention #Encoder-Decoder Alignment

2026년 6월 18일

[논문리뷰] S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

본 논문은 기존 VLM들이 정적인 단일 프레임 관찰에 의존하여 연속적이고 진화하는 3D 환경에서의 공간 추론에 한계를 보인다는 점을 해결하고자 합니다 . 기존 모델들은 파편화된 2D 시각 정보에 의존하기 때문에 공간적 일관성(spatial consistency) 유지와 고도화된 3D 기하학적 이해가 어렵습니다.

#Review #Spatial Intelligence #Vision-Language Models (VLM)#Agentic Paradigm #Spatio-Temporal Reasoning #Tool-Use #Spatial Evidence Accumulation

2026년 6월 18일

[논문리뷰] Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

본 논문은 LLM pretraining에서 FP4 사용 시 관찰되는 훈련 불안정성의 근본 원인으로 E2M1 포맷의 기하학적 결함을 지목합니다. 기존 연구들은 이상치 처리를 위해 RHT를 사용하지만, 이는 텐서의 분포를 비대칭적인 E2M1 빈으로 집중시켜 오히려 양자화 품질을 저하시키는 결과를 초래합니다 .

#Review #FP4 #Shrinkage Bias #E2M1 #E1M2 #Random Hadamard Transform #LLM Pretraining #Quantization

2026년 6월 18일

[논문리뷰] Playful Agentic Robot Learning

본 논문은 기존의 Code-as-Policy 시스템이 외부 명령에 의존하는 Task-driven 방식으로 작동하여, 실제 작업이 주어지기 전에는 재사용 가능한 Skill을 습득하지 못한다는 한계를 해결하고자 한다.

#Review #Learning through Play #Agentic Robotics #Continual Skill Learning #Code-as-Policy #Robot Manipulation

2026년 6월 18일

[논문리뷰] No Resource, No Benchmarks, No Problem? Evaluating and Improving LLMs for Code Generation in No-Resource Languages

본 연구는 LLM의 코드 생성 능력이 학습 데이터가 풍부한 High-Resource 언어에 편중되어, 신생 기업에서 사용하는 No-Resource 언어에 대한 지원이 전무하다는 점을 해결하고자 한다.

#Review #Large Language Models #Code Generation #No-Resource Languages #Benchmark #Fine-Tuning #Pre-training

2026년 6월 18일

[논문리뷰] Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

본 논문은 기존의 코드 생성 평가가 특정 언어에 편향되어 있어 LLM의 실질적인 다국어 코딩 능력을 측정하지 못하는 한계를 해결하고자 한다. LiveCodeBench(LCB)는 지속적인 업데이트와 엄격한 오염 방지 제어를 통해 우수한 성능을 입증했으나, 오직 Python 언어만을 지원한다는 결정적인 단점이 존재한다 .

#Review #Code Generation #Multi-lingual Benchmark #Large Language Models #LiveCodeBench #Contamination-aware #Cross-lingual Evaluation

2026년 6월 18일

[논문리뷰] Moebius: 0.2B Lightweight Image Inpainting Framework with 10B-Level Performance

본 논문은 10B-level industrial foundation model이 제공하는 고품질의 image inpainting 성능을 유지하면서도, 실제 배포가 불가능할 정도로 높은 연산 비용과 메모리 요구량을 해결하고자 합니다.

#Review #Image Inpainting #Diffusion Models #Knowledge Distillation #Model Compression #Latent Space Optimization #Lightweight Architecture #LλMI Block

2026년 6월 18일

[논문리뷰] LooseControlVideo: Directorial Video Control using Spatial Blocking

본 연구는 고품질 비디오 생성 모델에서 사용자 의도를 반영한 정밀한 3D 공간 제어와 복잡한 다중 객체 상호작용의 부재를 해결하고자 합니다.

#Review #Video Generation #Video Editing #Diffusion Transformer #3D-Aware Control #Spatial Blocking #DNOCS #Motion Orchestration

2026년 6월 18일

[논문리뷰] JanusMesh: Fast and Zero-Shot 3D Visual Illusion Generation via Cross-Space Denoising

본 논문은 텍스트 기반의 프롬프트로부터 서로 다른 시점에서 상이한 의미를 갖는 3D Visual Illusion을 효율적으로 생성하는 문제를 해결하고자 한다 .

#Review #3D Visual Illusion #Zero-Shot Generation #Cross-Space Denoising #SDF Blending #View-Conditioned Texture Synthesis #CLIP-guided Orientation Search #Rectified Flow

2026년 6월 18일

[논문리뷰] JAMER: Project-Level Code Framework Dataset and Benchmark on Professional Game Engines

본 논문은 프로페셔널 게임 엔진 환경에서 프로젝트 레벨의 코드 프레임워크를 생성하고 평가하는 AI 기술의 부재를 해결하고자 한다 . 기존 연구들은 주로 단일 파일 생성이나 간단한 게임 로직에 국한되어 있으며, 게임의 복잡한 런타임 행동을 정량적으로 평가할 수 있는 방법론이 부족하였다.

#Review #Game Engine #Code Framework #Software Engineering #Benchmark #Dataset #Godot #Deterministic Evaluation

2026년 6월 18일