최신 포스트

[논문리뷰] End-to-End Context Compression at Scale

본 연구는 긴 문맥(long-context) 처리가 LLM의 핵심 역량임에도 불구하고, 기하급수적으로 증가하는 KV Cache 메모리 점유율과 이로 인한 추론 속도 저하 문제를 해결하고자 합니다.

#Review #Context Compression #KV Cache #Latent Context Language Models #Encoder-Decoder #End-to-End Training #Model Efficiency

2026년 6월 8일

[논문리뷰] EmpiriGraph-Psy: A Dataset and LLM Pipeline for Extracting Empirical Relation Graphs from Psychology Abstracts

본 논문은 심리학과 같은 변수 지향적(Variable-oriented) 학문 분야의 과학적 지식을 구조화하기 위해 EmpiriGraph-Psy를 제안합니다.

#Review #Scientific Relation Extraction #Knowledge Graphs #Psychology #LLM Pipeline #Empirical Research #Variable Normalization

2026년 6월 8일

[논문리뷰] Echo-Memory: A Controlled Study of Memory in Action World Models

본 논문은 Action World Models에서 발생하는 근본적인 Memory 실패 문제를 해결하기 위해 연구를 시작했다 . 기존의 연구들은 서로 다른 Backbone, Training recipe, Evaluation protocol을 사용하여 메모리 성능을 정확하게 비교하는 것이 불가능했습니다.

#Review #Action World Models #Video Diffusion #Memory Mechanism #Open-domain Return #Replay Consistency #State-Space Memory #Context Compression

2026년 6월 8일

[논문리뷰] EMMA: Extracting Multiple physical parameters from Multimodal Data

본 연구는 실제 환경에서 작동하는 자율 주행 플랫폼이나 드론과 같은 시스템의 물리적 파라미터를 파편화된 멀티모달 데이터로부터 정교하게 추정하는 문제를 해결합니다.

#Review #Multimodal Data #Physical Parameter Extraction #Liquid Time-Constant Network #Physics-Informed #Digital Twin #Implicit Dynamics #Forced Dynamical Systems

2026년 6월 8일

[논문리뷰] DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning

본 논문은 기존의 Deep Research(DR) 시스템들이 직면한 4가지 핵심적인 한계점을 해결하고자 합니다. 첫째, 불충분하게 정의된 연구 범위 속에서 긴 호흡의 계획을 수행할 때 발생하는 복잡성 문제입니다. 둘째, 단일 에이전트 환경에서 하위 작업의 분해 및 스케줄링 과정 중 발생하는 오류 전파의 위험입니다.

#Review #Deep Research #Multi-Agent System #Graph-Based Dynamic Planning #Recursive Execution #Rubric-Grounded Reasoning #Auditability #Test-Time Optimization

2026년 6월 8일

[논문리뷰] DEI: Diversity in Evolutionary Inference for Quality-Diversity Search

본 논문은 기존의 병렬 LLM 기반 탐색이 컴퓨팅 자원의 확장에만 초점을 맞출 뿐, 모델의 인지적 다양성을 간과하고 있다는 문제를 해결하고자 합니다.

#Review #Quality-Diversity Search #Large Language Models #Evolutionary Algorithms #Digital Red Queen #Heterogeneous Ensemble #Distributed Optimization

2026년 6월 8일

[논문리뷰] Cosine Misleads: Auxiliary Losses Reshape Vision Language Models, Not Their Latents

본 논문은 LVR 프레임워크에서 latent와 타깃 간의 정렬 지표인 Cosine 유사도가 모델의 성능을 제대로 반영하지 못하는 '오도(Misleading)' 현상을 해결하고자 한다 .

#Review #Vision-Language Models #Latent Visual Reasoning #Information Bottleneck #Linear Probing #Auxiliary Loss #Faithfulness #Diagnostic

2026년 6월 8일

[논문리뷰] CoVEBench: Can Video Editing Models Handle Complex Instructions?

본 논문은 기존 비디오 편집 벤치마크들이 단순하고 고립된 편집 작업에만 초점을 맞추어, 실제 사용자의 복잡한 편집 요구사항을 반영하지 못하는 한계를 해결하고자 합니다 .

#Review #Compositional Video Editing #Instruction-guided Editing #Benchmark #Instruction Compliance #Video Fidelity #MLLM-based Evaluation #Fine-grained Diagnostics

2026년 6월 8일

[논문리뷰] Chiaroscuro Attention: Spending Compute in the Dark

본 연구는 표준 Transformer가 모든 토큰에 대해 일관되게 고비용의 O(n²d) self-attention을 적용하는 비효율성을 해결하고자 합니다.

#Review #CHIAR-Former #Spectral Entropy #DCT(Discrete Cosine Transform)#Routing Collapse #Operator Routing #Transformer Efficiency

2026년 6월 8일

[논문리뷰] CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

본 논문은 기존의 Cross-view geo-localization 접근 방식인 이미지 검색(Image Retrieval)과 포즈 추정(Pose Estimation)이 별도의 파이프라인으로 운용되어 발생하는 비효율성을 해결하고자 합니다 .

#Review #Cross-view Geo-localization #Image Retrieval #Pose Estimation #Transformer #Multi-task Learning #Bidirectional Cross-attention

2026년 6월 8일

[논문리뷰] Bayesian-Agent: Posterior-Guided Skill Evolution for LLM Agent Harnesses

본 논문은 기존의 heuristic한 방식이나 단순한 성공/실패 횟수에 의존하는 Agent Skill 업데이트가 비효율적이며, noisy한 편집으로 인해 오히려 성능 저하를 초래할 수 있다는 문제를 해결하고자 한다.

#Review #LLM Agent #Bayesian Evidence #Skill Evolution #SOP #Harness Engineering #Posterior-Guided Optimization

2026년 6월 8일

[논문리뷰] Answer Presence Drives RAG Rewriting Gains

본 논문은 RAG 파이프라인에서 Rewriter 도입으로 얻는 성능 향상이 실제 정답 문자열 노출에 의한 것인지, 혹은 증거 문서의 질적 개선(Curation)에 의한 것인지 규명하고자 합니다.

#Review #Retrieval-Augmented Generation (RAG)#LLM Rewriting #Causal Intervention #Answer-string Surfacing #Sentinel-Fragility #Audit Protocol

2026년 6월 8일

[논문리뷰] AHA-WAM:Asynchronous Horizon-Adaptive World-Action Modeling with Observation-Guided Context Routing

본 논문은 기존 World-Action Model(WAM)이 월드 모델링과 액션 실행을 동일한 시간 해상도로 강제 결합함으로써 발생하는 구조적 비효율 문제를 해결하고자 합니다 .

#Review #Robot Learning #Embodied Manipulation #World-Action Model #Diffusion Transformer #Asynchronous Inference #Horizon-Adaptive #Observation-Guided Context Routing

2026년 6월 8일

[논문리뷰] A Geometric Account of Activation Steering through Angle-Norm Decomposition

기존의 Additive Steering은 단순히 특정 방향의 벡터를 더하는 방식으로, 이는 개념 제어(Angular)와 hidden state의 크기 변화(Radial)를 동시에 발생시켜 제어의 기하학적 의미를 모호하게 만듭니다 .

#Review #Activation Steering #Angle-Norm Decomposition #Representation Engineering #LLM Geometry #Spherical Steering

2026년 6월 8일

[axolotl] ScatterMoE LoRA 최적화: Grouped-Gram 및 Sync-free 역전파 구현

대규모 MoE 모델의 LoRA 학습 시 발생하는 병목을 해결하기 위해 Grouped-Gram 커널과 동기화 없는 역전파 경로를 도입하여 성능을 최대 2.2배 개선했습니다.

#PyTorch #Triton #MoE #LoRA #PerformanceOptimization

2026년 6월 7일

[cpython] Python re 모듈의 findall, sub, subn 성능 개선: PyList_AppendTakeRef 도입

Python re 모듈의 findall, sub, subn 함수에서 리스트 생성 시 불필요한 참조 카운트 연산을 제거하여 성능을 개선했습니다.

#Python #CPython #Performance #Regex #Optimization

2026년 6월 7일

[cpython] CPython 내부 최적화: Reference Stealing을 통한 Frame Locals 수집 속도 향상

CPython의 frame.f_locals.items() 성능을 4% 향상시킨 Reference Stealing 기법과 내부 API 최적화 분석

#Python #CPython #Optimization #C-API #ReferenceCounting

2026년 6월 7일

[sglang] SGLang의 Ideogram4 추론 성능 최적화: Denoising 루프 내 오버헤드 제거

Ideogram4 모델의 Denoising 루프에서 반복적으로 수행되던 마스크 메타데이터 생성 및 스케줄 계산을 사전 연산으로 최적화하여 성능을 개선했습니다.

#SGLang #Diffusion #Optimization #Performance #Ideogram4

2026년 6월 7일

[논문리뷰] dots.tts Technical Report

본 논문은 기존의 이산적(Discrete) 토큰 기반 TTS 모델이 가진 표현력의 한계를 극복하고, 연속적인(Continuous) latent 공간에서 안정적인 AR 음성 생성을 구현하고자 합니다.

#Review #Text-to-Speech #Continuous Latent #Flow-Matching #Autoregressive #AudioVAE #Self-Correction #MeanFlow Distillation

2026년 6월 7일

[논문리뷰] Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

본 논문은 LLM이 우수한 zero-shot 능력을 갖추고 있음에도 불구하고, 범용 text embedding 모델로 활용될 때는 성능이 저하되는 원인을 분석하고 해결하고자 한다.

#Review #Large Language Model #Text Embedding #Mechanistic Interpretability #Unembedding Matrix #Dimensionality Reduction #Logit Lens #Edge Spectrum

2026년 6월 7일