최신 포스트

[논문리뷰] OpenSkill: Open-World Self-Evolution for LLM Agents

본 논문은 LLM 에이전트가 배포 후 외부의 정답이나 지도 없이 스스로 학습할 수 있는 'Open-World Self-Evolution' 환경에서의 불확실성을 해결하고자 합니다 .

#Review #Open-World Self-Evolution #LLM Agents #Supervision-Free #Skill Evolution #Virtual Verifier #Knowledge Acquisition #Model Transferability

2026년 6월 7일

[논문리뷰] Measuring Model Robustness via Fisher Information: Spectral Bounds, Theoretical Guarantees, and Practical Algorithms

본 논문은 딥러닝 모델의 견고성 평가가 특정 공격(Attack-dependent)에 과도하게 의존하고 있으며, 이론적 근거가 부족하다는 점을 해결하고자 한다. 기존의 Lipschitz constant나 CLEVER score와 같은 지표들은 확장성(Scalability)이 낮거나 확률적 해석력이 부족하다는 한계가 있다.

#Review #Model Robustness #Fisher Information Matrix #Spectral Norm #Adversarial Vulnerability #Interpretability #Deep Learning

2026년 6월 7일

[논문리뷰] MMAE: A Massive Multitask Audio Editing Benchmark

본 연구는 instruction-based audio editing 분야의 급격한 발전에도 불구하고, 이를 체계적으로 평가할 수 있는 통합적인 인프라가 부재하다는 문제점을 해결하고자 합니다.

#Review #Audio Editing #Benchmark #Multitask Learning #Rubric-based Evaluation #Instruction Following #Consistency

2026년 6월 7일

[논문리뷰] LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models

본 논문은 에이전트 시스템에서 도구 호출과 계획 수립처럼 서로 다른 복잡도를 가진 작업이 수행됨에도 불구하고, 모든 단계에 동일한 연산량을 투입하는 비효율성을 해결하고자 합니다. 기존 LLM 추론 시스템은 고정된 transformer 레이어 구조를 사용하여 모든 토큰에 대해 동일한 컴퓨팅 비용을 소모합니다.

#Review #Layer Skipping #Agentic LLM #LoRA #Adaptive Inference #Straight-Through Estimator #Model Efficiency

2026년 6월 7일

[논문리뷰] LLM Explainability with Counterfactual Chains and Causal Graphs

본 논문은 LLM의 추론 과정이 불투명하여 고위험 영역에서의 신뢰성 확보가 어렵다는 문제를 해결하고자 합니다. 기존의 어텐션 분석이나 특징 기여도(feature attribution) 방식은 본질적으로 상관관계에 기반하고 있어, LLM의 복잡한 추론 메커니즘을 명확하게 설명하는 데 한계가 있습니다.

#Review #LLM Explainability #Causal Graphs #Counterfactual Chains #Concept Discovery #MCMC #Predictive Fidelity

2026년 6월 7일

[논문리뷰] LIMMT: Less is More for Motion Tracking

본 논문은 휴머노이드 모션 트래킹 학습에서 무분별한 데이터 확장(Data Scaling)이 오히려 성능 저하를 초래한다는 문제점을 지적합니다.

#Review #Motion Tracking #Humanoid Robot #Data-Centric AI #Physics-based Simulation #Imitation Learning #Data Curation

2026년 6월 7일

[논문리뷰] How Far Can Chord-Symbol Time-Series Adaptation Carry Genre Identity? Capabilities and Boundaries in Multi-Genre Chord-Symbol Modeling

본 논문은 chord-symbol 시계열 데이터가 실제 음악 장르의 정체성을 얼마나 담아낼 수 있는지, 그 표현력의 한계는 어디인지를 규명하는 것을 목적으로 한다.

#Review #Chord-symbol modeling #Genre identity #PEFT #LoRA #Music Transformer #Representation boundary

2026년 6월 7일

[논문리뷰] HarnessForge: Joint Harness and Policy Evolution for Adaptive Agent Systems

본 논문은 LLM agent 시스템의 Meta-adaptation을 수행할 때 발생하는 '실행 호환성(Executable Compatibility) 결여' 문제를 해결합니다.

#Review #LLM Agents #Meta-Adaptation #Harness-Policy Co-evolution #Agent System Design #Reasoning Policy Alignment

2026년 6월 7일

[논문리뷰] GENEB: Why Genomic Models Are Hard to Compare

본 논문은 현재 유전체 머신러닝 분야가 파편화된 벤치마크와 상호 호환되지 않는 평가 프로토콜로 인해 모델 간의 정당한 비교가 불가능한 문제에 직면해 있다고 지적한다 .

#Review #Genomic Foundation Models #Benchmark #Probing #Cross-Model Evaluation #Architecture #Pretraining #Genomics

2026년 6월 7일

[논문리뷰] Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development

본 논문은 확산 모델(Diffusion Model) 학습 시 확신에 기반한 그래디언트 가중치 부여가 모델의 오류를 증폭시킬 수 있다는 기존의 고정관념을 반박하고, 이를 통해 구조적 이점을 얻을 수 있음을 입증합니다.

#Review #Diffusion Models #Belief Space #Music Generation #LoRA #Implicit Curriculum #Entropy #Log-Barrier

2026년 6월 7일

[논문리뷰] Direct 3D-Aware Object Insertion via Decomposed Visual Proxies

본 연구는 기존의 Object insertion 기술이 2D image plane에 국한되어 있어, 사용자가 원하는 물체의 3D pose를 정밀하게 제어하지 못하는 한계를 해결하고자 합니다.

#Review #Object Insertion #Pose-Controllable #Decomposed Visual Proxies #3D-Aware #Diffusion Model #Image Synthesis

2026년 6월 7일

[논문리뷰] Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

본 논문은 Agentic Search 환경에서 기존 Retriever의 경직성이 전체 시스템 성능의 병목 현상(bottleneck)을 유발한다는 점을 해결하고자 합니다. 기존 연구들은 주로 Reasoning Agent만을 최적화하거나, Retriever를 고정된 블랙박스로 간주하는 한계를 보입니다.

#Review #Agentic Search #Retrieval-Augmented Generation #Instruction-tuned Retriever #Inference-time Scaling #Contrastive Learning #Introspective Feedback

2026년 6월 7일

[논문리뷰] Compress-Distill: Reasoning Trace Compression for Efficient Knowledge Distillation

본 논문은 최신 Reasoning 모델들이 생성하는 긴 Chain-of-Thought 추론 과정이 Distillation 시 비용을 크게 증가시키고, Student 모델이 지나치게 장황한 답변을 생성하도록 유도한다는 점에 주목합니다.

#Review #Knowledge Distillation #Chain-of-Thought #Reasoning Trace #Model Compression #Supervised Fine-tuning #Inference Efficiency #Large Language Models

2026년 6월 7일

[논문리뷰] Augmenting Attention with Exponentially Decaying Memory Improves Query-Aware KV Sparsity

본 논문은 Long-context LLM의 추론 효율성을 높이기 위한 기존 Query-aware sparse inference 기법들의 성능 한계를 극복하는 것을 목표로 한다.

#Review #Efficient Inference #Query-Aware Sparsity #KV-cache #Exponentially Decaying Memory #RAT+#Long-Context LLM

2026년 6월 7일

[논문리뷰] AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization

기존의 interactive world model들은 주로 키보드/마우스 입력이나 단순한 텍스트 프롬프트에 의존하여, 인간의 실제 동작(full-body motion)에 기반한 자연스러운 상호작용을 반영하지 못하는 한계가 있습니다.

#Review #Embodied AI #Egocentric World Simulation #World Customization #Human Action Control #Anchor-View Priors #Video Generation

2026년 6월 7일

[vllm] vLLM의 GDN 어텐션 최적화: Prefill과 Decode 배치 분리를 통한 2배 성능 향상

Mixed 배치에서 Prefill과 Decode를 분리하여 GDN 어텐션 연산 효율을 극대화하고 1.93배의 커널 속도 향상을 달성했습니다.

#vLLM #LLM #Performance #Optimization #CUDA #GDN

2026년 6월 6일

[sglang] UniPC 스케줄러에서 GPU 동기화 제거를 통한 성능 최적화 분석

UniPC 스케줄러의 GPU 동기화 오버헤드를 제거하여 성능을 개선한 코드 변경 분석.

#PyTorch #Optimization #GPU #UniPC Scheduler #sglang

2026년 6월 6일

[hermes-agent] CLI 사용자 경험 개선: 백그라운드 캐시 워밍을 통한 모델 선택기 응답 속도 최적화

사용자 입력 전 백그라운드에서 모델 캐시를 미리 로드하여 /model 명령어 응답 시간을 1.5초에서 136ms로 단축했습니다.

#Python #Performance #CLI #Optimization #Async

2026년 6월 5일

[sglang] [SGLang] LingBot 실시간 서빙 최적화: 카메라 컨디셔닝 캐싱과 전송 프로토콜 개선

LingBot의 실시간 지연시간을 10% 이상 단축시킨 카메라 컨디셔닝 캐싱 및 전송 레이어 최적화 기법을 살펴봅니다.

#SGLang #Diffusion #Optimization #Realtime #PyTorch #Performance

2026년 6월 5일

[uv] uv, 대규모 워크스페이스 탐색 속도 1.8배 향상: 중복 파일 읽기 제거

uv가 대규모 워크스페이스 탐색 시 pyproject.toml 파일을 중복으로 읽는 문제를 해결하여 성능을 크게 개선했습니다.

#uv #성능 최적화 #Rust #Python #빌드 도구

2026년 6월 5일