Latest Posts

[Paper Review] WildCity: A Real-World City-Scale Testbed for Rendering, Simulation, and Spatial Intelligence

This paper addresses the lack of a real-world city-scale testbed for training AI to acquire spatial intelligence in complex, city-scale environments.

#Review #City-scale Reconstruction #3D Gaussian Splatting #Autonomous Driving #Neural Rendering #Spatial Intelligence #Digital Twin #Closed-loop Simulation

July 8, 2026

[Paper Review] Wake up for Touch! Mask-isolated Tactile Alignment Learning in MLLMs

This paper addresses the performance degradation that occurs when tactile sensing is integrated into sMLLMs (small MLLMs, $\le$ 3B parameters).

#Review #Multimodal Large Language Models #Tactile Alignment #Catastrophic Forgetting #Model Sparsity #Parameter Isolation #Edge Robotics

July 8, 2026

[Paper Review] Teaching LLMs a Low-Resource Language: Enhancing Code Completion in Pharo

This study addresses the absence of LLM-based code completion tools in the ecosystems of low-resource programming languages such as Pharo.

#Review #Pharo #Low-resource language #Code completion #LLM #Fine-tuning #In-IDE support

July 8, 2026

[Paper Review] Sparse Delta Memory: Scaling the State of Linear RNNs through Sparsity

This paper addresses the memory bottleneck that Linear RNN models face when processing long contexts.

#Review #Linear RNNs #Sparse Delta Memory #Product Key Memory #Long-context Retrieval #IsoFLOP #State Scaling

July 8, 2026

[Paper Review] Single-Rollout Asynchronous Optimization for Agentic Reinforcement Learning

This study addresses two problems in agent training for large language models (LLMs): the efficiency limits of conventional synchronous RL and the training instability of asynchronous RL.

#Review #Reinforcement Learning #Asynchronous RL #Single-Rollout #Agentic RL #Token-level Clipping #Value-model Training

July 8, 2026

[Paper Review] Scaling Mixture-of-Experts Video Pretraining for Embodied Intelligence

Existing video generation models focus mainly on visual quality and creativity, which leaves a domain mismatch: they lack the physical realism and controllability that embodied intelligence requires.

#Review #Mixture-of-Experts #Video Pretraining #Embodied Intelligence #Diffusion Transformer #Reinforcement Learning #Scalability

July 8, 2026

[Paper Review] RoboDojo: A Unified Sim-and-Real Benchmark for Comprehensive Evaluation of Generalist Robot Manipulation Policies

This paper proposes RoboDojo to address the fragmented evaluation of existing robot manipulation benchmarks and the gap between simulation and the real world.

#Review #Robot Manipulation #Generalist Robot Policy #Benchmark #Sim-to-Real #Embodied Intelligence #Evaluation Protocol

July 8, 2026

[Paper Review] Infinite Worlds with Versatile Interactions

This paper addresses the inability of interactive world models to achieve real-time performance and long-term stability at the same time.

#Review #World Models #Causal Video Generation #Interactive Simulation #Agentic Harness #Diffusion Transformer #Long-horizon Stability

July 8, 2026

[Paper Review] Imagined Rollouts are Kinematic, Not Dynamic: A Diagnosis of Long-Horizon World-Model Failure

This paper demonstrates that the degradation of modern world models in long-horizon prediction is not simply the result of compounding error. Rather, the models fail to learn physical dynamics and structurally operate only at the level of kinematics.

#Review #World Models #Kinematic Fallback #iKCE #Long-Horizon Failure #Embodied AI #Dynamic Imagination

July 8, 2026

[Paper Review] Dual Latent Memory in Vision-Language-Action Models for Robotic Manipulation

This paper addresses the temporal short-horizon bias that the Markovian assumption introduces in existing VLA models.

#Review #Vision-Language-Action Models #Latent Memory #Robotic Manipulation #Long-horizon Tasks #Dual-scale Vault #Memory-augmented Reasoning

July 8, 2026

[Paper Review] Automating the Design of Embodied Agent Architectures

This study moves away from the manual design of embodied agent architectures and examines whether that design process can be automated (AAS).

#Review #Embodied Agents #Agent Architecture Search #LLM Agents #AgentCanvas #KDLoop

July 8, 2026

[Paper Review] Accurate, Interdisciplinary and Transparent Structure-property Understanding with Deep Native Structural Reasoning

This study addresses the limits in representation and reasoning that existing AI systems face when interpreting the complex relationships between scientific structures (proteins, chemicals, inorganic crystals, and so on) and their properties.

#Review #Foundation Model #Structure-property Relationship #Multimodal Reasoning #Scientific AI #Chain-of-thought #Native Structural Reasoning

July 8, 2026

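[vllm] vLLM, Diffusion-Gemma Sampler Memory Optimization: Resolving OOM with Request-Based Tiling

An analysis of the PR that resolves out-of-memory (OOM) errors during sampling with the Diffusion-Gemma model in vLLM by using request-based tiling.

#vLLM #Diffusion-Gemma #Optimization #Memory Management #LLM Inference

July 7, 2026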

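[uv] uv-pep440: Analyzing an Optimization That Doubles the Speed of Common Version String Parsing

An analysis of the PR that optimizes parsing of `x.y.z`-style version strings in the uv-pep440 crate, doubling its performance.

#Rust #uv #pep440 #optimization #performance #parsing #software-engineering

July 7, 2026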

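[uv] Optimizing uv's Wheel Tag Compatibility Check: Removing Unnecessary Memory Allocations

An analysis of how removing unnecessary Vec allocations from uv's wheel tag compatibility check improved performance by up to 5.6x.

#Rust #uv #Performance #Optimization #Packaging

July 7, 2026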

[triton] Triton: TMEM Load-Reduce Operation Fusion for the Blackwell Architecture

On Blackwell sm103+ GPUs, TMEM Load and Row Reduction are fused into a single PTX instruction to improve performance.

#Triton #Blackwell #GPU #Optimization #Compiler

July 7, 2026

[Paper Review] Where to cut, how deep: BPE and Unigram-LM on chemistry SMILES

This paper questions whether BPE tokenization, which is taken for granted in chemical language models, is really the best choice, and examines whether BPE and Unigram-LM build different vocabularies in the specialized setting of chemistry.

#Review #Chemistry SMILES #Tokenizer #BPE #Unigram-LM #Subword Algorithm #Vocabulary #Granularity

July 7, 2026

[Paper Review] When Classic Cache Policies Fail: Learning-Augmented Replacement for Semantic Retrieval Buffers

This paper identifies the problem that classic cache policies such as FIFO, LRU, and LFU systematically fail on the semantic workloads of LLM agents.

#Review #Semantic Caching #LLM Agents #Cache Replacement #Online Learning #Thompson Sampling #Regret Bounds

July 7, 2026

[Paper Review] Vision as Unified Multimodal Generation

This paper points out that computer vision relies on fragmented systems that use task-specific architectures and independent loss functions. This fragmentation creates structural limits on integrating, reusing, and combining diverse visual supervision signals.

#Review #Unified Multimodal Generation #Computer Vision #Foundation Models #Instruction Tuning #Dense Prediction #SenseNova-Vision #Multimodal Learning

July 7, 2026

[Paper Review] TurnOPD: Making On-Policy Distillation Turn-Aware for Efficient Long-Horizon Agent Training

This paper sets out to address the resource inefficiency and optimization imbalance that on-policy distillation (OPD) suffers in long-horizon planning and agent environments.

#Review #On-Policy Distillation #Long-Horizon Agents #Turn-Aware #Rollout-Depth Budgeting #Efficiency #Reinforcement Learning

July 7, 2026