최신 포스트

[논문리뷰] UP: Unbounded Positive Asymmetric Optimization for Breaking the Exploration-Stability Dilemma

본 연구는 기존 RL 프레임워크가 사용하는 Importance Sampling (IS) 기반의 클리핑 메커니즘이 LLM의 복잡한 추론 경로 탐색을 구조적으로 제한한다는 문제를 해결합니다.

#Review #Reinforcement Learning #Large Language Models #Exploration-Stability Dilemma #Importance Sampling #Asymmetric Optimization #Probability Capacity

2026년 7월 9일

[논문리뷰] PhyMRI-SR: Toward Physics-Aware MRI Image Super-Resolution

본 논문은 기존 MRI Super-Resolution(SR) 연구가 저해상도 입력을 고정된 목표로 간주하고 결정론적 매핑만을 수행한다는 한계를 지적합니다. 하지만 실제 MRI 획득 과정에서 해상도와 SNR은 물리적으로 긴밀하게 결합되어 있어, 고정된 입력이 항상 최적의 정보를 담고 있는 것은 아닙니다 .

#Review #MRI Super-Resolution #Physics-Aware Reconstruction #2D Gaussian Splatting #Resolution-SNR Trade-off #Meta-Learning #Biophysical Constraints

2026년 7월 9일

[논문리뷰] OpenCoF: Learning to Reason Through Video Generation

본 논문은 기존 비디오 생성 모델들이 시각적 사실성(Visual Realism)은 뛰어나지만, 정교한 논리적 추론(Reasoning) 능력이 부족하다는 문제점을 해결하고자 합니다.

#Review #Chain-of-Frame #Video Generation #Reasoning #OpenCoF-17K #Wan-CoF #Visual Reasoning Tokens #Textual Reasoning Tokens

2026년 7월 9일

[논문리뷰] LongE2V: Long-Horizon Event-based Video Reconstruction, Prediction, and Frame Interpolation with Video Diffusion Models

본 논문은 기존 event-based vision 모델들이 겪는 성능 한계와 작업별 파편화 문제를 해결하기 위해 LongE2V를 제안한다.

#Review #Event-based Vision #Video Diffusion Models #Video Reconstruction #Long-horizon Prediction #Frame Interpolation #Autoregressive Unrolling

2026년 7월 9일

[논문리뷰] Linear Attention Architectures: Mechanisms, Trade-offs, and Cross-Layer Routing

본 논문은 Transformer의 self-attention이 긴 컨텍스트에서 가지는 $O(T^2)$ 연산 비용 문제를 해결하기 위해, Recurrent-memory 기반 Linear Attention 아키텍처들의 구조적 특성을 체계적으로 분석합니다.

#Review #Linear Attention #Recurrent Associative Memory #DeltaNet #Cross-Layer Routing #Architecture Trade-offs #CLVR #CLER

2026년 7월 9일

[논문리뷰] Jet-Long: Efficient Long-Context Extension with Dynamic Bifocal RoPE

기존의 Zero-shot context extension 방법들은 고정된 하나의 리스케일링 팩터를 사용하므로, 짧은 컨텍스트에서의 충실도와 긴 컨텍스트에서의 외삽(extrapolation) 성능 사이에서 불가피한 트레이드오프를 겪습니다.

#Review #Long-Context Extension #Zero-shot #RoPE #Bifocal RoPE #Inclusion–Exclusion Attention #CuTe Kernel

2026년 7월 9일

[논문리뷰] Ideas Have Genomes: Benchmarking Scientific Lineage Reasoning and Lineage-Grounded Idea Generation

본 연구는 현행 AI 시스템이 논문 생성 및 연구 지원 시 혈통적 계승 구조를 이해하지 못하고 표면적인 topical proximity에 의존하는 문제를 해결하고자 한다.

#Review #Scientific Lineage #Idea Genome #GenomeDiff #IG-Bench #Automated Research #Lineage Competence

2026년 7월 9일

[논문리뷰] Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models

기존의 Inference-Time Scaling 연구들은 주로 중간 디노이징 단계에서 빈번한 검증을 통해 후보를 탐색하거나 안내하는 방식에 집중해 왔으나, 정작 생성 자체에 드는 비용을 과도하게 무시하고 있다 .

#Review #Inference-Time Scaling #Diffusion Models #Draft Generation #Wall-clock Efficiency #Multi-stage Verification #Discrete Optimization

2026년 7월 9일

[논문리뷰] Enhancing In-context Panoramic Generation via Geometric-aware Pretraining

본 논문은 기존 파노라마 이미지 생성 모델이 겪는 3D 기하학적 일관성 부족 문제를 해결하기 위해 제안되었다.

#Review #Panoramic Generation #In-context Learning #Geometry-aware Pretraining #Flow Matching #Velocity Circular Padding #Canvas360Dataset

2026년 7월 9일

[논문리뷰] DrugGen 2: A disease-aware language model for enhancing drug discovery

본 논문은 기존의 약물 생성 모델들이 질병의 맥락을 고려하지 않고 표적 단백질이나 일반적인 분자 특성에만 의존하여 생성된 약물의 치료적 타당성이 부족하다는 문제를 해결하고자 합니다 .

#Review #Drug Design #Drug Repositioning #Large Language Model #Reinforcement Learning #Disease-Aware #GRPO #SMILES

2026년 7월 9일

[논문리뷰] CineMobile: On-Device Image-to-Video Diffusion for Cinematic Camera Motion Generation

본 논문은 최신 Diffusion Transformers(DiTs) 모델이 뛰어난 비디오 생성 성능에도 불구하고, 거대한 파라미터 크기와 다단계 추론 과정으로 인해 모바일 기기에서의 실시간 및 효율적 생성이 어렵다는 문제를 해결하고자 합니다.

#Review #Diffusion Transformers #Image-to-Video #On-Device AI #Model Compression #Step Distillation #Hybrid Quantization #Cinematic Motion

2026년 7월 9일

[논문리뷰] CausalDS: Benchmarking Causal Reasoning in Data-Science Agents

본 논문은 현대의 LLM 기반 데이터 과학 에이전트들이 복잡한 인과적 추론을 수행하는 능력이 부족하거나 불투명하다는 문제를 해결하고자 합니다.

#Review #Causal Reasoning #Data-Science Agents #Structural Causal Models #Benchmarking #Identifiability #Uncertainty Quantification #Tool Use

2026년 7월 9일

[논문리뷰] Can Dialects Be Steered Like Languages? Sparse Neurons and Distributed Directions in Arabic LLMs

본 논문은 현대의 Arabic LLM들이 MSA(Modern Standard Arabic) 데이터에 과도하게 편향되어 방언 생성 능력이 부족하다는 문제를 해결하고자 합니다.

#Review #Arabic LLMs #Dialect Steering #Mechanistic Interpretability #Activation Steering #Sparse Neurons #Inference-time Intervention

2026년 7월 9일

[논문리뷰] ARDY: Autoregressive Diffusion with Hybrid Representation for Interactive Human Motion Generation

본 논문은 실시간 인터랙티브 환경에서 정교한 텍스트 제어와 긴 지평의 kinematic constraints를 동시에 만족하는 고품질 인간 움직임 생성 모델을 제안합니다 .

#Review #Interactive Motion Generation #Autoregressive Diffusion #Hybrid Representation #Kinematic Constraints #Motion Tokenizer #Two-Stage Denoiser #Streaming Generation

2026년 7월 9일

[논문리뷰] A Sparse and Truncated State Vector Simulator for Peaked Circuits

본 논문은 Peaked Circuits의 효율적인 시뮬레이션을 위해 메모리 및 연산 자원을 절감할 수 있는 Sparse and Truncated State Vector 시뮬레이터를 제안한다. 기존의 Dense 시뮬레이터는 O(2^n)의 메모리를 요구하여 큐비트 수가 증가함에 따라 확장성에 한계가 있다.

#Review #Quantum Circuit Simulation #Sparse State Vector #Truncated Simulation #Peaked Circuits #GPU Acceleration #Vectorized Operations

2026년 7월 9일

[논문리뷰] A Quantized Native Runtime for On-Device Semantic Audio Generation

본 연구는 Stable Audio 3와 같은 최첨단 생성형 음악 모델을 클라우드 데이터센터가 아닌 로컬 및 임베디드 기기에서 구동하고자 할 때 발생하는 문제들을 해결하는 것을 목표로 한다.

#Review #On-Device Audio Generation #Quantization #Stable Audio 3 #Activation Steering #Sonic Seasoning #C/CUDA Runtime

2026년 7월 9일

[onnxruntime] ONNX Runtime WebGPU: Intel Xe-3LPG를 위한 고성능 GEMM 최적화 분석

Intel Xe-3LPG 아키텍처에서 vec4 로드와 B 타일 더블 버퍼링을 통해 GEMM 성능을 평균 12.7% 향상시킨 최적화 기법을 분석합니다.

#WebGPU #ONNX Runtime #GEMM #GPU Optimization #Intel Xe-3LPG

2026년 7월 8일

[flashinfer] FlashInfer, 초저병렬성 환경에서의 CP 델타 규칙 사전 계산 최적화

FlashInfer가 초저병렬성 환경에서 CP 델타 규칙 사전 계산 성능을 개선했습니다.

#FlashInfer #LLM #최적화 #GPU #CUDA

2026년 7월 8일

[flashinfer] FlashInfer의 BF16 GEMM 성능 극대화: CUDA Graph와 Cold L2 Cache 도입

FlashInfer의 SM100 타겟 BF16 GEMM 연산에 CUDA Graph와 Cold L2 Cache를 적용하여 오버헤드를 줄이고 성능 안정성을 확보한 사례를 분석합니다.

#FlashInfer #CUDA #GEMM #PerformanceOptimization #GPU

2026년 7월 8일

[sglang] SGLang MoE Shared Expert 최적화: 3개 커널을 1개로 융합하여 GPU 오버헤드 제거

SGLang에서 MoE Shared Expert 처리 시 3개의 GPU 커널을 1개로 융합하여 성능을 개선했습니다.

#SGLang #MoE #Kernel Fusion #Triton #GPU Optimization #AMD AITER

2026년 7월 8일