최신 포스트

[논문리뷰] MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier

본 논문은 대규모 언어 모델(LLM)을 활용한 과학적 발견 과정, 특히 P(hypothesis|background)의 직접적인 모델링이 지닌 조합론적 복잡성(O(Nk)) 으로 인한 비실용성을 해결하는 것을 목표로 합니다.

#Review #Scientific Discovery #LLM Training #Combinatorial Complexity #Hierarchical Search #Bounded Composition #Motivation Planning #Tractable Training #TOMATO-STAR Dataset

2026년 3월 5일

[논문리뷰] MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models

다중 모달리티 대규모 언어 모델(MLLMs)에서 채널별 스무딩 양자화(channel-wise smoothing quantization) 기법이 시각 및 텍스트 토큰 활성화의 큰 차이로 인해 실패하는 문제를 해결하는 것이 목표입니다.

#Review #Multimodal LLMs #Post-Training Quantization #Modality-Aware Smoothing #Cross-Modal Compensation #Quantization #Model Compression #SVD-based Whitening

2026년 3월 5일

[논문리뷰] Locality-Attending Vision Transformer

본 논문은 이미지 분류 훈련 후 Vision Transformer (ViT)의 dense prediction 성능, 특히 segmentation 성능을 향상 시키는 것을 목표로 합니다.

#Review #Vision Transformer #Semantic Segmentation #Attention Mechanism #Locality Bias #Gaussian Kernel #Patch Representation #Foundation Models

2026년 3월 5일

[논문리뷰] Large Multimodal Models as General In-Context Classifiers

본 논문은 대규모 멀티모달 모델(LMMs)이 이미지 분류 작업에서 대조 학습 기반 시각-언어 모델(VLMs)보다 성능이 떨어진다는 기존 인식을 재고하고, 인컨텍스트 학습(ICL)이 LMMs의 분류 능력을 얼마나 향상시킬 수 있는지 탐구합니다.

#Review #Large Multimodal Models #In-Context Learning #Image Classification #Open-World Classification #Zero-Shot Learning #Vision-Language Models #CLIP

2026년 3월 5일

[논문리뷰] KARL: Knowledge Agents via Reinforcement Learning

본 논문은 기업 검색 에이전트가 복잡하고 검증하기 어려운 에이전트성 검색 태스크에서 최첨단 성능 을 달성하도록 강화 학습 을 통해 훈련하는 시스템인 KARL 을 제안합니다.

#Review #Reinforcement Learning #Knowledge Agents #Enterprise Search #Grounded Reasoning #Multi-task Learning #Off-policy RL #Test-time Compute #Agentic Synthesis

2026년 3월 5일

[논문리뷰] HiFi-Inpaint: Towards High-Fidelity Reference-Based Inpainting for Generating Detail-Preserving Human-Product Images

본 논문은 인간-제품 이미지 생성 시 제품 디테일의 높은 충실도(high-fidelity) 보존 을 보장하는 문제를 해결하고자 합니다.

#Review #Reference-Based Inpainting #High-Fidelity Image Generation #Human-Product Images #Diffusion Models #Detail Preservation #Attention Mechanisms #Loss Functions #Dataset Construction

2026년 3월 5일

[논문리뷰] DreamWorld: Unified World Modeling in Video Generation

기존 비디오 생성 모델들이 시각적 사실성만을 추구하고 세계에 대한 일관된 이해가 부족한 한계를 해결하는 것이 목표입니다. 물리적 상식, 3D 및 시간적 일관성과 같은 이질적인 세계 지식 을 비디오 생성기에 통합하고, 이로 인해 발생하는 시각적 불안정성과 시간적 깜빡임 문제를 완화하고자 합니다.

#Review #Video Generation #World Modeling #Diffusion Models #Multi-modal Integration #Temporal Consistency #Spatial Geometry #Semantic Consistency #Constraint Annealing

2026년 3월 5일

[논문리뷰] Distribution-Conditioned Transport

본 논문은 기계 학습에서 흔히 발생하는, 훈련 중 관찰되지 않은 소스 및 타겟 분포로 전이 모델을 일반화 하는 문제를 해결하는 것을 목표로 합니다.

#Review #Distribution-Conditioned Transport #Generative Distribution Embeddings #Optimal Transport #Flow Matching #Semi-Supervised Learning #Generalization #Single-cell Genomics #Batch Effect Transfer

2026년 3월 5일

[논문리뷰] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

본 논문은 LLM 에이전트가 Python 중심의 학습 데이터로 인해 R 통계 생태계의 풍부한 통계 방법론을 활용하는 데 어려움을 겪는 문제를 해결하고자 합니다.

#Review #LLM Agents #R Statistical Ecosystem #Retrieval-Augmented Generation #Distribution-Aware Retrieval #R Package Knowledge Base #Statistical Analysis #Embedding Models

2026년 3월 5일

[논문리뷰] AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

본 연구는 기존 멀티모달 벤치마크들이 단일 턴 시각 추론이나 특정 도구 사용 능력에 치우쳐 있어 현실성, 시각적 미묘함, 장기적인 도구 사용을 요구하는 실제 에이전트의 능력을 충분히 포착하지 못하는 문제를 해결하고자 합니다.

#Review #Multimodal Agents #Visual Reasoning #Tool Use #Benchmark #Long-Horizon Tasks #Realistic Scenarios #Agentic Intelligence

2026년 3월 5일

[Triton] AMD ConvertToBufferOps에서 i64 offset 지원

2026년 3월 6일

[triton] Multi-CTA 예제에서 Program ID를 Shared Memory에 저장하여 재계산 방지

CLC 타일 스케줄러에서 planar snake ID를 shared memory에 저장하여 consumer와 epilogue 파티션 간 재계산을 제거한 최적화를 분석합니다.

#Triton #Gluon #GPU #MultiCTA #Optimization

2026년 3월 5일

[Open WebUI] KaTeX 유니코드 정규식 사전 컴파일로 마크다운 렌더링 87% 병목 제거

Open WebUI에서 KaTeX 수식 감지 시 매번 유니코드 정규식을 컴파일하던 병목을 모듈 로드 시 한 번만 컴파일하도록 변경하고, katexStart 함수를 문자 단위 스캔으로 재작성한 최적화를 분석합니다.

#Open WebUI #TypeScript #Performance #Regex #KaTeX #Unicode

2026년 3월 5일

[feast] Feast 성능 최적화: Timestamp 변환 비용 절감으로 온라인 피처 서빙 가속화

Feast의 _convert_rows_to_protobuf 함수에서 Timestamp 변환을 최적화하여 성능을 크게 개선했습니다.

#Feast #Python #성능 최적화 #Protobuf #Timestamp #Feature Store

2026년 3월 5일

[Loki] 컨텍스트 취소 시 downstreamer goroutine 누수 방지

Loki 쿼리 프론트엔드의 downstreamer에서 컨텍스트 취소 시 goroutine이 영구적으로 블로킹되는 누수를 select로 수정한 PR 분석.

#Grafana Loki #Go #Goroutine Leak #Context Cancellation #Channel #Bug Fix

2026년 3월 5일

[Axolotl] MXFP4 양자화 지원 추가

torchao의 MXFakeQuantizeConfig를 활용한 MXFP4 QAT 지원 구현 분석

#Axolotl #Quantization #MXFP4 #QAT #LLM

2026년 3월 5일

[논문리뷰] T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning

본 논문은 대규모 언어 모델(LLM)이 복잡한 텍스트 처리, 특히 장문 컨텍스트 환경에서 겪는 어려움을 해결하고자 합니다.

#Review #Benchmarking #Text-to-Structure #LLM Prompting #Structure-of-Thought #Multihop Reasoning #Graph Extraction #Scientific Documents #Text Processing

2026년 3월 4일

[논문리뷰] Specificity-aware reinforcement learning for fine-grained open-world classification

본 논문은 오픈 월드 환경에서 미세 분류를 수행할 때, 대규모 멀티모달 모델(LMMs) 이 지나치게 일반적인 예측을 내놓는 경향을 해결하고자 합니다. 모델의 정확성 을 저해하지 않으면서 예측의 구체성(specificity) 을 향상시키는 것이 주된 연구 목표입니다.

#Review #Open-World Classification #Fine-Grained Classification #Reinforcement Learning #LMMs #Specificity-Aware Reward #GRPO #LLM-as-a-Judge #Cross-Domain Generalization

2026년 3월 4일

[논문리뷰] SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

기존 벤치마크들이 정적이고 단발적인 기능적 정확성 평가에 치중하여 실제 소프트웨어 개발의 복잡한 요구사항 변화와 장기적인 기능 반복을 포착하지 못하는 문제를 해결하는 것이 목표입니다.

#Review #LLM Agents #Software Engineering #Code Maintenance #Continuous Integration #Benchmark #Code Generation #Long-term Evaluation #Technical Debt

2026년 3월 4일

[논문리뷰] RIVER: A Real-Time Interaction Benchmark for Video LLMs

대부분의 Multimodal Large Language Models (MLLMs)이 오프라인 패러다임으로 작동하여 실시간 상호작용 능력이 부족하다는 문제를 해결하고자 합니다.

#Review #Multimodal LLMs #Real-time Interaction #Video Understanding #Benchmark #Temporal Reasoning #Long-term Memory #Proactive Response

2026년 3월 4일