최신 포스트

[논문리뷰] CC30k: A Citation Contexts Dataset for Reproducibility-Oriented Sentiment Analysis

본 논문은 AI/ML 논문 내 인용 문맥에서 재현성(reproducibility) 지향 감성을 식별하기 위한 CC30k 데이터셋 을 구축하는 것을 목표로 합니다. 이는 계산적 재현성 연구를 위한 자원 부족 문제를 해결하고, 대규모 언어 모델(LLM)이 재현성 관련 감성을 효과적으로 예측하도록 훈련하는 기반을 마련합니다.

#Review #Citation Contexts #Reproducibility #Sentiment Analysis #Large Language Models #Crowdsourcing #Dataset #Machine Learning #Science of Science

2025년 11월 13일

[논문리뷰] Black-Box On-Policy Distillation of Large Language Models

본 논문은 내부 로짓이나 파라미터에 접근할 수 없는 블랙박스(black-box) 대규모 언어 모델(LLM) 을 대상으로, 학생 모델이 교사 모델의 텍스트 출력만을 학습하는 온-정책(on-policy) 증류(distillation) 방법을 개발하는 것을 목표로 합니다.

#Review #Large Language Models (LLMs)#Knowledge Distillation (KD)#Black-box Distillation #Generative Adversarial Networks (GANs)#On-policy Learning #Reinforcement Learning #Minimax Game #Model Compression

2025년 11월 13일

[논문리뷰] Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation

현재 텍스트-투-이미지(T2I) 모델이 종종 동질적인 이미지를 생성하며 다양성이 부족하다는 문제를 해결하고자 합니다.

#Review #Text-to-Image Models #Diversity Evaluation #Human Evaluation #Attribute-Conditional #Vendi Score #Generative AI #Benchmarking

2025년 11월 13일

[논문리뷰] AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models

본 논문은 3D 환경에서 자연어 명령을 기반으로 물체의 상호작용 가능한 요소(affordance elements)를 식별하고, 해당 요소의 3D 마스크 , 동작 유형 , 동작 축 방향 을 포함하는 구조화된 트립렛을 예측하는 Fine-grained 3D Embodied Reasoning 이라는 새로운 태스크를 제안합니다.

#Review #3D Embodied Reasoning #Multimodal Large Language Models (MLLMs)#Chain-of-Thought (CoT)#Affordance Grounding #Motion Estimation #View Synthesis #Active Perception

2025년 11월 13일

[Triton] AxisInfoAnalysis에서 미방문 operand 처리 버그 수정

2025년 11월 14일

[pydantic-ai] FastMCPToolset Temporal 통합 — MCP 툴셋 공통 추상화

FastMCPToolset을 Temporal 워크플로우에서 사용할 수 있도록 공통 TemporalMCPToolset 추상 클래스를 도입

#Python #Pydantic AI #MCP #Temporal #Architecture

2025년 11월 13일

[Ray Core] 메모리 스토어와 플라즈마 스토어에서 참조 카운터 분리 리팩터링

Ray의 CoreWorker에서 메모리 스토어와 플라즈마 스토어에 결합되어 있던 참조 카운터 로직을 상위 레이어로 분리하여, 코드 얽힘을 해소하고 유지보수성을 개선한 PR을 분석합니다.

#Ray #Ray Core #Refactoring #C++#Memory Management #Reference Counting

2025년 11월 13일

[Triton] Gluon에 coalesced layout 추가 — 메모리 접근 효율 최적화

Gluon DSL에 coalesced layout을 도입하여 글로벌 메모리 접근의 coalescing을 자동으로 보장한다

#Triton #Gluon #Memory Coalescing #Layout #GPU Optimization

2025년 11월 13일

[Gradio] 큐 성능 개선 — MCP 응답 속도 향상을 위한 구조 리팩터링

MCP 도구 호출 경로를 리팩터링하고 클라이언트 초기화 오버헤드를 제거하여 큐 처리 성능을 개선한다

#Gradio #MCP #Queue Performance #Refactoring

2025년 11월 13일

[cpython] CPython의 새로운 Tracing JIT 컴파일러 프론트엔드

trace projection에서 trace recording 모델로 전환하여 pyperformance 1.7% 향상, Richards 100% 가속

#CPython #JIT Compiler #Trace Recording #Performance

2025년 11월 13일

[Ray Core] request ID 생성을 worker로 이동하여 plasma get 성능 회귀 수정

동기 AsyncGet 응답 대기를 제거하여 plasma store get 처리량 2배 복구

#Ray #Plasma Store #C++#Performance

2025년 11월 12일

[논문리뷰] WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation

본 논문은 기존 Vision-Language Models (VLMs) 기반의 UI-to-Code 접근 방식이 정적인 HTML/CSS 코드만 생성하고 GUI 상호작용을 지원하지 못하는 한계를 극복하고자 합니다.

#Review #UI-to-Code #Vision-Language Models #Agentic Framework #Interactive UI #Web Automation #Code Generation #UI Verification #Supervised Fine-Tuning

2025년 11월 12일

[논문리뷰] WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

VLA 모델이 로봇 조작에 큰 잠재력을 보이지만, 전문가 데모에 의존하여 실패로부터 학습하고 스스로 수정하는 능력이 제한적이라는 문제를 해결하고자 합니다.

#Review #Vision-Language-Action (VLA)#Reinforcement Learning (RL)#Model-based RL #World Models #Policy Optimization #Robotics #Sample Efficiency #Self-correction

2025년 11월 12일

[논문리뷰] Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance

이 논문은 확산 모델의 샘플링 과정에서 발생하는 품질 및 제어 가능성 문제를 해결하고자 합니다.

#Review #Diffusion Models #Guidance Sampling #Optimal Transport #Sinkhorn Algorithm #Self-Attention #Adversarial Perturbation #Image Generation #ControlNet

2025년 11월 12일

[논문리뷰] TiDAR: Think in Diffusion, Talk in Autoregression

본 연구는 대규모 언어 모델(LLM)의 생성 과정에서 확산 모델(Diffusion Models) 의 빠른 병렬 생성 능력과 자기회귀(Autoregressive, AR) 모델 의 높은 품질을 동시에 달성하는 것을 목표로 합니다.

#Review #Hybrid LLM Architecture #Diffusion-Autoregressive #Parallel Token Generation #Speculative Decoding #Structured Attention Masks #LLM Inference Acceleration #KV Cache

2025년 11월 12일

[논문리뷰] Stemming Hallucination in Language Models Using a Licensing Oracle

언어 모델(LMs)의 고질적인 환각(hallucination) 문제, 즉 사실과 다른 정보를 유창하게 생성하는 문제를 해결하는 것이 목표입니다. 통계적 학습 방식의 한계를 극복하고, 구조화된 지식에 대한 결정론적인 진실성 검증 메커니즘 을 통해 LM의 출력에 인식론적 기반 을 마련하고자 합니다.

#Review #Hallucination Mitigation #Language Models #Knowledge Graphs #SHACL Validation #Epistemic Grounding #Retrieval-Augmented Generation #Neuro-symbolic AI

2025년 11월 12일

[논문리뷰] Motif 2 12.7B technical report

대규모 언어 모델(LLM)의 효율성 한계를 확장하고, 제한된 컴퓨팅 자원 내에서 Motif-2-12.7B 모델이 우수한 성능을 발휘할 수 있음을 입증하는 것을 목표로 합니다. 특히 아키텍처 혁신과 시스템 수준 최적화를 통해 대형 모델에 필적하는 능력을 소규모 파라미터로 구현하고자 합니다.

#Review #Large Language Model #LLM Efficiency #Grouped Differential Attention #Kernel Fusion #Parallel Muon #Supervised Fine-tuning #Architectural Scaling #Instruction Following

2025년 11월 12일

[논문리뷰] MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning

본 연구는 멀티모달 대규모 언어 모델(MLLM)이 복잡한 수학 문제 해결과 같은 추론 태스크에서 겪는 어려움을 극복하는 것을 목표로 합니다. 특히, 기존의 정적인 교사 모델 유래 데이터셋에 의존하는 방식이 모델의 새로운 문제 적응력과 견고한 일반화 능력을 제한한다는 한계를 해결하고자 합니다.

#Review #Multimodal Reasoning #Mathematical Problem Solving #Self-Evolving #Iterative Fine-Tuning #Reward Models #Reflection #Large Language Models (LLMs)

2025년 11월 12일

[논문리뷰] MADD: Multi-Agent Drug Discovery Orchestra

초기 신약 개발 과정에서 히트 분자(hit molecule) 식별 에 필요한 막대한 자원과 기존 AI 방법론의 복잡성 및 접근성 부족 문제를 해결하는 것이 목표입니다.

#Review #Multi-Agent System #Drug Discovery #LLM #Hit Identification #Virtual Screening #Generative AI #Property Prediction #Automated Machine Learning

2025년 11월 12일

[논문리뷰] Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

논문은 복잡한 3D 오픈 월드 환경 에서 인간 수준의 효율성으로 수 시간 길이의 미션을 실시간으로 완수할 수 있는 제너럴리스트 에이전트 를 구축하기 위한 '오픈 레시피'인 Lumine을 제시합니다.

#Review #Generalist Agent #3D Open World #Vision-Language Model #Imitation Learning #Real-time Inference #Hybrid Thinking #Action Chunking #Genshin Impact

2025년 11월 12일