최신 포스트

[Open WebUI] 공유 채팅 목록에서 불필요한 JSON 역직렬화를 제거하여 응답 속도 개선

전체 Chat 행을 로드하던 공유 채팅 목록 API를 컬럼 프로젝션으로 전환하여, 대용량 대화 JSON 역직렬화를 완전히 제거한 최적화 분석.

#Open WebUI #Python #Performance #SQLAlchemy #Database

2026년 2월 19일

[Grafana Loki] 검증이 완료될 때까지 accepted stream 캐시를 비활성화

확률적 자료구조인 블룸 필터 기반 캐시의 효과를 추가 검증하기 위해 기본값을 비활성으로 변경한 분석.

#Grafana Loki #Go #Bloom Filter #Cache #Feature Flag #Reliability

2026년 2월 19일

[Grafana Loki] 범위 집계를 병렬 파티션으로 푸시다운하여 쿼리 처리 최적화

결합법칙/교환법칙이 성립하는 집계 연산을 parallelPushdown 최적화에 적용하여, 네트워크 전송량 감소와 파이프라인 병목 해소를 동시에 달성한 분석.

#Grafana Loki #Go #Performance #Query Optimization #Parallel Processing

2026년 2월 19일

[feast] Feast 성능 최적화: 엔티티 키 직렬화 Hot Path 2.4배 개선하기

Feast의 온라인 스토어 성능을 좌우하는 엔티티 키 직렬화 로직을 Fast Path 도입과 memoryview 활용으로 최대 141% 개선한 사례를 분석합니다.

#Python #Performance #Feast #Optimization #Zero-copy

2026년 2월 19일

[Feast] Feast 엔티티 키 직렬화 핫패스 최적화

single-entity fast path와 memoryview zero-copy 슬라이싱으로 직렬화/역직렬화 성능을 개선

#Feast #Feature Store #Serialization #Performance

2026년 2월 19일

[Ray RLlib] SingleAgentEnvRunner의 validate 호출 위치 최적화로 3.1배 속도 향상

Ray RLlib의 SingleAgentEnvRunner에서 매 스텝마다 호출되던 validate를 에피소드 완료 시점으로 이동하여 add_step_data의 누적 시간을 16.7초에서 5.43초로 줄인 최적화를 분석합니다.

#Ray #RLlib #Python #Performance #Reinforcement Learning #Optimization

2026년 2월 19일

[Ray Core] Memory Monitor의 OS별 조건부 컴파일 패턴 적용

메모리 모니터를 인터페이스 분리 + OS별 빌드로 리팩토링하여 유지보수성과 확장성 개선.

#Ray #C++#Performance #Memory Management #Architecture

2026년 2월 18일

[pydantic-ai] Temporal/DBOS MCP 서버에서 매번 도구 목록을 다시 가져오는 문제 수정

Temporal과 DBOS의 MCP 래퍼에서 캐시된 도구 정의를 활용하여 불필요한 MCP 서버 왕복을 제거한 사례를 분석합니다.

#pydantic-ai #MCP #DBOS #Temporal #Caching #Performance

2026년 2월 19일

[논문리뷰] World Action Models are Zero-shot Policies

본 논문은 Vision-Language-Action (VLA) 모델의 한계인 새로운 환경에서 미지의 물리적 동작에 대한 일반화 능력 부족을 해결하고자 합니다.

#Review #World Action Models #Video Diffusion Models #Zero-shot Generalization #Cross-embodiment Transfer #Real-time Control #Robotics #Foundation Models #Flow Matching

2026년 2월 18일

[논문리뷰] Visual Memory Injection Attacks for Multi-Turn Conversations

본 논문은 대규모 시각-언어 모델(LVLM)의 다중 턴 대화 환경에서의 보안 취약점을 해결하고자 합니다.

#Review #LVLM #Adversarial Attacks #Multi-Turn Conversations #Visual Memory Injection #Stealthy Attacks #Benign Anchoring #Context-Cycling

2026년 2월 18일

[논문리뷰] Towards a Science of AI Agent Reliability

AI 에이전트의 높은 벤치마크 정확도와 실제 배포 시의 잦은 실패 간의 격차를 해소하는 것이 이 연구의 주요 목표입니다.

#Review #AI Agents #Reliability #Evaluation Metrics #Consistency #Robustness #Predictability #Safety #Benchmarks

2026년 2월 18일

[논문리뷰] SLA2: Sparse-Linear Attention with Learnable Routing and QAT

본 논문은 기존 Sparse-Linear Attention (SLA)의 한계, 즉 주의 가중치 크기에 기반한 휴리스틱 기반의 어텐션 분할 과 희소 및 선형 어텐션 출력 간의 불일치 를 해결하는 것을 목표로 합니다.

#Review #Sparse-Linear Attention #Diffusion Models #Video Generation #Learnable Routing #Quantization-Aware Training #Attention Acceleration #Model Optimization

2026년 2월 18일

[논문리뷰] SAM 3D Body: Robust Full-Body Human Mesh Recovery

본 연구는 단일 이미지로부터 강건한 전신 3D 인체 메시 복원(HMR) 을 목표로 하는 SAM 3D Body (3DB) 모델을 제안합니다. 특히, 도전적인 자세, 심각한 폐색, 그리고 흔치 않은 시점 등 다양한 실제 환경 조건에서 기존 HMR 모델의 낮은 견고성 및 부정확성을 개선하고자 합니다.

#Review #Human Mesh Recovery (HMR)#Full-Body Pose Estimation #Promptable Models #Momentum Human Rig (MHR)#Data Engine #Encoder-Decoder #Robustness #3D Vision

2026년 2월 18일

[논문리뷰] Optimizing Few-Step Generation with Adaptive Matching Distillation

본 논문은 Distribution Matching Distillation (DMD) 과정에서 발생하는 'Forbidden Zones'으로 인한 불안정성과 성능 저하 문제를 해결하는 것을 목표로 합니다.

#Review #Diffusion Models #Knowledge Distillation #Few-Step Generation #Adaptive Matching #Forbidden Zones #Generative Models #Sample Quality #Training Stability

2026년 2월 18일

[논문리뷰] Multi-agent cooperation through in-context co-player inference

다중 에이전트 강화 학습(MARL)에서 자기 이익을 추구하는 에이전트 간의 협력을 유도하는 근본적인 문제를 해결하고자 합니다.

#Review #Multi-Agent Reinforcement Learning #In-Context Learning #Cooperation #Sequence Models #Opponent Shaping #Iterated Prisoner's Dilemma #Predictive Policy Improvement

2026년 2월 18일

[논문리뷰] MMA: Multimodal Memory Agent

롱-호라이즌 멀티모달 에이전트의 메모리 검색 시 발생하는 오래되거나, 신뢰도가 낮거나, 상충되는 정보로 인한 과신 오류 및 안전 문제를 해결하는 것이 목표입니다. 특히 에이전트가 노이즈가 많고, 정보가 불안정하며, 모순적인 기억에 직면했을 때의 신뢰성 부족을 극복하고자 합니다.

#Review #Multimodal AI #Memory-Augmented Agents #Reliability Assessment #Epistemic Prudence #RAG Systems #Confidence Scoring #Belief Dynamics #Multimodal Conflict

2026년 2월 18일

[논문리뷰] MAEB: Massive Audio Embedding Benchmark

오디오 임베딩 모델의 평가 프로토콜이 파편화되어 모델 비교 및 의미 있는 진척도 추적에 어려움이 있는 문제를 해결하고자 합니다. 이를 위해 광범위하고 통일된 평가 프레임워크 인 MAEB(Massive Audio Embedding Benchmark) 를 구축하여 범용 오디오 임베딩 모델 개발을 촉진하는 것을 목표로 합니다.

#Review #Audio Embedding #Benchmark #Multimodal #Zero-shot Classification #Clustering #Representation Learning #MTEB Ecosystem #Cross-modal Audio-Text #Multilingual Audio

2026년 2월 18일

[논문리뷰] Learning Situated Awareness in the Real World

본 논문은 기존의 멀티모달 파운데이션 모델(MFM) 벤치마크들이 환경 중심의 공간 관계에만 초점을 맞추고, 에이전트의 시점, 자세, 움직임에 따른 관찰자 중심의 상황 인식(situated awareness) 을 간과하는 문제점을 해결하고자 합니다.

#Review #Situated Awareness #Egocentric Vision #Spatial Reasoning #Multimodal Foundation Models #Video Understanding #Benchmark #Real-world Data

2026년 2월 18일

[논문리뷰] Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation

본 연구는 인간형 로봇이 온보드 센서만을 사용하여 새로운 객체를 새로운 환경에서 자율적으로 로코-조작(loco-manipulate) 하는 능력을 개발하는 것을 목표로 합니다. 특히, 정확한 엔드-이펙터(EE) 제어 와 오픈-보케뷸러리 대규모 시각 모델 을 통한 장면 이해의 일반화라는 핵심 난제를 해결하고자 합니다.

#Review #Humanoid Robotics #End-Effector Control #Loco-Manipulation #Open-Vocabulary Perception #Visual Generalization #Sim2Real Transfer #Residual Learning #Robot Grasping

2026년 2월 18일

[논문리뷰] Empty Shelves or Lost Keys? Recall Is the Bottleneck for Parametric Factuality

본 논문은 대규모 언어 모델(LLM)의 사실성(factuality) 오류 원인을 '지식 누락(encoding failure, empty shelves)'과 '인코딩된 사실 접근 제한(recall failure, lost keys)'으로 구분하여 명확히 규명하는 것을 목표로 합니다.

#Review #LLM Factuality #Knowledge Profiling #Encoding vs. Recall #WikiProfile Benchmark #Inference-time Computation #Reversal Curse #Long-tail Knowledge #Parametric Knowledge

2026년 2월 18일