#RAG Systems

3개의 포스트

[논문리뷰] MMA: Multimodal Memory Agent

롱-호라이즌 멀티모달 에이전트의 메모리 검색 시 발생하는 오래되거나, 신뢰도가 낮거나, 상충되는 정보로 인한 과신 오류 및 안전 문제를 해결하는 것이 목표입니다. 특히 에이전트가 노이즈가 많고, 정보가 불안정하며, 모순적인 기억에 직면했을 때의 신뢰성 부족을 극복하고자 합니다.

#Review #Multimodal AI #Memory-Augmented Agents #Reliability Assessment #Epistemic Prudence #RAG Systems #Confidence Scoring #Belief Dynamics #Multimodal Conflict

2026년 2월 18일

[논문리뷰] RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models

이 논문은 RAG(Retrieval-Augmented Generation) 시스템 에서 언어 모델이 불충분하거나 신뢰할 수 없는 정보 를 기반으로 답변을 거부하는 선택적 거부(selective refusal) 능력의 평가 문제를 다룹니다.

#Review #RAG Systems #Selective Refusal #Generative Evaluation #Linguistic Perturbations #LLM Evaluation #Informational Uncertainty #Model Calibration #AI Safety

2025년 10월 17일

[논문리뷰] Context Is What You Need: The Maximum Effective Context Window for Real World Limits of LLMs

이 논문은 대규모 언어 모델(LLM) 공급자가 홍보하는 최대 컨텍스트 윈도우(MCW) 와 실제 사용 환경에서의 최대 유효 컨텍스트 윈도우(MECW) 간의 불일치를 해결하고자 합니다.

#Review #Large Language Models #Context Window #Effective Context Window #Model Performance #Hallucination Rates #RAG Systems #Token Limits

2025년 10월 1일