#Hallucination Detection

9개의 포스트

[논문리뷰] QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation

Lu Cheng이 arXiv에 게시한 'QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation' 논문에 대한 자세한 리뷰입니다.

#Review #Dynamic RAG #Hallucination Detection #Corpus Statistics #Uncertainty Quantification #Pre-training Data #LLM Calibration #Infini-gram #Multi-hop QA

2025년 12월 22일

[논문리뷰] AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs

Tosho Hirasawa이 arXiv에 게시한 'AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs' 논문에 대한 자세한 리뷰입니다.

#Review #Image-Text Alignment #Multimodal Benchmarking #Hallucination Detection #Vision-Language Models #Synthetic Data Generation #Fine-Grained Analysis #Captioning

2025년 12월 3일

[논문리뷰] HaluMem: Evaluating Hallucinations in Memory Systems of Agents

arXiv에 게시된 'HaluMem: Evaluating Hallucinations in Memory Systems of Agents' 논문에 대한 자세한 리뷰입니다.

#Review #Memory Systems #AI Agents #Hallucination Detection #Evaluation Benchmark #Long-term Memory #Memory Extraction #Memory Updating #Question Answering

2025년 11월 10일

[논문리뷰] Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement

Isabelle Augenstein이 arXiv에 게시한 'Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement' 논문에 대한 자세한 리뷰입니다.

#Review #LLMs #Knowledge Interaction #Parametric Knowledge #Contextual Knowledge #Subspace Disentanglement #NLE Generation #Hallucination Detection #Chain-of-Thought

2025년 11월 9일

[논문리뷰] When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Artem Vazhentsev이 arXiv에 게시한 'When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA' 논문에 대한 자세한 리뷰입니다.

#Review #Hallucination Detection #Multilingual LLMs #Span-Level Annotation #Synthetic Data Generation #Question Answering (QA)#Encoder Models #Uncertainty Quantification #GPT-4o

2025년 10월 17일

[논문리뷰] Large Language Models Do NOT Really Know What They Don't Know

arXiv에 게시된 'Large Language Models Do NOT Really Know What They Don't Know' 논문에 대한 자세한 리뷰입니다.

#Review #LLMs #Hallucination Detection #Mechanistic Interpretability #Internal States #Knowledge Recall #Refusal Tuning #Factual Associations #Associated Hallucinations

2025년 10월 17일

[논문리뷰] HalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate Hallucinations in Retrieval-Augmented Generation

Radu State이 arXiv에 게시한 'HalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate Hallucinations in Retrieval-Augmented Generation' 논문에 대한 자세한 리뷰입니다.

#Review #Hallucination Detection #Retrieval-Augmented Generation (RAG)#Small Reasoning Model (SRM)#Preference Fine-tuning #ORPO #Evidence Grounding #Fact-checking

2025년 10월 8일

[논문리뷰] Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications

Fatma Betül Terzioğlu이 arXiv에 게시한 'Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications' 논문에 대한 자세한 리뷰입니다.

#Review #Hallucination Detection #Retrieval Augmented Generation #Large Language Models #Turkish NLP #Token Classification #ModernBERT #Low-Resource Languages

2025년 9월 23일

[논문리뷰] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

Songyang Gao이 arXiv에 게시한 'CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward' 논문에 대한 자세한 리뷰입니다.

#Review #LLM Evaluation #Answer Verification #Reward Model #Benchmarking #Data Augmentation #Reinforcement Learning #Formula Verification #Hallucination Detection

2025년 8월 6일