#Fact-checking

2개의 포스트

[논문리뷰] Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System

본 연구는 최신 LLM 기반 에이전트 팩트체킹 시스템 이 잘못된 정보를 확산시키거나 진실을 훼손할 수 있는 포이즈닝 공격에 취약함을 지적합니다. 기존 공격 방식은 이러한 정교한 시스템의 클레임 분해 및 교차 검증 메커니즘에 효과적이지 못합니다.

#Review #Adversarial Attack #Poisoning Attack #Fact-checking #LLM Agent #Retrieval Augmented Generation #Misinformation #System Security

2025년 8월 12일

[논문리뷰] HalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate Hallucinations in Retrieval-Augmented Generation

대규모 언어 모델(LLM)과 소형 언어 모델(SLM)이 RAG 애플리케이션에서 흔히 겪는 환각(Hallucination) 문제를 해결하고, 사용자 신뢰도와 설명 가능성을 저해하는 문제를 완화하는 것이 주요 목표입니다.

#Review #Hallucination Detection #Retrieval-Augmented Generation (RAG)#Small Reasoning Model (SRM)#Preference Fine-tuning #ORPO #Evidence Grounding #Fact-checking

2025년 10월 8일