본문으로 건너뛰기

#Explainable AI (XAI)

7개의 포스트

[논문리뷰] AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

댓글 수 로딩 중

[논문리뷰] X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework

댓글 수 로딩 중

[논문리뷰] REFLEX: Self-Refining Explainable Fact-Checking via Disentangling Truth into Style and Substance

댓글 수 로딩 중

[논문리뷰] Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations

댓글 수 로딩 중

[논문리뷰] When Explainability Meets Privacy: An Investigation at the Intersection of Post-hoc Explainability and Differential Privacy in the Context of Natural Language Processing

댓글 수 로딩 중