#Fisher Information Matrix

2개의 포스트

[논문리뷰] Measuring Model Robustness via Fisher Information: Spectral Bounds, Theoretical Guarantees, and Practical Algorithms

본 논문은 딥러닝 모델의 견고성 평가가 특정 공격(Attack-dependent)에 과도하게 의존하고 있으며, 이론적 근거가 부족하다는 점을 해결하고자 한다. 기존의 Lipschitz constant나 CLEVER score와 같은 지표들은 확장성(Scalability)이 낮거나 확률적 해석력이 부족하다는 한계가 있다.

#Review #Model Robustness #Fisher Information Matrix #Spectral Norm #Adversarial Vulnerability #Interpretability #Deep Learning

2026년 6월 7일

[논문리뷰] AlignGuard-LoRA: Alignment-Preserving Fine-Tuning via Fisher-Guided Decomposition and Riemannian-Geodesic Collision Regularization

대규모 언어 모델(LLM)의 LoRA 미세 조정 과정에서 발생하는 정렬 드리프트(alignment drift) 문제를 해결하여, 안전 및 행동 제약을 유지하면서도 새로운 태스크에 대한 성능 저하를 방지하는 것을 목표로 합니다.

#Review #Alignment Preservation #Fine-Tuning #LoRA #Fisher Information Matrix #Catastrophic Forgetting #LLM Safety #Riemannian Geometry #Parameter-Efficient Learning

2025년 8월 6일