#Episodic Clustering

1 post

[Paper Review] EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

The goal is to address the linear growth of KV cache memory usage with conversation length in long conversational question answering (LongConvQA) systems built on large language models (LLMs).

#Review #KV Cache Management #Long Conversational QA #LLMs #Memory Efficiency #Episodic Clustering #Block Prefill Eviction #Sensitivity-aware Allocation

September 23, 2025