#Indexer

1개의 포스트

[논문리뷰] MISA: Mixture of Indexer Sparse Attention for Long-Context LLM Inference

본 논문은 Long-context LLM Inference에서 indexer 연산이 전체 비용의 지배적인 비중을 차지하는 문제를 해결하기 위해 MISA를 제안한다.

#Review #Large Language Models #Long-Context #Sparse Attention #Mixture of Experts #Indexer #Inference Efficiency #Retrieval

2026년 5월 10일