#Global Context Collapse

1개의 포스트

[논문리뷰] MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Transformer의 핵심 모듈인 Self-Attention의 2차 시간 복잡성 으로 인한 확장성 문제를 해결하고자 합니다.

#Review #Linear Attention #Multi-Head Attention #Transformer #Global Context Collapse #Representational Diversity #Image Generation #NLP #Video Generation

2026년 1월 12일