#Efficient LLM

3 posts

[Paper Review] Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

A detailed review of the paper 'Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers', posted on arXiv.

#Review #Transformer #Sparse Attention #Adaptive Sparsity #Efficient LLM #Attention Router #Long-Context #Hybrid Attention

January 26, 2026

[Paper Review] TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding

A detailed review of the paper 'TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding', posted on arXiv.

#Review #Long Video Understanding #Hybrid Mamba-Transformer #Vision-Language Model #Token Compression #Vision-to-Text Aggregation #Efficient LLM #Multimodal AI

November 20, 2025

[Paper Review] FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents

A detailed review of the paper 'FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents', posted on arXiv by Léo Boisvert.

#Review #Web Agents #LLM Context Pruning #Accessibility Tree #Prompt Injection #Retrieval Augmented Generation #Web Navigation #Agent Security #Efficient LLM

October 6, 2025