#State Space Models

8개의 포스트

[논문리뷰] Compiler-First State Space Duality and Portable O(1) Autoregressive Caching for Inference

arXiv에 게시된 'Compiler-First State Space Duality and Portable O(1) Autoregressive Caching for Inference' 논문에 대한 자세한 리뷰입니다.

#Review #State Space Models #Mamba-2 #XLA #JAX #Compiler Codegen #Autoregressive Caching #Hardware Portability #Inference Optimization

2026년 3월 10일

[논문리뷰] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

Eric C. Larson이 arXiv에 게시한 '2Mamba2Furious: Linear in Complexity, Competitive in Accuracy' 논문에 대한 자세한 리뷰입니다.

#Review #Linear Attention #Mamba-2 #High-Order Attention #Model Efficiency #Long Context #Transformer #State Space Models

2026년 2월 19일

[논문리뷰] UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs

arXiv에 게시된 'UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs' 논문에 대한 자세한 리뷰입니다.

#Review #LLM Compression #Quantization #Pruning #Edge AI #Adaptive Deployment #Transformer #State Space Models #Hybrid Models #One-shot Compression

2025년 12월 3일

[논문리뷰] Higher-order Linear Attention

arXiv에 게시된 'Higher-order Linear Attention' 논문에 대한 자세한 리뷰입니다.

#Review #Linear Attention #Higher-order Interactions #Causal Streaming #Associative Scans #Prefix Summaries #Transformer Architectures #State Space Models

2025년 11월 9일

[논문리뷰] MemMamba: Rethinking Memory Patterns in State Space Model

Xiao Sun이 arXiv에 게시한 'MemMamba: Rethinking Memory Patterns in State Space Model' 논문에 대한 자세한 리뷰입니다.

#Review #State Space Models #Mamba #Long-sequence modeling #Memory decay #State summarization #Cross-layer attention #Perplexity #Linear complexity

2025년 10월 10일

[논문리뷰] StateX: Enhancing RNN Recall via Post-training State Expansion

Zhiyuan Liu이 arXiv에 게시한 'StateX: Enhancing RNN Recall via Post-training State Expansion' 논문에 대한 자세한 리뷰입니다.

#Review #RNN #State Expansion #Post-training #Long-context Recall #Linear Attention #State Space Models #GLA #Mamba2

2025년 9월 29일

[논문리뷰] Autoregressive Universal Video Segmentation Model

Albert Gu이 arXiv에 게시한 'Autoregressive Universal Video Segmentation Model' 논문에 대한 자세한 리뷰입니다.

#Review #Video Segmentation #Autoregressive Model #Universal Model #State Space Models #Mamba #Parallel Training #Streaming Video #Deep Learning

2025년 8월 27일

[논문리뷰] Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Jusen Du이 arXiv에 게시한 'Speed Always Wins: A Survey on Efficient Architectures for Large Language Models' 논문에 대한 자세한 리뷰입니다.

#Review #Large Language Models #Efficient Architectures #Transformer Optimization #Linear Attention #State Space Models #Mixture-of-Experts #Sparse Attention #Diffusion LLMs

2025년 8월 19일