#Token Reduction

3개의 포스트

[논문리뷰] Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

David Eigen이 arXiv에 게시한 'Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing' 논문에 대한 자세한 리뷰입니다.

#Review #Video Understanding #Multi-modal Large Language Models (MLLMs)#Vision Transformers (ViTs)#Autoregressive Gazing #Token Reduction #Multi-scale Patches #High-Resolution Video #Long-Form Video

2026년 3월 24일

[논문리뷰] Dynamic Chunking Diffusion Transformer

arXiv에 게시된 'Dynamic Chunking Diffusion Transformer' 논문에 대한 자세한 리뷰입니다.

#Review #Diffusion Transformer #Dynamic Chunking #Adaptive Patching #Image Generation #Computational Efficiency #Token Reduction #Spatial Segmentation #Load Balancing

2026년 3월 8일

[논문리뷰] Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information

Kristian Kersting이 arXiv에 게시한 'Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information' 논문에 대한 자세한 리뷰입니다.

#Review #LLM Reasoning #Chain-of-Thought #Prompt Engineering #Efficiency #Structured Input #Information Extraction #Cognitive Psychology #Token Reduction

2025년 11월 30일