본문으로 건너뛰기

#Computational Efficiency

56개의 포스트

[논문리뷰] Swift Sampling: Selecting Temporal Surprises via Taylor Series

댓글 수 로딩 중

[논문리뷰] FluidWorld: Reaction-Diffusion Dynamics as a Predictive Substrate for World Models

댓글 수 로딩 중

[논문리뷰] Dynamic Chunking Diffusion Transformer

댓글 수 로딩 중

[논문리뷰] Test-Time Training with KV Binding Is Secretly Linear Attention

댓글 수 로딩 중

[논문리뷰] SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

댓글 수 로딩 중

[논문리뷰] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

댓글 수 로딩 중

[논문리뷰] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

댓글 수 로딩 중

[논문리뷰] Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

댓글 수 로딩 중

[논문리뷰] GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts

댓글 수 로딩 중

[논문리뷰] Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling

댓글 수 로딩 중

[논문리뷰] Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

댓글 수 로딩 중

[논문리뷰] SpotEdit: Selective Region Editing in Diffusion Transformers

댓글 수 로딩 중

[논문리뷰] Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

댓글 수 로딩 중

[논문리뷰] Glance: Accelerating Diffusion Models with 1 Sample

댓글 수 로딩 중

[논문리뷰] The Collapse of Patches

댓글 수 로딩 중

[논문리뷰] Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

댓글 수 로딩 중

[논문리뷰] Continuous Autoregressive Language Models

댓글 수 로딩 중

[논문리뷰] MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

댓글 수 로딩 중

[논문리뷰] Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation

댓글 수 로딩 중

[논문리뷰] CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification

댓글 수 로딩 중

[논문리뷰] Deep Think with Confidence

댓글 수 로딩 중

[논문리뷰] Inverse-LLaVA: Eliminating Alignment Pre-training Through Text-to-Vision Mapping

댓글 수 로딩 중

[논문리뷰] Learning an Efficient Multi-Turn Dialogue Evaluator from Multiple Judges

댓글 수 로딩 중

[논문리뷰] Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

댓글 수 로딩 중

[논문리뷰] Phi-Ground Tech Report: Advancing Perception in GUI Grounding

댓글 수 로딩 중

[논문리뷰] Dr.LLM: Dynamic Layer Routing in LLMs

댓글 수 로딩 중

[논문리뷰] Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

댓글 수 로딩 중

[논문리뷰] UltraGen: High-Resolution Video Generation with Hierarchical Attention

댓글 수 로딩 중

[논문리뷰] Rewiring Experts on the Fly:Continuous Rerouting for Better Online Adaptation in Mixture-of-Expert models

댓글 수 로딩 중