본문으로 건너뛰기

#Knowledge Distillation

53개의 포스트

[논문리뷰] StreamChar: Long-Horizon Streaming Character Audio-Video Generation with Decoupled Orchestration

댓글 수 로딩 중

[논문리뷰] Trust-Region Behavior Blending for On-Policy Distillation

댓글 수 로딩 중

[논문리뷰] COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

댓글 수 로딩 중

[논문리뷰] ETCHR: Editing To Clarify and Harness Reasoning

댓글 수 로딩 중

[논문리뷰] Trees to Flows and Back: Unifying Decision Trees and Diffusion Models

댓글 수 로딩 중

[논문리뷰] Structured Distillation of Web Agent Capabilities Enables Generalization

댓글 수 로딩 중

[논문리뷰] Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval

댓글 수 로딩 중

[논문리뷰] On-Policy Self-Distillation for Reasoning Compression

댓글 수 로딩 중

[논문리뷰] Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

댓글 수 로딩 중

[논문리뷰] Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

댓글 수 로딩 중

[논문리뷰] Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning

댓글 수 로딩 중

[논문리뷰] EvasionBench: Detecting Evasive Answers in Financial Q&A via Multi-Model Consensus and LLM-as-Judge

댓글 수 로딩 중

[논문리뷰] Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

댓글 수 로딩 중

[논문리뷰] SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices

댓글 수 로딩 중

[논문리뷰] Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

댓글 수 로딩 중

[논문리뷰] Enhancing Object Detection with Privileged Information: A Model-Agnostic Teacher-Student Approach

댓글 수 로딩 중

[논문리뷰] GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training

댓글 수 로딩 중

[논문리뷰] SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens

댓글 수 로딩 중

[논문리뷰] NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

댓글 수 로딩 중

[논문리뷰] GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay

댓글 수 로딩 중

[논문리뷰] Beyond Linear Bottlenecks: Spline-Based Knowledge Distillation for Culturally Diverse Art Style Classification

댓글 수 로딩 중

[논문리뷰] PatenTEB: A Comprehensive Benchmark and Model Family for Patent Text Embedding

댓글 수 로딩 중

[논문리뷰] EchoDistill: Bidirectional Concept Distillation for One-Step Diffusion Personalization

댓글 수 로딩 중

[논문리뷰] BitNet Distillation

댓글 수 로딩 중

[논문리뷰] FlashWorld: High-quality 3D Scene Generation within Seconds

댓글 수 로딩 중

[논문리뷰] LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

댓글 수 로딩 중

[논문리뷰] Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

댓글 수 로딩 중

[논문리뷰] ACON: Optimizing Context Compression for Long-horizon LLM Agents

댓글 수 로딩 중