#Consistency Distillation

2개의 포스트

[논문리뷰] Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

본 논문은 대규모 언어 모델(LLM)의 순차적(autoregressive, AR) 디코딩으로 인한 높은 지연 시간을 해결하고, AR 모델의 생성 품질과 인과적 추론 특성을 유지하면서 효율적인 병렬 디코딩을 가능하게 하는 것을 목표로 합니다.

#Review #Parallel Decoding #Causal LLM #Jacobi Decoding #Consistency Distillation #Transformer Inference #Latency Reduction #Rejection Recycling #Multi-block Decoding

2025년 12월 17일

[논문리뷰] Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

본 논문은 멀티모달 대규모 언어 모델(MLLMs)에서 시각 토큰이 소모하는 막대한 계산 자원으로 인한 효율성 저하 문제를 해결하고자 합니다. 특히, 시각 토큰 압축 과정에서 발생하는 학습 난이도 증가 와 특징 공간 교란 문제를 해결하여, 효율성을 높이면서도 성능 저하를 최소화하는 것을 목표로 합니다.

#Review #Multi-modal LLMs #Token Compression #Efficiency #Knowledge Distillation #Progressive Learning #Consistency Distillation #MLLM Training

2025년 10월 6일