#Expert Parallelism

2개의 포스트

[SGLang] Expert Parallel MoE: 분산 전문가 레이어 구현

SGLang의 Expert Parallel MoE를 분석한다. 전문가를 여러 GPU에 분산하는 EP 구조, All-to-All 통신, 로컬/리모트 전문가 라우팅을 코드와 함께 살펴본다.

#sglang #Expert Parallelism #EP-MoE #All-to-All #Distributed

2026년 4월 12일

[논문리뷰] VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Bin Jia이 arXiv에 게시한 'VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo' 논문에 대한 자세한 리뷰입니다.

#Review #Omni-modal LLMs #Distributed Training #Model-centric #Parallelism #FSDP #Sequence Parallelism #Expert Parallelism #Mixture-of-Experts

2025년 8월 5일