[Triton] preload에 optional device 인자 추가

2025년 12월 9일수정: 2025년 12월 9일

PR 링크: triton-lang/triton#8951 상태: Merged | 변경: +3 / -2

들어가며

Triton의 preload 메서드는 serialized specialization data를 사용하여 커널을 미리 컴파일하는 기능이다. 기존에는 항상 현재 활성화된 디바이스를 사용했는데, 멀티 GPU 환경에서는 특정 디바이스를 지정해야 하는 경우가 있다. 이 PR은 preload에 optional device 인자를 추가한다.

핵심 코드 분석

Before

def preload(self, specialization_data):
    import json
    import triton.language as tl
    device = driver.active.get_current_device()

항상 get_current_device()를 호출하여 현재 디바이스를 사용했다.

After

def preload(self, specialization_data, device=None):
    import json
    import triton.language as tl
    if device is None:
        device = driver.active.get_current_device()

device 인자가 None이면 기존과 동일하게 동작하고, 지정하면 해당 디바이스를 사용한다.