#Refactoring

37개의 포스트

[sglang] GSM8K 평가를 Chat API 기반으로 통합

GSM8K 평가 경로를 few-shot 전용 모듈에서 Chat API 기반 simple_eval로 통합하여 CI 회귀 테스트 일관성 확보

#SGLang #Evaluation #GSM8K #Testing #Refactoring

2026년 4월 2일

[sglang] Dumper 디버그 유틸리티 리팩토링: 설정 구조 개선과 Non-intrusive 모드 도입

SGLang의 dumper.py를 upstream main에서 동기화하며 설정 클래스 구조 개선, CLI key=value 파싱 지원, non-intrusive 모드 등을 추가한 대규모 리팩토링 분석.

#SGLang #Debug #Refactoring #Python #LLM Inference

2026년 3월 30일

[CPython 3.13] pickle fast_save_enter() 테스트 정리 (backport)

pickle 모듈의 fast_save 테스트에서 불필요한 dict wrapper와 seed 매개변수를 제거한 3.13 backport 분석.

#CPython #pickle #Testing #Backport #Refactoring #Python

2026년 3월 27일

[CPython] pickle fast_save_enter() 테스트 정리 및 불필요한 wrapper 제거

pickle 모듈의 fast_save 관련 테스트에서 불필요한 dict wrapper를 제거하고 deep_nested_struct의 seed 매개변수를 제거하여 테스트를 단순화한 분석.

#CPython #pickle #Testing #Refactoring #Python

2026년 3월 26일

[SGLang] Diffusion JIT 커널 테스트 레이아웃 리팩터링 및 CI 트리거 정밀화

JIT 커널 테스트/벤치마크를 diffusion/ 서브폴더로 이동하고 CI 트리거를 관련 경로에만 반응하도록 좁힌다

#SGLang #CI/CD #Testing #Refactoring

2026년 3월 26일

[pytest] request.getfixturevalue()의 dirty optimization 제거

동적으로 요청한 fixture를 arg2fixturedefs에 추가하던 불필요한 최적화를 제거하고 Mapping 타입으로 변경

#Python #pytest #Fixtures #Refactoring #Code Quality

2026년 3월 17일

[triton] AMD GFX1250 MXFP Flash Attention 예제 커널 대규모 리팩터링

preshuffle 로직 제거, TDM store 도입, expand_dims 전환 등 GFX1250 FA 예제를 단순화하고 성능을 개선한 리팩터링을 분석합니다.

#Triton #AMD #GPU #FlashAttention #GFX1250 #Refactoring

2026년 3월 12일

[Grafana Loki] 배치 처리를 파이프라인 래퍼로 분리하여 캐시 통합 준비

실행기의 drain 로직에 섞여 있던 배치 처리를 독립 파이프라인으로 추출하여, 태스크 캐시 구현의 기반을 마련한 리팩터링 분석.

#Grafana Loki #Go #Refactoring #Pipeline #Arrow

2026년 3월 11일

[triton] Concurrency Sanitizer를 Vendor Target Hooks로 리팩터링

Triton의 Concurrency Sanitizer를 벤더 독립적인 인터페이스로 리팩터링하여 NVIDIA 외 다른 GPU 벤더도 지원할 수 있게 한 PR 분석.

#Triton #ConSan #Sanitizer #Refactoring #VendorHooks #Architecture

2026년 3월 9일

[Open WebUI] 사용자 메모리 컬렉션 쿼리에 소유권 검증 추가

user-memory 및 file 컬렉션에 대한 접근 권한 검증을 공통 함수로 추출하여 보안 강화.

#Open WebUI #Python #Security #Performance #Refactoring

2026년 3월 1일

[triton] Backend별 global_scratch_alloc 할당 통합

Proton 프로파일러의 scratch 메모리를 별도 풀로 분리하고, third-party allocation 지원을 추가하여 global scratch 메모리 관리를 통합한 사례를 분석합니다.

#Triton #GPU #MemoryAllocation #Proton #Refactoring

2026년 2월 26일

[faster-qwen3-tts] 패키지 리네이밍 및 코드 간소화

qwen3_tts_cuda_graphs에서 faster_qwen3_tts로 리네이밍하고 불필요한 코드를 정리한다

#faster-qwen3-tts #TTS #Refactoring #Naming

2026년 2월 20일

[faster-qwen3-tts] 로컬 모델 경로를 HuggingFace Hub ID로 전환하여 배포 간소화

Qwen3-TTS CUDA Graphs 프로젝트에서 하드코딩된 로컬 모델 경로를 HuggingFace Hub ID로 교체하고, config 파싱 로직을 제거하여 코드를 단순화한 사례를 분석합니다.

#Qwen3-TTS #HuggingFace #Model Loading #Python #Refactoring

2026년 2월 20일

[Ray] ExecutionCache 도입으로 데이터셋 캐싱 로직 통합 및 간소화

산재된 스냅샷 변수들을 ExecutionCache 클래스로 통합하고, 반복 실행과 일반 실행의 캐시 검증을 일관되게 만든 분석.

#Ray #Python #Refactoring #Cache #Performance #Data Pipeline

2026년 2월 18일

[faster-qwen3-tts] 프로젝트 구조 정리: 불필요한 문서 제거와 파일명 표준화

faster-qwen3-tts 프로젝트에서 632줄의 불필요한 문서를 제거하고 핵심 모듈 파일명을 표준화하여 유지보수성을 개선한 리팩토링 사례를 분석합니다.

#Qwen3-TTS #Refactoring #Project Structure #Python #Clean Code

2026년 2월 16일

[triton] AMD Async Load에 ROCDL Op 사용으로 전환

AMD GPU의 async load 연산에서 LLVM intrinsic 문자열 기반 호출을 타입 안전한 ROCDL op으로 교체한 NFC(Non-Functional Change) PR 분석.

#Triton #AMD #ROCDL #AsyncCopy #NFC #Refactoring

2026년 2월 9일

[pydantic-ai] Bedrock CachePoint가 여러 trailing 문서 사이에 잘못 배치되는 버그 수정

AWS Bedrock에서 복수의 문서/비디오가 연속될 때 CachePoint가 마지막 문서 앞이 아닌 전체 그룹 앞에 올바르게 배치되도록 수정한 사례를 분석합니다.

#pydantic-ai #AWS Bedrock #Caching #Bug Fix #Refactoring

2026년 2월 5일

[triton] AMD MoveUpPrologueLoads로 ReorderInstructions 패스 완전 대체

여러 차례 최적화가 제거된 ReorderInstructions를 단일 목적의 MoveUpPrologueLoads 패스로 대체하여 코드 명확성을 높인 PR을 분석합니다.

#Triton #AMD #Refactoring #Compiler #Pipeline

2026년 2월 1일

[pytest] 캐시 디렉터리 생성 로직 단순화 — 원자적 생성 함수 추출

pytest 캐시 디렉터리 생성을 _make_cachedir() 함수로 추출하고 TemporaryDirectory 대신 shutil.rmtree로 정리

#Python #pytest #Refactoring #File System #Concurrency

2026년 1월 29일

[Ray Data] 논리적 최적화 규칙에서 in-place 변형을 제거하여 불변성 준비

limit_pushdown, predicate_pushdown, inherit_batch_format 규칙이 DAG 노드를 직접 수정하던 패턴을 복사-재구축 방식으로 전환한 리팩터링 분석.

#Ray #Python #Refactoring #DAG #Query Optimization

2026년 1월 26일

[Loki] memory 서브패키지 통합으로 코드 구조 개선

memory/bitmap, memory/buffer를 memory 패키지로 통합하여 중복 제거

#Grafana Loki #Go #Refactoring #Performance

2026년 1월 16일

[Triton] ReduceOp 로우어링을 LinearLayout 기반으로 개선 및 단순화

ReduceOp 로우어링을 LinearLayout 기반으로 재설계하여 shmem swizzling 활용, 불필요한 round-trip 제거

#Triton #MLIR #Compiler Optimization #LinearLayout #Refactoring

2026년 1월 12일

[pytorch] CI: fbgemm/torchrec 핀 버전 업데이트 및 빌드 로직 리팩토링

PyTorch CI에서 fbgemm과 torchrec의 핀 버전을 업데이트하고, fbgemm 빌드 로직을 install_fbgemm 함수로 분리하여 CUDA/ROCm 양쪽에서 재사용 가능하게 리팩토링한 사례를 분석합니다.

#PyTorch #CI #fbgemm #torchrec #ROCm #Build System #Refactoring

2026년 1월 11일

[triton] AMD ReorderInstructions에서 no-op sinkDotConversion 최적화 제거

ConvertLayout이 이미 local_load로 대체된 후 실행되어 효과가 없는 sinkDotConversion 최적화를 제거하여 코드 복잡성을 줄인 PR을 분석합니다.

#Triton #AMD #Refactoring #Dead Code #MLIR

2026년 1월 9일

[Triton] Proton GlobalScratchAllocOp 폐기 — TritonGPU 공용 op으로 통합

Proton 전용 GlobalScratchAllocOp을 TritonGPU의 공용 op으로 교체하고, backend 속성으로 할당 정책을 구분한다

#Triton #Proton #MLIR #Refactoring #Op Deprecation

2026년 1월 7일

[triton] Proton의 Runtime과 Metric 상관관계 단순화로 오버헤드 감소

Proton 프로파일러의 Data/Metric 인터페이스를 재설계하여 이중 잠금과 불필요한 조회를 제거하고 프로파일링 오버헤드를 줄인 사례를 분석합니다.

#Triton #Proton #Profiling #Performance #Refactoring

2026년 1월 4일

[triton] AMD ReorderInstructions에서 효과 없는 sinkSecondLoad 최적화 제거

제한적 케이스에서만 트리거되고 성능 영향이 없는 sinkSecondLoad 최적화를 제거하여 ReorderInstructions를 단순화한 PR을 분석합니다.

#Triton #AMD #Refactoring #Dead Code #Cleanup

2025년 12월 30일

[Triton] Gluon 검증 로직을 C++ verifier로 이동 — 차원 축소 로드 지원

Python assert 기반 검증을 C++ verifier로 이동하여 dimension-reducing load를 올바르게 지원한다

#Triton #Gluon #MLIR #Verifier #Refactoring

2025년 12월 18일

[triton] Triton Kernel의 Matrix Multiplication 리팩토링: 코드 가독성과 유지보수성 향상

Triton의 행렬 곱셈 관련 모듈을 정리하고 변수 명명 규칙을 개선하여 코드의 일관성과 유지보수성을 높인 리팩토링 사례를 분석합니다.

#Triton #GPU #Kernel #Refactoring #MatrixMultiplication

2025년 11월 23일

[Triton] TRITON_INTERPRET 모드에서 언어 패치 자동 정리

인터프리터 모드가 triton.language를 패치한 후 자동으로 원래 상태로 복원하도록 개선

#Triton #Interpreter #Python #Refactoring

2025년 11월 14일

[Ray Core] 메모리 스토어와 플라즈마 스토어에서 참조 카운터 분리 리팩터링

Ray의 CoreWorker에서 메모리 스토어와 플라즈마 스토어에 결합되어 있던 참조 카운터 로직을 상위 레이어로 분리하여, 코드 얽힘을 해소하고 유지보수성을 개선한 PR을 분석합니다.

#Ray #Ray Core #Refactoring #C++#Memory Management #Reference Counting

2025년 11월 13일

[Gradio] 큐 성능 개선 — MCP 응답 속도 향상을 위한 구조 리팩터링

MCP 도구 호출 경로를 리팩터링하고 클라이언트 초기화 오버헤드를 제거하여 큐 처리 성능을 개선한다

#Gradio #MCP #Queue Performance #Refactoring

2025년 11월 13일

[triton] rewrite-partition-dependencies를 insert-aref로 통합하여 Warp Specialization 파이프라인 간소화

Triton Warp Specialization의 partition dependency 재작성 pass를 insert-aref pass에 통합하여 컴파일 파이프라인을 간소화한 PR 분석.

#Triton #WarpSpecialization #MLIR #Compiler #Refactoring

2025년 11월 3일

[triton] Matmul에서 Split-K Reduction과 Inter-Expert Reduction 분리

Triton Kernels의 matmul_ogs에서 split-k reduction을 inter-expert reduction과 분리하여 MoE 파이프라인의 유연성을 높인 PR 분석.

#Triton #MatMul #SplitK #MoE #Reduction #Refactoring

2025년 10월 29일

[Ray] OpResourceAllocator 리팩토링으로 데이터 흐름 명시화

Ray Data의 리소스 할당 시스템인 OpResourceAllocator를 리팩토링하여, API에서 데이터 흐름을 명시적으로 표현하고 디버깅을 위한 progress bar 정보를 강화한 변경 사항을 분석합니다.

#Ray #Python #Refactoring #Resource Management #Data Pipeline #Architecture

2025년 10월 27일

[Grafana Loki] 쿼리 옵티마이저를 bottom-up에서 top-down 방식으로 리팩터링하여 중복 작업 제거

DAG 노드마다 규칙을 개별 적용하던 bottom-up 옵티마이저를 루트에서 시작하는 top-down 방식으로 전환하여, 중복 규칙 적용과 추론 복잡성을 제거한 분석.

#Grafana Loki #Go #Performance #Query Optimizer #Refactoring

2025년 10월 24일

[Triton] debuginfo 테스트 단순화 — subprocess 제거

별도 프로세스를 spawn하던 디버그 정보 테스트를 pytest parametrize와 monkeypatch로 리팩터링

#Triton #Testing #Refactoring #Python

2025년 10월 3일