#Debugging

6 posts

[Paper Review] Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

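This study addresses the black-box nature of Deep-Research Agents, which makes the causes of their errors hard to identify. Existing agent evaluations focus mainly on the accuracy of the final answer, so they are limited in diagnosing which intermediate step caused the reasoning to go wrong.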

#Review #Deep-Research Agents #Error Localization #Agent Trajectories #Span-Level Analysis #LLM Reasoning #Debugging

June 3, 2026

[cpython] GDB Debugging Support for the Python JIT: Implementing Stack Unwinding by Generating .eh_frame

An analysis of the technique for dynamically generating .eh_frame and DWARF CFI so that GDB can produce backtraces through CPython JIT code.

#CPython #JIT #GDB #DWARF #Debugging #LowLevel

May 2, 2026

[CPython] Adding Frame Pointer Preservation Validation to JIT Stencils

An analysis of the validation logic that checks whether the frame pointer is correctly preserved in the stencil code generated by the CPython JIT compiler.

#CPython #JIT #Frame Pointer #Debugging #Profiling #AArch64 #x86

March 27, 2026

[triton] Adding Partial Support for TMA and cp.async Operations to the Global Sanitizer

An analysis of the PR that adds tensor descriptor decoding and memory-access tracking for TMA/cp.async operations to Triton's Global Sanitizer.

#Triton #GSan #Sanitizer #TMA #AsyncCopy #Debugging

March 20, 2026

[Paper Review] DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems

This work aims to solve the complex problem of debugging LLM-based multi-agent systems.

#Review #LLM Multi-Agent Systems #Debugging #Intervention-Driven #Failure Attribution #Automated Debugging #Verification #AI Agents #Reliability

December 8, 2025

[triton] ConSan: Invalidating the JIT Cache by Ensuring Kernels Recompile on State Changes

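An analysis of the change that includes the Concurrency Sanitizer state in the compile options, so that kernels are automatically recompiled whenever the sanitizer is enabled or disabled.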

#Triton #ConSan #JIT #Cache #Sanitizer #Debugging

October 1, 2025