#Compression

5개의 포스트

[논문리뷰] FRAPPE: Full Input, Residual Output Autoencoding with Projection Pursuit Encoder

본 연구는 로봇, 웨어러블 기기 등 자원이 제한된 환경에서 클라우드 기반의 AI 인식을 원활하게 수행하기 위한 실시간 영상 압축 기술의 한계를 해결하고자 합니다.

#Review #Compression #Autoencoder #Projection Pursuit #Asymmetric Codec #Real-time #Resource-constrained #Variable-rate

2026년 5월 31일

[cpython] tarfile 스트리밍 모드(r|*) 성능 개선: 파이썬 압축 파일 처리의 숨겨진 병목 제거

tarfile 모듈의 r|* 모드에서 발생하던 비효율적인 버퍼링 문제를 해결하여 압축 파일 읽기 성능을 획기적으로 개선한 PR 분석.

#Python #tarfile #Performance #Optimization #CPython #Compression #Streaming

2026년 5월 30일

[llm-compressor] Compression Save: compressed-tensors 체크포인트 저장

transformers/compression 디렉토리가 압축된 모델을 compressed-tensors 포맷으로 직렬화해 vLLM/SGLang이 로딩 가능하도록 만드는 구조 분석

#llm-compressor #Compression #Save #compressed-tensors

2026년 4월 13일

[논문리뷰] TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

본 논문은 기존 KV Cache 압축 기법들이 post-RoPE 공간의 제한된 관측치에 의존하여 발생하는 불안정성 문제를 해결하고자 합니다. 기존 방식들은 회전이 적용된 post-RoPE 쿼리를 사용하므로, 대표성 있는 쿼리 확보가 어려워 중요한 토큰이 조기에 삭제되는 현상이 빈번합니다 .

#Review #KV Cache #LLM #Attention #RoPE #Compression #Reasoning

2026년 4월 6일

[논문리뷰] Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model

본 논문은 기존 월드 모델의 수백 개의 잠재 토큰 이 실시간 계획 수립에 필요한 계산 비용을 과도하게 증가시키는 문제를 해결하고자 합니다.

#Review #World Model #Discrete Tokenizer #Latent Representation #Action Planning #Model Predictive Control #Real-time AI #Compression #Vision Foundation Model

2026년 3월 8일