#Vector Quantization

7개의 포스트

[논문리뷰] IDEAL: In-DEpth ALignment Makes A Discrete Representation AutoEncoder

본 논문은 VFM 기반의 RAE가 재구성 품질과 의미 보존 사이에서 겪는 근본적인 병목 현상을 해결하고자 합니다. 기존 연구들은 주로 깊은 계층의 의미론적 정보에만 의존하는데, 이는 디테일한 시각적 속성(색상, 텍스트, 로컬 구조 등)을 소실시키는 결과를 초래합니다.

#Review #Representation Autoencoder #Vision Foundation Models #Vector Quantization #Autoregressive Generation #Semantic Preservation #Reconstruction Fidelity

2026년 6월 11일

[논문리뷰] Representation Forcing for Bottleneck-Free Unified Multimodal Models

본 논문은 기존 UMM이 frozen VAE에 의존하여 발생하는 structural bottleneck 문제를 해결하기 위해 Representation Forcing (RF)을 제안한다 .

#Review #Unified Multimodal Models #Representation Forcing #Pixel-space Diffusion #Vector Quantization #End-to-End Learning #Bottleneck-Free #Mixture-of-Transformers

2026년 5월 31일

[논문리뷰] SuperLocalMemory V3.3: The Living Brain -- Biologically-Inspired Forgetting, Cognitive Quantization, and Multi-Channel Retrieval for Zero-LLM Agent Memory Systems

본 논문은 정보 기하학에 기반한 FRQAD와 Local TurboQuant를 도입하여 메모리 저장 효율과 검색 정밀도를 동시에 달성한다. 저자들은 Fokker-Planck 동역학을 활용하여 메모리의 수명 주기를 수학적으로 관리하며, 이를 통해 고정밀에서 저정밀(32-bit에서 2-bit까지)로 이어지는 단계적 메모리 압축을 구현한다.

#Review #Agent Memory #Information Geometry #Vector Quantization #Ebbinghaus Forgetting #Cognitive Architecture #Soft Prompts #Fisher-Rao

2026년 4월 16일

[논문리뷰] GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation

본 논문은 기존 3D 생성 모델들이 주로 사용하는 전체적(holistic)인 Denoising이나 Diffusion 방식이 3D 환경의 점진적 구축과 편집에 부적합하다는 문제를 해결하고자 한다. 기존 방식은 고정된 장면을 한 번에 생성하는 경향이 있어, 실제 환경처럼 점진적으로 확장하거나 수정하는 유연성이 부족하다.

#Review #3D Gaussian Splatting #Autoregressive Modeling #Scene Generation #Transformer #Vector Quantization #3D Scene Completion

2026년 4월 1일

[논문리뷰] VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

본 논문은 반지도 학습 기반 의료 영상 분할에서 기존 dropout 방식의 불안정하고 튜닝이 어려운 특성 교란 문제를 해결하고자 합니다.

#Review #Semi-supervised Learning #Medical Image Segmentation #Vector Quantization #Consistency Learning #Feature Perturbation #Foundation Models #Dropout Replacement

2026년 1월 15일

[논문리뷰] VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

멀티모달 이해, 생성 및 재구성 표현을 단일 토크나이저 내에서 통합하는 핵심 과제를 해결하고자 합니다. 기존의 듀얼 인코더 방식의 복잡성과 이산형 토크나이저의 의미 이해 능력 저하 문제를 극복하고, 연속형 의미 특징 과 이산형 미세 토큰 을 동시에 생성할 수 있는 통합 토크나이저를 제안하는 것이 목표입니다.

#Review #Multimodal Learning #Vector Quantization #Autoencoder #Unified Tokenizer #Image Generation #Image Reconstruction #Vision Transformers #Semantic Features

2025년 12월 11일

[논문리뷰] LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

본 논문은 효율적인 단일 배치 대규모 언어 모델(LLM) 추론을 위해 FPGA 의 장점을 활용하는 것을 목표로 합니다. 특히, 기존 산술 기반 연산에서 메모리 기반 연산 으로 전환하여 GPU 대비 FPGA의 성능 및 에너지 효율성 한계를 극복하고, 온디바이스 AI 구현을 위한 핵심 기술을 개발하고자 합니다.

#Review #FPGA #Large Language Models (LLM)#Inference Optimization #Memory-based Computation #Vector Quantization #Table Lookup #Hardware Acceleration

2025년 11월 10일