#Encoder-Decoder Architecture

2개의 포스트

[논문리뷰] GoClick: Lightweight Element Grounding Model for Autonomous GUI Interaction

본 연구는 GUI Agent를 모바일 기기와 같은 자원 제약 환경에 효과적으로 배포하기 위해, 기존 대규모 VLM이 가진 과도한 연산 비용과 메모리 요구사항 문제를 해결하고자 한다. 대부분의 최신 VLM은 2.5B 이상의 파라미터를 사용하여 온디바이스 환경에서 활용하기 어렵다는 한계가 있다.

#Review #GUI Agent #Vision-Language Model #Visual Grounding #Data Refinement #Model Compression #Encoder-Decoder Architecture

2026년 4월 28일

[논문리뷰] ARC-Encoder: learning compressed text representations for large language models

본 논문은 대규모 언어 모델(LLM)의 긴 컨텍스트 처리로 인한 추론 비용 증가와 컨텍스트 창 제한 문제를 해결하고자 합니다. 특히, 디코더 모델의 아키텍처를 수정하거나 파인튜닝하지 않고도 컨텍스트를 압축하여 LLM의 일반적인 능력을 유지하면서 효율성을 높이는 것을 목표로 합니다.

#Review #Context Compression #Large Language Models #Encoder-Decoder Architecture #Text Representation #In-Context Learning #Parameter Efficiency #Retrieval-Augmented Generation

2025년 10월 27일