#Continuous Control

6개의 포스트

[논문리뷰] ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

기존 GUI 에이전트들이 주로 이산적인 클릭 예측에 의존하여 연속적이고 자유로운 형태의 드래그(예: 그림 그리기, 캡차 풀이)와 같이 즉각적인 시각적 인지와 조정이 필요한 복잡한 GUI 상호작용을 수행하기 어렵다는 문제를 해결합니다.

#Review #GUI Automation #Flow-based Generative Models #Continuous Control #Vision-Language Models #Human-Computer Interaction #ScreenDrag Benchmark #Dexterous Manipulation

2026년 1월 13일

[논문리뷰] RynnVLA-002: A Unified Vision-Language-Action and World Model

본 논문은 기존 VLA 모델(액션 다이내믹스 이해 부족, 상상력 및 물리 지식 결여)과 월드 모델(직접적인 액션 생성 불가)의 한계를 극복하기 위해, VLA 모델과 월드 모델을 단일 프레임워크로 통합 하는 것을 목표로 합니다.

#Review #Vision-Language-Action (VLA) Model #World Model #Robotics #Unified Framework #Multi-modal Learning #Action Generation #Attention Mask #Continuous Control

2025년 11월 23일

[논문리뷰] SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control

기존 instruction-based image editing 모델들이 고정된 강도로 편집을 적용하여 개별 편집에 대한 정밀하고 연속적인 제어가 불가능하다는 한계를 해결하고자 합니다.

#Review #Image Editing #Continuous Control #Fine-Grained Control #Instruction-based #Low-Rank Adaptation #Disentanglement #Generative Models

2025년 11월 13일

[논문리뷰] CAMAR: Continuous Actions Multi-Agent Routing

이 논문은 기존 다중 에이전트 강화 학습(MARL) 벤치마크가 연속적인 상태 및 행동 공간, 복잡한 조정 및 계획 작업을 충분히 지원하지 못하는 한계를 해결하고자 합니다.

#Review #Multi-Agent Reinforcement Learning #Continuous Control #Pathfinding #MARL Benchmark #GPU Acceleration #Robotics Simulation #Scalability #Heterogeneous Agents

2025년 8월 20일

[논문리뷰] Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints

논문은 기존의 엔트로피 정규화 방식들이 최적화 목표를 왜곡하거나 특정 도메인에만 적용 가능한 한계를 지적하며, 범용적이고 비침습적이며 이론적으로 근거 있는 새로운 엔트로피 제약 패러다임을 제안하는 것을 목표로 합니다. 이는 다양한 AI/ML 문제에서 정책의 탐색 능력과 견고성을 향상시키고자 합니다.

#Review #Entropy Regularization #Activation Functions #Continuous Control #Large Language Models #Image Classification #Reinforcement Learning #Policy Stochasticity #Entropy Constraints

2025년 10월 10일

[논문리뷰] SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder

이 논문은 대규모 텍스트-투-이미지 확산 모델의 이미지 편집 시 미세하고 연속적인 제어 부족 문제를 해결하는 것을 목표로 합니다.

#Review #Image Editing #Diffusion Models #Sparse Autoencoder (SAE)#Text-to-Image #Disentangled Control #Continuous Control #Token-level Manipulation #Text Embeddings

2025년 10월 7일