#Human-Computer Interaction

20 posts

[Paper Review] DrawMotion: Generating 3D Human Motions by Freehand Drawing

[Paper Review] VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics

[Paper Review] PIRA-Bench: A Transition from Reactive GUI Agents to GUI-based Proactive Intent Recommendation Agents

[Paper Review] Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

[Paper Review] Continual GUI Agents

[Paper Review] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

[Paper Review] ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands

[Paper Review] DreamOmni3: Scribble-based Editing and Generation

[Paper Review] Step-GUI Technical Report

[Paper Review] StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

[Paper Review] Aligning Generative Music AI with Human Preferences: Methods and Challenges

[Paper Review] PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits

[Paper Review] 'Does the cafe entrance look accessible? Where is the door?' Towards Geospatial AI Agents for Visual Inquiries

[Paper Review] InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

[Paper Review] C3: A Bilingual Benchmark for Spoken Dialogue Models Exploring Challenges in Complex Conversations
