본문으로 건너뛰기

#VLM

20개의 포스트

[논문리뷰] Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents

댓글 수 로딩 중

[논문리뷰] EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

댓글 수 로딩 중

[논문리뷰] OCR-Agent: Agentic OCR with Capability and Memory Reflection

댓글 수 로딩 중

[논문리뷰] Alterbute: Editing Intrinsic Attributes of Objects in Images

댓글 수 로딩 중

[논문리뷰] ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing

댓글 수 로딩 중

[논문리뷰] WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] RewardDance: Reward Scaling in Visual Generation

댓글 수 로딩 중