#Multi-task Learning

23 posts

[Paper Review] Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

[Paper Review] Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

[Paper Review] Vero: An Open RL Recipe for General Visual Reasoning

[Paper Review] Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

[Paper Review] Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making

[Paper Review] VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory

[Paper Review] SOP: A Scalable Online Post-Training System for Vision-Language-Action Models

[Paper Review] UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

[Paper Review] SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

[Paper Review] 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

[Paper Review] PatenTEB: A Comprehensive Benchmark and Model Family for Patent Text Embedding

[Paper Review] E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

[Paper Review] Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations
