본문으로 건너뛰기

#Generalization

68개의 포스트

[논문리뷰] GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

댓글 수 로딩 중

[논문리뷰] Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models

댓글 수 로딩 중

[논문리뷰] Context Training with Active Information Seeking

댓글 수 로딩 중

[논문리뷰] Structured Distillation of Web Agent Capabilities Enables Generalization

댓글 수 로딩 중

[논문리뷰] Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval

댓글 수 로딩 중

[논문리뷰] DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

댓글 수 로딩 중

[논문리뷰] CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

댓글 수 로딩 중

[논문리뷰] FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models

댓글 수 로딩 중

[논문리뷰] Demystifying Action Space Design for Robotic Manipulation Policies

댓글 수 로딩 중

[논문리뷰] CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

댓글 수 로딩 중

[논문리뷰] Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

댓글 수 로딩 중

[논문리뷰] Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving

댓글 수 로딩 중

[논문리뷰] World Guidance: World Modeling in Condition Space for Action Generation

댓글 수 로딩 중

[논문리뷰] A Very Big Video Reasoning Suite

댓글 수 로딩 중

[논문리뷰] VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

댓글 수 로딩 중

[논문리뷰] ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

댓글 수 로딩 중

[논문리뷰] DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning

댓글 수 로딩 중

[논문리뷰] AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

댓글 수 로딩 중

[논문리뷰] OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

댓글 수 로딩 중

[논문리뷰] Stronger Normalization-Free Transformers

댓글 수 로딩 중

[논문리뷰] Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

댓글 수 로딩 중

[논문리뷰] VideoVLA: Video Generators Can Be Generalizable Robot Manipulators

댓글 수 로딩 중

[논문리뷰] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

댓글 수 로딩 중

[논문리뷰] From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

댓글 수 로딩 중

[논문리뷰] PretrainZero: Reinforcement Active Pretraining

댓글 수 로딩 중

[논문리뷰] Revisiting the Necessity of Lengthy Chain-of-Thought in Vision-centric Reasoning Generalization

댓글 수 로딩 중

[논문리뷰] Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization

댓글 수 로딩 중

[논문리뷰] Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

댓글 수 로딩 중

[논문리뷰] RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

댓글 수 로딩 중

[논문리뷰] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] On Robustness and Reliability of Benchmark-Based Evaluation of LLMs

댓글 수 로딩 중

[논문리뷰] EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

댓글 수 로딩 중

[논문리뷰] CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

댓글 수 로딩 중

[논문리뷰] Scalable Multi-Task Reinforcement Learning for Generalizable Spatial Intelligence in Visuomotor Agents

댓글 수 로딩 중

[논문리뷰] Flow Equivariant Recurrent Neural Networks

댓글 수 로딩 중

[논문리뷰] The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

댓글 수 로딩 중

[논문리뷰] CityRiSE: Reasoning Urban Socio-Economic Status in Vision-Language Models via Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Generalization or Memorization: Dynamic Decoding for Mode Steering

댓글 수 로딩 중

[논문리뷰] LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

댓글 수 로딩 중

[논문리뷰] D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI

댓글 수 로딩 중

[논문리뷰] Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

댓글 수 로딩 중