본문으로 건너뛰기

#Instruction Following

36개의 포스트

[논문리뷰] Aurora: Unified Video Editing with a Tool-Using Agent

댓글 수 로딩 중

[논문리뷰] VEFX-Bench: A Holistic Benchmark for Generic Video Editing and Visual Effects

댓글 수 로딩 중

[논문리뷰] Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

댓글 수 로딩 중

[논문리뷰] Fish Audio S2 Technical Report

댓글 수 로딩 중

[논문리뷰] Do What I Say: A Spoken Prompt Dataset for Instruction-Following

댓글 수 로딩 중

[논문리뷰] LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

댓글 수 로딩 중

[논문리뷰] T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

댓글 수 로딩 중

[논문리뷰] Olmo 3

댓글 수 로딩 중

[논문리뷰] EditThinker: Unlocking Iterative Reasoning for Any Image Editor

댓글 수 로딩 중

[논문리뷰] ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

댓글 수 로딩 중

[논문리뷰] MIRA: Multimodal Iterative Reasoning Agent for Image Editing

댓글 수 로딩 중

[논문리뷰] Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following

댓글 수 로딩 중

[논문리뷰] Motif 2 12.7B technical report

댓글 수 로딩 중

[논문리뷰] OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing

댓글 수 로딩 중

[논문리뷰] Instruction-Following Evaluation in Function Calling for Large Language Models

댓글 수 로딩 중

[논문리뷰] FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

댓글 수 로딩 중

[논문리뷰] Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems

댓글 수 로딩 중

[논문리뷰] RecoWorld: Building Simulated Environments for Agentic Recommender Systems

댓글 수 로딩 중

[논문리뷰] Language Self-Play For Data-Free Training

댓글 수 로딩 중

[논문리뷰] Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

댓글 수 로딩 중

[논문리뷰] Do What? Teaching Vision-Language-Action Models to Reject the Impossible

댓글 수 로딩 중

[논문리뷰] NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

댓글 수 로딩 중

[논문리뷰] IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

댓글 수 로딩 중

[논문리뷰] The End of Manual Decoding: Towards Truly End-to-End Language Models

댓글 수 로딩 중

[논문리뷰] LimRank: Less is More for Reasoning-Intensive Information Reranking

댓글 수 로딩 중

[논문리뷰] InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

댓글 수 로딩 중