본문으로 건너뛰기

#Chain-of-Thought

139개의 포스트

[논문리뷰] Thinking Before Constraining: A Unified Decoding Framework for Large Language Models

댓글 수 로딩 중

[논문리뷰] EvalVerse: Pipeline-Aware and Expert-Calibrated Benchmarking for Professional Cinematic Video Generation

댓글 수 로딩 중

[논문리뷰] ETCHR: Editing To Clarify and Harness Reasoning

댓글 수 로딩 중

[논문리뷰] LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

댓글 수 로딩 중

[논문리뷰] Bernini: Latent Semantic Planning for Video Diffusion

댓글 수 로딩 중

[논문리뷰] CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning

댓글 수 로딩 중

[논문리뷰] Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds

댓글 수 로딩 중

[논문리뷰] LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics

댓글 수 로딩 중

[논문리뷰] Chain-of-Thought Degrades Visual Spatial Reasoning Capabilities of Multimodal LLMs

댓글 수 로딩 중

[논문리뷰] HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

댓글 수 로딩 중

[논문리뷰] Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization

댓글 수 로딩 중

[논문리뷰] Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

댓글 수 로딩 중

[논문리뷰] The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning

댓글 수 로딩 중

[논문리뷰] Vero: An Open RL Recipe for General Visual Reasoning

댓글 수 로딩 중

[논문리뷰] PLUME: Latent Reasoning Based Universal Multimodal Embedding

댓글 수 로딩 중

[논문리뷰] InCoder-32B-Thinking: Industrial Code World Model for Thinking

댓글 수 로딩 중

[논문리뷰] ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

댓글 수 로딩 중

[논문리뷰] Reasoning Shift: How Context Silently Shortens LLM Reasoning

댓글 수 로딩 중

[논문리뷰] FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

댓글 수 로딩 중

[논문리뷰] Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

댓글 수 로딩 중

[논문리뷰] InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

댓글 수 로딩 중

[논문리뷰] CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation

댓글 수 로딩 중

[논문리뷰] From Scale to Speed: Adaptive Test-Time Scaling for Image Editing

댓글 수 로딩 중

[논문리뷰] CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning

댓글 수 로딩 중

[논문리뷰] Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks

댓글 수 로딩 중

[논문리뷰] OCR-Agent: Agentic OCR with Capability and Memory Reflection

댓글 수 로딩 중

[논문리뷰] UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

댓글 수 로딩 중

[논문리뷰] GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

댓글 수 로딩 중

[논문리뷰] ThinkRouter: Efficient Reasoning via Routing Thinking between Latent and Discrete Spaces

댓글 수 로딩 중

[논문리뷰] Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

댓글 수 로딩 중

[논문리뷰] Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

댓글 수 로딩 중

[논문리뷰] TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning

댓글 수 로딩 중

[논문리뷰] Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing

댓글 수 로딩 중

[논문리뷰] ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing

댓글 수 로딩 중

[논문리뷰] EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering for Enhanced Alignment and Reasoning

댓글 수 로딩 중

[논문리뷰] Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

댓글 수 로딩 중

[논문리뷰] OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification

댓글 수 로딩 중

[논문리뷰] EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

댓글 수 로딩 중

[논문리뷰] VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence

댓글 수 로딩 중

[논문리뷰] Rectifying LLM Thought from Lens of Optimization

댓글 수 로딩 중

[논문리뷰] LongVT: Incentivizing 'Thinking with Long Videos' via Native Tool Calling

댓글 수 로딩 중

[논문리뷰] Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information

댓글 수 로딩 중

[논문리뷰] Step-Audio-R1 Technical Report

댓글 수 로딩 중

[논문리뷰] Music Flamingo: Scaling Music Understanding in Audio Language Models

댓글 수 로딩 중

[논문리뷰] VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

댓글 수 로딩 중

[논문리뷰] MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

댓글 수 로딩 중

[논문리뷰] Multi-Step Knowledge Interaction Analysis via Rank-2 Subspace Disentanglement

댓글 수 로딩 중

[논문리뷰] MR-Align: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models

댓글 수 로딩 중

[논문리뷰] X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning

댓글 수 로딩 중

[논문리뷰] Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory

댓글 수 로딩 중

[논문리뷰] SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

댓글 수 로딩 중

[논문리뷰] What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT

댓글 수 로딩 중

[논문리뷰] TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs

댓글 수 로딩 중

[논문리뷰] AuditoryBench++: Can Language Models Understand Auditory Knowledge without Hearing?

댓글 수 로딩 중

[논문리뷰] EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving

댓글 수 로딩 중

[논문리뷰] Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated

댓글 수 로딩 중

[논문리뷰] SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction

댓글 수 로딩 중

[논문리뷰] Kwai Keye-VL 1.5 Technical Report

댓글 수 로딩 중

[논문리뷰] Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning

댓글 수 로딩 중

[논문리뷰] Explain Before You Answer: A Survey on Compositional Visual Reasoning

댓글 수 로딩 중

[논문리뷰] Aryabhata: An exam-focused language model for JEE Math

댓글 수 로딩 중

[논문리뷰] Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

댓글 수 로딩 중

[논문리뷰] Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks

댓글 수 로딩 중

[논문리뷰] HPSv3: Towards Wide-Spectrum Human Preference Score

댓글 수 로딩 중

[논문리뷰] 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

댓글 수 로딩 중

[논문리뷰] Beyond One World: Benchmarking Super Heros in Role-Playing Across Multiversal Contexts

댓글 수 로딩 중

[논문리뷰] R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

댓글 수 로딩 중

[논문리뷰] ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping

댓글 수 로딩 중

[논문리뷰] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

댓글 수 로딩 중

[논문리뷰] Factuality Matters: When Image Generation and Editing Meet Structured Visuals

댓글 수 로딩 중

[논문리뷰] LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

댓글 수 로딩 중

[논문리뷰] Distractor Injection Attacks on Large Reasoning Models: Characterization and Defense

댓글 수 로딩 중

[논문리뷰] DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

댓글 수 로딩 중