[논문리뷰] Rethinking Token-Level Policy Optimization for Multimodal Chain-of-ThoughtZhaojie Liu이 arXiv에 게시한 'Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought' 논문에 대한 자세한 리뷰입니다.#Review#Multimodal Chain-of-Thought#Reinforcement Learning#Token-Level Optimization#Visual Similarity#Entropy2026년 3월 24일댓글 수 로딩 중
[논문리뷰] ExpSeek: Self-Triggered Experience Seeking for Web AgentsarXiv에 게시된 'ExpSeek: Self-Triggered Experience Seeking for Web Agents' 논문에 대한 자세한 리뷰입니다.#Review#Web Agents#Experience Seeking#Self-Triggered#LLM Reasoning#Entropy#Proactive Guidance#Reinforcement Learning#Foundation Models2026년 1월 14일댓글 수 로딩 중
[논문리뷰] When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMsHaotian Wang이 arXiv에 게시한 'When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs' 논문에 대한 자세한 리뷰입니다.#Review#Multimodal Large Language Models (MLLMs)#Modality Following#Unimodal Uncertainty#Modality Preference#Conflict Resolution#Internal Mechanism#Entropy#Controllable Dataset2025년 11월 9일댓글 수 로딩 중
[논문리뷰] Revisiting the Uniform Information Density Hypothesis in LLM Reasoning TracesarXiv에 게시된 'Revisiting the Uniform Information Density Hypothesis in LLM Reasoning Traces' 논문에 대한 자세한 리뷰입니다.#Review#LLM Reasoning#Chain-of-Thought#Uniform Information Density#Information Theory#Reasoning Trace Analysis#Entropy#Mathematical Reasoning#Model Evaluation2025년 10월 9일댓글 수 로딩 중
[논문리뷰] EntroPE: Entropy-Guided Dynamic Patch Encoder for Time Series ForecastingarXiv에 게시된 'EntroPE: Entropy-Guided Dynamic Patch Encoder for Time Series Forecasting' 논문에 대한 자세한 리뷰입니다.#Review#Time Series Forecasting#Transformer#Dynamic Patching#Entropy#Predictive Uncertainty#Adaptive Encoding#Attention Mechanisms#Causal Transformer2025년 10월 1일댓글 수 로딩 중