[Paper Review] From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
A detailed review of the paper 'From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models', posted to arXiv by Wei Ye.
#Review #Large Multimodal Models #Iterative Training #Diagnostic-Driven Learning #Reinforcement Learning #Multimodal Reasoning #Data Generation #Agent Systems
February 26, 2026
[Paper Review] DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning
A detailed review of the paper 'DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning', posted to arXiv by Zheli Liu.
#Review #Preference Learning #LLMs #User Feedback #Dissatisfaction Signals #DPO #Iterative Training #RLHF #Exploration
October 8, 2025