[논문리뷰] FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature ExplorationarXiv에 게시된 'FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration' 논문에 대한 자세한 리뷰입니다.#Review#Scientific Idea Generation#Flow-Guided MCTS#GFlowNet#Test-Time Evolution#Isolation Island Paradigm#Generative Reward Model#Autonomous Research2026년 3월 31일댓글 수 로딩 중
[논문리뷰] Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement LearningYuchen Eleanor Jiang이 arXiv에 게시한 'Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning' 논문에 대한 자세한 리뷰입니다.#Review#LLM Personalization#Reinforcement Learning#Generative Reward Model#Critique-Post-Edit#Reward Hacking#Controllable AI2025년 10월 22일댓글 수 로딩 중