[Paper Review] SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization
A detailed review of the paper 'SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization', posted to arXiv by Bolin Ni.
#Review #Reinforcement Learning #Reward Shaping #Agent Optimization #GUI Automation #Complex Reasoning #Sample Efficiency #Tiered Rewards
February 1, 2026
[Paper Review] DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry
A detailed review of the paper 'DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry', posted to arXiv by Yanchao Li.
#Review #Multimodal Large Language Model #Dental Imaging #Complex Reasoning #Domain Adaptation #Reinforcement Learning #Medical VQA #Dental Healthcare
December 14, 2025
[Paper Review] AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis
A detailed review of the paper 'AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis', posted to arXiv.
#Review #LLM Agents #Data Synthesis #Zone of Proximal Development (ZPD) #Complex Reasoning #Tool Use #Automated Benchmarking #Agentic AI #Rejection Sampling Fine-Tuning
October 29, 2025