[논문리뷰] ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement ArenasKaichi Yu이 arXiv에 게시한 'ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas' 논문에 대한 자세한 리뷰입니다.#Review#LLM Agent#Tool Use#Trajectory Synthesis#Reinforcement Learning#Environment Synthesis#Data Generation#Multi-turn Interaction#Automated Training2026년 2월 1일댓글 수 로딩 중
[논문리뷰] Training Language Model Agents to Find Vulnerabilities with CTF-DojoZijian Wang이 arXiv에 게시한 'Training Language Model Agents to Find Vulnerabilities with CTF-Dojo' 논문에 대한 자세한 리뷰입니다.#Review#LLM Agents#Cybersecurity#CTF Challenges#Vulnerability Detection#Execution Environments#Docker#Automated Training#Verifiable Feedback2025년 8월 27일댓글 수 로딩 중