[논문리뷰] Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research AttemptsarXiv에 게시된 'Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts' 논문에 대한 자세한 리뷰입니다.#Review#Machine Learning Research#Autonomous Research#LLM Agents#Scientific Workflow#Failure Modes#Experimental Design#AI Scientist#Agentic Systems2026년 1월 7일댓글 수 로딩 중
[논문리뷰] ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?Ian L. V. Roque이 arXiv에 게시한 'ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?' 논문에 대한 자세한 리뷰입니다.#Review#AI Agents#Astrophysics Research#Reproducibility Benchmark#Large Language Models#Scientific Workflow#Code Execution#Evaluation Framework2025년 10월 29일댓글 수 로딩 중