#Scientific Reasoning

16 posts

[Paper Review] SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research

[Paper Review] SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents

[Paper Review] Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision

[Paper Review] P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

[Paper Review] Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

[Paper Review] A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

[Paper Review] ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

[Paper Review] MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model

[Paper Review] SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

[Paper Review] Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning

[Paper Review] CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics

[Paper Review] T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

[Paper Review] ExpVid: A Benchmark for Experiment Video Understanding & Reasoning

[Paper Review] Unleashing Scientific Reasoning for Bio-experimental Protocol Generation via Structured Component-based Reward Mechanism
