본문으로 건너뛰기

#Synthetic Data Generation

33개의 포스트

[논문리뷰] K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

댓글 수 로딩 중

[논문리뷰] A Survey on LLM-based Conversational User Simulation

댓글 수 로딩 중

[논문리뷰] AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

댓글 수 로딩 중

[논문리뷰] On Data Engineering for Scaling LLM Terminal Capabilities

댓글 수 로딩 중

[논문리뷰] SERA: Soft-Verified Efficient Repository Agents

댓글 수 로딩 중

[논문리뷰] User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale

댓글 수 로딩 중

[논문리뷰] X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

댓글 수 로딩 중

[논문리뷰] AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs

댓글 수 로딩 중

[논문리뷰] World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models

댓글 수 로딩 중

[논문리뷰] Fara-7B: An Efficient Agentic Model for Computer Use

댓글 수 로딩 중

[논문리뷰] Taming Generative Synthetic Data for X-ray Prohibited Item Detection

댓글 수 로딩 중

[논문리뷰] Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks

댓글 수 로딩 중

[논문리뷰] Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

댓글 수 로딩 중

[논문리뷰] MisSynth: Improving MISSCI Logical Fallacies Classification with Synthetic Data

댓글 수 로딩 중

[논문리뷰] Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification

댓글 수 로딩 중

[논문리뷰] Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem

댓글 수 로딩 중

[논문리뷰] Attributes as Textual Genes: Leveraging LLMs as Genetic Algorithm Simulators for Conditional Synthetic Data Generation

댓글 수 로딩 중

[논문리뷰] LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training

댓글 수 로딩 중

[논문리뷰] Benchmark It Yourself (BIY): Preparing a Dataset and Benchmarking AI Models for Scatterplot-Related Tasks

댓글 수 로딩 중

[논문리뷰] olmOCR 2: Unit Test Rewards for Document OCR

댓글 수 로딩 중

[논문리뷰] Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

댓글 수 로딩 중