#Data Augmentation

35 posts

[Paper Review] The TTS-STT Flywheel: Synthetic Entity-Dense Audio Closes the Indic ASR Gap Where Commercial and Open-Source Systems Fail

[Paper Review] Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling

[Paper Review] TextPecker: Rewarding Structural Anomaly Quantification for Enhancing Visual Text Rendering

[Paper Review] SAGE: Scalable Agentic 3D Scene Generation for Embodied AI

[Paper Review] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

[Paper Review] RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation

[Paper Review] On the Role of Discreteness in Diffusion LLMs

[Paper Review] See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

[Paper Review] FaithLens: Detecting and Explaining Faithfulness Hallucination

[Paper Review] Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation

[Paper Review] TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

[Paper Review] GR-RL: Going Dexterous and Precise for Long-Horizon Robotic Manipulation

[Paper Review] Taming Generative Synthetic Data for X-ray Prohibited Item Detection

[Paper Review] ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning

[Paper Review] Synthetic bootstrapped pretraining

[Paper Review] OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

[Paper Review] Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

[Paper Review] Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem

[Paper Review] From Black Box to Transparency: Enhancing Automated Interpreting Assessment with Explainable AI in College Classrooms

[Paper Review] TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation

[Paper Review] Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

[Paper Review] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward

[Paper Review] Phi-Ground Tech Report: Advancing Perception in GUI Grounding

[Paper Review] PairUni: Pairwise Training for Unified Multimodal Language Models

[Paper Review] Fidelity-Aware Data Composition for Robust Robot Generalization

[Paper Review] Beyond Monolingual Assumptions: A Survey of Code-Switched NLP in the Era of Large Language Models

[Paper Review] EvolProver: Advancing Automated Theorem Proving by Evolving Formalized Problems via Symmetry and Difficulty

[Paper Review] KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints
