본문으로 건너뛰기

#Data Curation

57개의 포스트

[논문리뷰] Is Position Bias in Dense Retrievers Built In-or Learned from Data?

댓글 수 로딩 중

[논문리뷰] MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping

댓글 수 로딩 중

[논문리뷰] Watch Before You Answer: Learning from Visually Grounded Post-Training

댓글 수 로딩 중

[논문리뷰] Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation

댓글 수 로딩 중

[논문리뷰] GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

댓글 수 로딩 중

[논문리뷰] ClinAlign: Scaling Healthcare Alignment from Clinician Preference

댓글 수 로딩 중

[논문리뷰] FireRed-Image-Edit-1.0 Techinical Report

댓글 수 로딩 중

[논문리뷰] Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions

댓글 수 로딩 중

[논문리뷰] ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning

댓글 수 로딩 중

[논문리뷰] Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition

댓글 수 로딩 중

[논문리뷰] Action100M: A Large-scale Video Action Dataset

댓글 수 로딩 중

[논문리뷰] DreamStyle: A Unified Framework for Video Stylization

댓글 수 로딩 중

[논문리뷰] UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement

댓글 수 로딩 중

[논문리뷰] Olmo 3

댓글 수 로딩 중

[논문리뷰] DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

댓글 수 로딩 중

[논문리뷰] The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

댓글 수 로딩 중

[논문리뷰] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

댓글 수 로딩 중

[논문리뷰] Music Flamingo: Scaling Music Understanding in Audio Language Models

댓글 수 로딩 중

[논문리뷰] Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora

댓글 수 로딩 중

[논문리뷰] DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation

댓글 수 로딩 중

[논문리뷰] LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

댓글 수 로딩 중

[논문리뷰] SAIL-VL2 Technical Report

댓글 수 로딩 중

[논문리뷰] TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis

댓글 수 로딩 중

[논문리뷰] Wan-S2V: Audio-Driven Cinematic Video Generation

댓글 수 로딩 중

[논문리뷰] Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models

댓글 수 로딩 중

[논문리뷰] InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities

댓글 수 로딩 중

[논문리뷰] MiDashengLM: Efficient Audio Understanding with General Audio Captions

댓글 수 로딩 중

[논문리뷰] The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

댓글 수 로딩 중

[논문리뷰] Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought

댓글 수 로딩 중

[논문리뷰] MLE-Smith: Scaling MLE Tasks with Automated Multi-Agent Pipeline

댓글 수 로딩 중

[논문리뷰] TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

댓글 수 로딩 중

[논문리뷰] ComProScanner: A multi-agent based framework for composition-property structured data extraction from scientific literature

댓글 수 로딩 중

[논문리뷰] OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

댓글 수 로딩 중

[논문리뷰] DA^2: Depth Anything in Any Direction

댓글 수 로딩 중