본문으로 건너뛰기

#Data Selection

10개의 포스트

[논문리뷰] Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

댓글 수 로딩 중

[논문리뷰] DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

댓글 수 로딩 중

[논문리뷰] A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)

댓글 수 로딩 중

[논문리뷰] OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

댓글 수 로딩 중

[논문리뷰] Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

댓글 수 로딩 중

[논문리뷰] Data-Efficient RLVR via Off-Policy Influence Guidance

댓글 수 로딩 중