본문으로 건너뛰기

#Fine-tuning

63개의 포스트

[논문리뷰] Is Position Bias in Dense Retrievers Built In-or Learned from Data?

댓글 수 로딩 중

[논문리뷰] Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?

댓글 수 로딩 중

[논문리뷰] Learn Hard Problems During RL with Reference Guided Fine-tuning

댓글 수 로딩 중

[논문리뷰] Half-Truths Break Similarity-Based Retrieval

댓글 수 로딩 중

[논문리뷰] FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment

댓글 수 로딩 중

[논문리뷰] Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models

댓글 수 로딩 중

[논문리뷰] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

댓글 수 로딩 중

[논문리뷰] Typhoon OCR: Open Vision-Language Model For Thai Document Extraction

댓글 수 로딩 중

[논문리뷰] Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation

댓글 수 로딩 중

[논문리뷰] More Images, More Problems? A Controlled Analysis of VLM Failure Modes

댓글 수 로딩 중

[논문리뷰] COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

댓글 수 로딩 중

[논문리뷰] SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

댓글 수 로딩 중

[논문리뷰] Masks Can Be Distracting: On Context Comprehension in Diffusion Language Models

댓글 수 로딩 중

[논문리뷰] World in a Frame: Understanding Culture Mixing as a New Challenge for Vision-Language Models

댓글 수 로딩 중

[논문리뷰] TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

댓글 수 로딩 중

[논문리뷰] Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning

댓글 수 로딩 중

[논문리뷰] VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks

댓글 수 로딩 중

[논문리뷰] TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning

댓글 수 로딩 중

[논문리뷰] CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition

댓글 수 로딩 중

[논문리뷰] Video2Roleplay: A Multimodal Dataset and Framework for Video-Guided Role-playing Agents

댓글 수 로딩 중

[논문리뷰] Unraveling the cognitive patterns of Large Language Models through module communities

댓글 수 로딩 중

[논문리뷰] Demystifying Scientific Problem-Solving in LLMs by Probing Knowledge and Reasoning

댓글 수 로딩 중

[논문리뷰] CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning

댓글 수 로딩 중

[논문리뷰] AMFT: Aligning LLM Reasoners by Meta-Learning the Optimal Imitation-Exploration Balance

댓글 수 로딩 중

[논문리뷰] BiasGym: Fantastic Biases and How to Find (and Remove) Them

댓글 수 로딩 중

[논문리뷰] Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal

댓글 수 로딩 중

[논문리뷰] Performance Trade-offs of Optimizing Small Language Models for E-Commerce

댓글 수 로딩 중

[논문리뷰] VisJudge-Bench: Aesthetics and Quality Assessment of Visualizations

댓글 수 로딩 중

[논문리뷰] Mitigating Attention Sinks and Massive Activations in Audio-Visual Speech Recognition with LLMS

댓글 수 로딩 중

[논문리뷰] VLA-0: Building State-of-the-Art VLAs with Zero Modification

댓글 수 로딩 중

[논문리뷰] Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs

댓글 수 로딩 중

[논문리뷰] LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens

댓글 수 로딩 중

[논문리뷰] Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation

댓글 수 로딩 중

[논문리뷰] HiKE: Hierarchical Evaluation Framework for Korean-English Code-Switching Speech Recognition

댓글 수 로딩 중

[논문리뷰] NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving

댓글 수 로딩 중

[논문리뷰] Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

댓글 수 로딩 중

[논문리뷰] DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents

댓글 수 로딩 중

[논문리뷰] BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

댓글 수 로딩 중

[논문리뷰] Knowledge Homophily in Large Language Models

댓글 수 로딩 중

[논문리뷰] DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

댓글 수 로딩 중