#Contrastive Learning

49 posts

[Paper Review] Critic-R: Improving Agentic Search using Instruction-tuned Retrievers with Natural Language Introspective Feedback

[Paper Review] MERIT: Learning Disentangled Music Representations for Audio Similarity

[Paper Review] Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer

[Paper Review] Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

[Paper Review] CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization

[Paper Review] MegaStyle: Constructing Diverse and Scalable Style Dataset via Consistent Text-to-Image Style Mapping

[Paper Review] π-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

[Paper Review] SLER-IR: Spherical Layer-wise Expert Routing for All-in-One Image Restoration

[Paper Review] InfoNCE Induces Gaussian Distribution

[Paper Review] CGPT: Cluster-Guided Partial Tables with LLM-Generated Supervision for Table Retrieval

[Paper Review] OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

[Paper Review] Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

[Paper Review] Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment

[Paper Review] Pillar-0: A New Frontier for Radiology Foundation Models

[Paper Review] Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework

[Paper Review] Llama-Embed-Nemotron-8B: A Universal Text Embedding Model for Multilingual and Cross-Lingual Tasks

[Paper Review] Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation

[Paper Review] Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

[Paper Review] Modality Alignment with Multi-scale Bilateral Attention for Multimodal Recommendation

[Paper Review] CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning

[Paper Review] Refining Contrastive Learning and Homography Relations for Multi-Modal Recommendation

[Paper Review] UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation

[Paper Review] CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search

[Paper Review] E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

[Paper Review] WithAnyone: Towards Controllable and ID Consistent Image Generation

[Paper Review] UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning

[Paper Review] FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

[Paper Review] SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model

[Paper Review] No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models

[Paper Review] Towards Mixed-Modal Retrieval for Universal Retrieval-Augmented Generation

[Paper Review] OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
