본문으로 건너뛰기

#LoRA

59개의 포스트

[논문리뷰] LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

댓글 수 로딩 중

[논문리뷰] LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models

댓글 수 로딩 중

[논문리뷰] Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development

댓글 수 로딩 중

[논문리뷰] Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning

댓글 수 로딩 중

[논문리뷰] Training Large Language Models to Predict Clinical Events

댓글 수 로딩 중

[논문리뷰] The TTS-STT Flywheel: Synthetic Entity-Dense Audio Closes the Indic ASR Gap Where Commercial and Open-Source Systems Fail

댓글 수 로딩 중

[논문리뷰] Encoder-Free Human Motion Understanding via Structured Motion Descriptions

댓글 수 로딩 중

[논문리뷰] AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors

댓글 수 로딩 중

[논문리뷰] Diffutron: A Masked Diffusion Language Model for Turkish Language

댓글 수 로딩 중

[논문리뷰] LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation

댓글 수 로딩 중

[논문리뷰] Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data

댓글 수 로딩 중

[논문리뷰] ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

댓글 수 로딩 중

[논문리뷰] PureCC: Pure Learning for Text-to-Image Concept Customization

댓글 수 로딩 중

[논문리뷰] StereoAdapter-2: Globally Structure-Consistent Underwater Stereo Depth Estimation

댓글 수 로딩 중

[논문리뷰] SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer

댓글 수 로딩 중

[논문리뷰] DreamStyle: A Unified Framework for Video Stylization

댓글 수 로딩 중

[논문리뷰] PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

댓글 수 로딩 중

[논문리뷰] IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning

댓글 수 로딩 중

[논문리뷰] Glance: Accelerating Diffusion Models with 1 Sample

댓글 수 로딩 중

[논문리뷰] First Frame Is the Place to Go for Video Content Customization

댓글 수 로딩 중

[논문리뷰] Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models

댓글 수 로딩 중

[논문리뷰] MisSynth: Improving MISSCI Logical Fallacies Classification with Synthetic Data

댓글 수 로딩 중

[논문리뷰] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models

댓글 수 로딩 중

[논문리뷰] DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework

댓글 수 로딩 중

[논문리뷰] AlignGuard-LoRA: Alignment-Preserving Fine-Tuning via Fisher-Guided Decomposition and Riemannian-Geodesic Collision Regularization

댓글 수 로딩 중

[논문리뷰] Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

댓글 수 로딩 중

[논문리뷰] LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal

댓글 수 로딩 중