#Geometric Consistency

12개의 포스트

[논문리뷰] BA-T: An Iterative Transformer for Two-View Bundle Adjustment

본 연구는 기존의 feed-forward 3D 재구성 모델들이 의존하는 heavy decoder stack의 비효율성과 기하학적 self-correction 메커니즘의 부재를 해결하고자 합니다.

#Review #Bundle Adjustment #Iterative Transformer #Implicit Latent Space #Two-View Reconstruction #Pose Estimation #Geometric Consistency

2026년 6월 2일

[논문리뷰] Quantitative Video World Model Evaluation for Geometric-Consistency

본 연구는 현존하는 생성형 비디오 모델이 시각적으로는 고품질을 구현하지만, 엄격한 물리적 법칙을 따르는 3D 공간 이해도는 낮다는 점을 해결하고자 합니다.

#Review #Video World Models #Geometric Consistency #PDI-Bench #3D Lifting #Perspective Distortion Index #Physical Realism

2026년 5월 14일

[논문리뷰] VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

대규모 비디오 Diffusion 모델은 뛰어난 시각적 품질을 보여주지만, 카메라 궤적의 불안정성이나 기하학적 표류(Geometric Drift)와 같은 3D/4D 일관성 문제에 취약합니다 .

#Review #Video Diffusion Models #Geometric Consistency #Reinforcement Learning #Latent Geometry Model #4D Reconstruction #Group Relative Policy Optimization

2026년 3월 31일

[논문리뷰] Repurposing Geometric Foundation Models for Multi-view Diffusion

최근 latent space의 발전이 single-image generation에서 상당한 진전을 이끌었지만, Novel View Synthesis (NVS) 를 위한 최적의 latent space는 대부분 미탐색 상태로 남아있습니다.

#Review #Geometric Foundation Models #Multi-view Diffusion #Novel View Synthesis (NVS)#Latent Space Design #Geometric Consistency #Diffusion Models #RGB Reconstruction #3D Consistency

2026년 3월 23일

[논문리뷰] TAPESTRY: From Geometry to Appearance via Consistent Turntable Videos

Untextured 3D 모델에 대해 사진처럼 사실적이고 자체 일관성(self-consistent) 있는 외관을 자동으로 생성하는 것은 디지털 콘텐츠 제작 분야에서 중요한 도전 과제입니다.

#Review #Video Generation #3D Texturing #Geometric Consistency #Turntable Video #Diffusion Models #Neural Rendering

2026년 3월 22일

[논문리뷰] WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

논문은 단일 이미지로부터 장범위(long-range) 및 기하학적으로 일관된 새로운 시점 비디오를 생성하는 근본적인 문제를 해결하고자 합니다.

#Review #Novel View Synthesis #3D Geometry Propagation #Video Diffusion Models #Gaussian Splatting #Autoregressive Generation #Spatio-Temporal Noise #Geometric Consistency

2025년 12월 22일

[논문리뷰] Efficiently Reconstructing Dynamic Scenes One D4RT at a Time

논문은 복잡한 동적 장면의 기하학적 구조와 움직임을 비디오로부터 효율적으로 재구성하는 것을 목표로 합니다. 기존의 단편적이고 컴퓨팅 비용이 높은 3D 재구성 접근 방식의 한계를 극복하고, 단일의 통일된 모델로 깊이, 시공간 대응, 전체 카메라 파라미터 추론을 수행하는 4D 이해 프레임워크 를 제시하고자 합니다.

#Review #Dynamic Scene Reconstruction #4D Reconstruction #Point Tracking #Transformer Architecture #Feedforward Model #Query-based Inference #Computer Vision #Geometric Consistency

2025년 12월 9일

[논문리뷰] ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding

본 연구는 기존 비디오 리테이크 생성 방법론이 가변 길이 입력, 동적 카메라 모션, 분포 외 카메라 궤적에 취약하며, 종종 워핑 아티팩트나 흐릿한 객체를 생성하는 한계를 해결하고자 합니다.

#Review #Video Retake Generation #Camera Control #Rotary Position Embedding (RoPE)#Rotary Camera Encoding (RoCE)#Geometric Consistency #Video Generative Models #Transformer Architecture #Multi-view Synthesis

2025년 11월 25일

[논문리뷰] SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis

본 논문은 단일 뷰(single-view) HOI 비디오 생성의 기하학적 왜곡 및 비현실적인 모션 문제와 3D HOI 방법론의 제한된 일반화 능력 문제를 해결하고자 합니다.

#Review #Hand-Object Interaction #Multi-view Video Generation #4D Motion Synthesis #Diffusion Models #Spatio-temporal Consistency #Geometric Consistency #Appearance and Motion Joint Modeling

2025년 11월 24일

[논문리뷰] WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance

본 연구는 기존 비디오 확산 모델(VDM)이 3D/4D 작업에서 겪는 제어 가능성, 시공간 일관성, 기하학적 충실도의 한계를 해결하고자 합니다.

#Review #Video Diffusion Models #3D/4D Generation #Training-Free Guidance #Camera Trajectory Control #Novel View Synthesis #Geometric Consistency #Inference-Time Optimization

2025년 9월 19일

[논문리뷰] WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation

로봇 조작을 위한 VLA(Vision-Language-Action) 모델 은 미세한 손-객체 상호작용을 포착하는 손목 시점(wrist-view) 관찰에 크게 의존하지만, 대규모 데이터셋에서는 이러한 손목 시점 데이터가 부족합니다.

#Review #4D World Models #Robotic Manipulation #Video Generation #Multi-view Synthesis #Visual-Language-Action (VLA)#Geometric Consistency #Diffusion Models #Wrist-View

2025년 10월 9일

[논문리뷰] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks

본 논문은 기존 3D 객체 편집 방법들이 비효율적이고 일관성이 부족하며, 편집되지 않은 영역을 보존하는 데 실패하는 문제를 해결하고자 합니다.

#Review #3D Object Editing #Training-Free #FlowEdit #Mask-Free #Deep Generative Models #TRELLIS #Data Generation #Geometric Consistency

2025년 10월 20일