#3D Object Detection

3개의 포스트

[논문리뷰] VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection

본 연구는 정밀한 카메라 자세나 깊이 정보 와 같은 센서 기반의 기하학적 입력 없이 다중 시점 실내 3D 객체 탐지를 수행하는 Sensor-Geometry-Free (SG-Free) 설정을 목표로 합니다.

#Review #3D Object Detection #Multi-View #Sensor-Geometry-Free #Transformer #VGGT #Attention-Guided Query Generation #Query-Driven Feature Aggregation

2026년 3월 2일

[논문리뷰] N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

본 연구는 기존 멀티모달 모델이 2D 이미지에 의존하여 3D 공간 이해 능력이 부족하다는 한계를 해결하는 것을 목표로 합니다.

#Review #3D Grounding #Spatial Reasoning #Vision-Language Models #Depth Estimation #3D Object Detection #Chain-of-Thought #Data Generation #Multimodal AI

2025년 12월 18일

[논문리뷰] TUN3D: Towards Real-World Scene Understanding from Unposed Images

본 논문은 실세계 스캔에서 정확한 카메라 포즈나 깊이 정보 없이 다중 뷰 이미지 입력만으로 조인트 레이아웃 추정(layout estimation) 과 3D 객체 감지(3D object detection) 를 수행하는 최초의 방법론인 TUN3D 를 제시합니다.

#Review #3D Scene Understanding #Layout Estimation #3D Object Detection #Unposed Images #Sparse Convolutional Networks #Multi-view Stereo #Real-time AI

2025년 9월 29일