[Paper Review] Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence
A detailed review of the paper 'Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence', posted to arXiv by Yuning Gong.
#Review #3D Spatial Intelligence #Video Stream Processing #Automated Data Curation #3D Gaussian Splatting (3DGS) #Vision-Language Models (VLMs) #Open-Vocabulary Segmentation #Spatial Reasoning #Multimodal Datasets
March 9, 2026
[Paper Review] OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
A detailed review of the paper 'OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding', posted to arXiv.
#Review #3D Scene Understanding #Open-Vocabulary Segmentation #Referring Expression Segmentation #Training-Free #Voxel Grouping #Vision-Language Models #Multi-modal Large Language Models #Sparse Voxel Rasterization
January 14, 2026
[Paper Review] SAM 3: Segment Anything with Concepts
A detailed review of the paper 'SAM 3: Segment Anything with Concepts', posted to arXiv.
#Review #Segment Anything Model #Open-Vocabulary Segmentation #Multimodal Foundation Model #Instance Segmentation #Video Object Tracking #Prompt Engineering #Data Engine #Human-in-the-loop
November 23, 2025