#Spatio-temporal Modeling

3개의 포스트

[논문리뷰] LiteFrame: Efficient Vision Encoders Unlock Frame Scaling in Video LLMs

본 연구는 장편 비디오 이해를 위해 Video LLMs를 확장할 때 발생하는 고질적인 계산 복잡도와 효율성 병목 문제를 해결하는 데 집중합니다.

#Review #Video LLMs #Vision Encoder #Token Compression #Compressed Token Distillation #Long-form Video Understanding #Spatio-temporal Modeling

2026년 5월 18일

[논문리뷰] Seeing the Wind from a Falling Leaf

본 연구는 영상 데이터로부터 나뭇잎이 떨어지는 바람과 같이 눈에 보이지 않는 물리적 힘(invisible forces)을 추정하는 것을 목표로 합니다. 인간이 시각적 단서만으로 보이지 않는 물리적 효과를 인지하는 능력을 모방하여, 비전과 물리학 간의 간극을 줄이고 픽셀 뒤의 물리적 과정을 이해하는 데 기여하고자 합니다.

#Review #Inverse Graphics #Differentiable Physics #Force Estimation #Video Generation #Material Point Method #3D Gaussians #Spatio-temporal Modeling #Vision-Language Models

2025년 12월 1일

[논문리뷰] Trace Anything: Representing Any Video in 4D via Trajectory Fields

본 논문은 비디오의 동적 장면을 모델링하고 이해하는 데 필수적인 효과적인 시공간 표현 문제를 해결하고자 합니다.

#Review #4D Video Representation #Trajectory Fields #Neural Networks #Spatio-temporal Modeling #3D Point Tracking #Motion Forecasting #Computer Vision #B-splines

2025년 10월 16일