G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning Paper • 2511.21688 • Published 15 days ago • 8 • 2
G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning Paper • 2511.21688 • Published 15 days ago • 8
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning Paper • 2511.21688 • Published 15 days ago • 8
Running on Zero Featured 309 Depth Anything 3 🏢 309 Generate depth maps from images using GPU acceleration
facebook/dinov3-vith16plus-pretrain-lvd1689m Image Feature Extraction • 0.8B • Updated Aug 19 • 106k • 35