Compose multimodal datasets 🎹
-
Updated
Jan 5, 2026 - Python
Compose multimodal datasets 🎹
[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"
[CVPR 2026] G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
[NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
Visual Spatial Tuning
Official code of "Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding"
Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training
[Awesome-Spatial-VLMs] This repository is the official, community-maintained resource for the survey paper: Spatial Intelligence in Vision-Language Models: A Comprehensive Survey;
[ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models
Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"
[CVPR 2026] SpatialScore: Towards Comprehensive Evaluation for Spatial Intelligence
Code for "ReSpace: Text-Driven 3D Indoor Scene Synthesis and Editing with Preference Alignment"
[CVPR 2025] Program synthesis for 3D spatial reasoning
[NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs
[CVPR2020] A Dataset for SPAtial REasoning on Three-View Line Drawings
[ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
Qualitative Reasoning: Spatio-Temporal Reasoning using Relation Algebras and Constraint Networks. Documentation is under construction at ReadTheDocs. See link below.
[ICLR2026] Spatial Reasoning with Vision-Language Models
[AAAI 2022] Dataset and pytorch codes for the paper titled "StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts" in AAAI 2022 (Oral)
Add a description, image, and links to the spatial-reasoning topic page so that developers can more easily learn about it.
To associate your repository with the spatial-reasoning topic, visit your repo's landing page and select "manage topics."