GeoLangBind: Unifying Earth Observation with Agglomerative Vision-Language Foundation Models Paper • 2503.06312 • Published Mar 8, 2025
ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models Paper • 2402.11325 • Published Feb 17, 2024
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data Paper • 2210.12634 • Published Oct 23, 2022
EO-VAE: Towards A Multi-sensor Tokenizer for Earth Observation Data Paper • 2602.12177 • Published Feb 12
TerraScope: Pixel-Grounded Visual Reasoning for Earth Observation Paper • 2603.19039 • Published 4 days ago • 42
SkyEyeGPT: Unifying Remote Sensing Vision-Language Tasks via Instruction Tuning with Large Language Model Paper • 2401.09712 • Published Jan 18, 2024 • 1
Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation Paper • 2403.15356 • Published Mar 22, 2024
Decoupling Common and Unique Representations for Multimodal Self-supervised Learning Paper • 2309.05300 • Published Sep 11, 2023
SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation Paper • 2211.07044 • Published Nov 13, 2022
GAMUS: A Geometry-aware Multi-modal Semantic Segmentation Benchmark for Remote Sensing Data Paper • 2305.14914 • Published May 24, 2023
Towards a Unified Copernicus Foundation Model for Earth Vision Paper • 2503.11849 • Published Mar 14, 2025 • 4
REOBench: Benchmarking Robustness of Earth Observation Foundation Models Paper • 2505.16793 • Published May 22, 2025
EarthMind: Towards Multi-Granular and Multi-Sensor Earth Observation with Large Multimodal Models Paper • 2506.01667 • Published Jun 2, 2025 • 21