
Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images

We extend SegEarth-OV to SAR images. This is the first OVSS work for SAR images.

Xi'an Jiaotong University 

News


Abstract

Semantic segmentation of remote sensing images is pivotal for comprehensive Earth observation, but the demand for interpreting new object categories, coupled with the high expense of manual annotation, poses significant challenges. Although open-vocabulary semantic segmentation (OVSS) offers a promising solution, existing frameworks designed for natural images are insufficient for the unique complexities of remote sensing data. They struggle with vast scale variations and fine-grained details, and their adaptation often relies on extensive, costly annotations. To address this critical gap, this paper introduces SegEarth-OV, the first framework for annotation-free open-vocabulary segmentation of remote sensing images. Specifically, we propose SimFeatUp, a universal upsampler that robustly restores high-resolution spatial details from coarse Vision-Language Model (VLM) features, correcting distorted target shapes without any task-specific post-training. We also present a simple yet effective Global Bias Alleviation operation that subtracts the inherent global context from patch features, significantly enhancing local semantic fidelity. These components empower SegEarth-OV to effectively harness the rich semantics of pre-trained VLMs, making OVSS possible in optical remote sensing contexts. Furthermore, to extend the framework's universality to other challenging remote sensing modalities such as Synthetic Aperture Radar (SAR) images, where large-scale pre-trained VLMs (e.g., a SAR-CLIP) are unavailable and prohibitively expensive to create, we introduce AlignEarth, a distillation-based strategy that efficiently transfers semantic knowledge from an optical VLM encoder to a SAR encoder, bypassing the need to build SAR foundation models from scratch and enabling universal OVSS across diverse sensor types. Extensive experiments on both optical and SAR datasets validate that SegEarth-OV achieves dramatic improvements over state-of-the-art methods, establishing a robust foundation for annotation-free and open-world Earth observation.
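
To make the Global Bias Alleviation idea above concrete, here is a minimal, hypothetical PyTorch sketch of subtracting a global image feature from the patch features. The function name, tensor shapes, and the scaling factor alpha are illustrative assumptions, not the repository's actual implementation.

# Minimal sketch of the Global Bias Alleviation idea (illustrative only; names,
# shapes, and `alpha` are assumptions, not the code used in this repository).
import torch

def global_bias_alleviation(patch_feats, global_feat, alpha=1.0):
    # patch_feats: (B, N, C) patch tokens from the VLM image encoder
    # global_feat: (B, C) global image feature, e.g. the [CLS] token
    # Subtract the broadcast global feature from every patch feature so that
    # local semantics dominate the subsequent patch-to-text matching.
    return patch_feats - alpha * global_feat.unsqueeze(1)

# Toy usage: 2 images, 196 patches, 512-dim CLIP-like features
patches = torch.randn(2, 196, 512)
cls_token = torch.randn(2, 512)
print(global_bias_alleviation(patches, cls_token).shape)  # torch.Size([2, 196, 512])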

Dependencies and Installation

# 1. install SimFeatUp
# refer to https://github.com/likyoo/SimFeatUp

# 2. git clone this repository
git clone https://github.com/earth-insights/SegEarth-OV-2.git
cd SegEarth-OV-2

# 3. create new anaconda env
conda create -n SegEarth python=3.9
conda activate SegEarth

# 4. install torch and dependencies
pip install -r requirements.txt
# Exact dependency versions are not strict; in general, only mmcv and mmsegmentation need attention.

# 5. download the weights of AlignEarth
# HuggingFace: https://huggingface.co/likyoo/AlignEarth-SAR-ViT-B-16
# Baidu Disk: https://pan.baidu.com/s/1X-AOk3cgJoyU9qoOQKut9A?pwd=7mtz
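
As a quick sanity check after installation (a minimal sketch, not part of the repository), the snippet below simply confirms that torch, mmcv, and mmsegmentation import correctly and prints their versions, since those are the packages flagged above as version-sensitive.

# check_env.py -- hypothetical helper, not shipped with this repository
import torch
import mmcv
import mmseg

print("torch:", torch.__version__)
print("mmcv:", mmcv.__version__)
print("mmsegmentation:", mmseg.__version__)
print("CUDA available:", torch.cuda.is_available())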

Datasets

For SAR images, you can download all datasets directly here.

For optical images, please refer to dataset_prepare.md for dataset preparation.

Model Evaluation

Single-GPU:

python eval.py --config ./configs/cfg_DATASET.py --workdir YOUR_WORK_DIR

Multi-GPU:

bash ./dist_test.sh ./configs/cfg_DATASET.py

Evaluation on all datasets:

python eval_all.py

Results will be saved in results.xlsx.
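
To inspect the aggregated results programmatically, a minimal sketch is shown below; it assumes pandas (with openpyxl) is installed and that eval_all.py has written results.xlsx to the current directory. The column layout is whatever eval_all.py produces and is not defined by this example.

# Hypothetical helper for browsing results.xlsx; requires pandas + openpyxl.
import pandas as pd

df = pd.read_excel("results.xlsx")  # read the sheet written by eval_all.py
print(df.head())                    # show the first few rows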

Citation

@article{li2025segearthov2,
  title={Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images},
  author={Li, Kaiyu and Cao, Xiangyong and Liu, Ruixun and Wang, Shihong and Jiang, Zixuan and Wang, Zhi and Meng, Deyu},
  journal={arXiv preprint arXiv:2508.18067},
  year={2025}
}
