A Benchmark Dataset and Baseline method for Satellite Video Multi-label Scene Classification


SAT-MTB-MLSC website | MLSC Baseline and Benchmark
 

Introduction

This is the official implementation of our paper "Satellite Video Multi-Label Scene Classification With Spatial and Temporal Feature Cooperative Encoding: A Benchmark Dataset and Method".

This is the first publicly available, large-scale satellite video multi-label scene classification dataset.
It consists of 18 classes of static and dynamic ground content, 3,549 videos, and 141,960 frames. We also propose a baseline method, STFCE. Our dataset: dataset
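In the multi-label setting, each video is annotated with a subset of the 18 classes rather than a single class. The snippet below is a minimal sketch of how such annotations can be encoded as an 18-dimensional multi-hot target vector; the example class indices are hypothetical and not taken from the released annotation files.

import numpy as np

NUM_CLASSES = 18  # SAT-MTB-MLSC defines 18 static and dynamic scene classes

def to_multi_hot(label_ids, num_classes=NUM_CLASSES):
    # Convert a list of class indices into a multi-hot target vector.
    target = np.zeros(num_classes, dtype=np.float32)
    target[list(label_ids)] = 1.0
    return target

# Hypothetical example: a video labeled with three co-occurring scene classes.
print(to_multi_hot([2, 5, 11]))  # -> vector with ones at positions 2, 5, and 11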

Our STFCE: as shown in the figure below.

We hope that this work opens up a new research topic and promotes the applications of satellite video.

Train and val dataset: DATASET.zip

Train and val frame features extracted by the Inception network: Train and Val frame features.zip

STFCE models: best model
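The train/eval scripts below read per-frame features from TFRecord files (--frame_features=True, --feature_names="rgb", --feature_sizes="1024"). As a rough aid for inspecting the downloaded frame-feature files, here is a minimal sketch that assumes a YouTube-8M-style SequenceExample layout with "id" and "labels" context features and a per-frame "rgb" byte-string feature list; these field names and the uint8 quantization are assumptions and may not match the released files exactly.

import tensorflow as tf

def inspect_record(path="val.tfrecord"):
    # Read one serialized example from the frame-feature TFRecord file.
    for raw in tf.data.TFRecordDataset(path).take(1):
        context, sequences = tf.io.parse_single_sequence_example(
            raw,
            context_features={
                "id": tf.io.FixedLenFeature([], tf.string),  # assumed field name
                "labels": tf.io.VarLenFeature(tf.int64),     # assumed field name
            },
            sequence_features={
                "rgb": tf.io.FixedLenSequenceFeature([], tf.string),  # matches --feature_names="rgb"
            },
        )
        labels = tf.sparse.to_dense(context["labels"]).numpy()
        # Each frame is assumed to be a byte string of 1024 quantized feature values.
        frames = tf.io.decode_raw(sequences["rgb"], tf.uint8)
        print("video id:", context["id"].numpy())
        print("label indices:", labels)               # subset of the 18 classes
        print("frame feature matrix:", frames.shape)  # (num_frames, 1024)

inspect_record()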

Installation

All our experiments were run on 4 Tesla V100 GPUs. We exported our conda and pip environment configurations into two files: conda_env.yml and requirements.txt.

  • Use the following commands to reproduce the environment and make sure your GPUs are available:
conda env create -f conda_env.yml
pip install -r requirements.txt
  • Download the dataset, the train-val frame features, and our pretrained models using the links above, then put them in the code root directory. We use the extracted frame features to train and test our model. Test our model:
python eval.py --eval_data_pattern="val.tfrecord" --model=LstmModel --train_dir=stfce_model --frame_features=True --feature_names="rgb" --feature_sizes="1024" --batch_size=1024 --base_learning_rate=0.0002 --lstm_random_sequence=True --run_once=True --top_k=18 --num_classes=18
  • Train our model:
python train.py --train_data_pattern=train.tfrecord --model=LstmModel --train_dir=stfce_model --frame_features=True --feature_names="rgb" --feature_sizes="1024" --batch_size=80 --base_learning_rate=0.0002 --lstm_random_sequence=True --max_step=1000 --num_classes=18 --export_model_steps=100 --num_epochs=36
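The --model=LstmModel flag above selects an LSTM-based aggregation of per-frame features into video-level multi-label scores. The sketch below illustrates that general idea in Keras (per-frame 1024-d features in, 18 sigmoid scores out); it is not the STFCE architecture from the paper, and the hidden size, optimizer choice, and frames-per-video value are assumptions (141,960 frames / 3,549 videos = 40 frames per video on average).

import tensorflow as tf

NUM_CLASSES = 18     # matches --num_classes=18
FEATURE_SIZE = 1024  # matches --feature_sizes="1024"

def build_lstm_baseline(num_frames=40):
    # Toy LSTM aggregator over per-frame features (illustration only).
    frames = tf.keras.Input(shape=(num_frames, FEATURE_SIZE), name="rgb_frames")
    # Encode the temporal sequence of frame features into a single video-level vector.
    encoded = tf.keras.layers.LSTM(512)(frames)
    # One independent sigmoid per class, as is standard for multi-label targets.
    scores = tf.keras.layers.Dense(NUM_CLASSES, activation="sigmoid")(encoded)
    model = tf.keras.Model(frames, scores)
    model.compile(optimizer=tf.keras.optimizers.Adam(2e-4),  # matches --base_learning_rate=0.0002
                  loss="binary_crossentropy")
    return model

build_lstm_baseline().summary()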

Citation

If you find this project useful in your research, please consider citing:

@ARTICLE{10471306,
  author={Guo, Weilong and Li, Shengyang and Chen, Feixiang and Sun, Yuhan and Gu, Yanfeng},
  journal={IEEE Transactions on Image Processing}, 
  title={Satellite Video Multi-Label Scene Classification With Spatial and Temporal Feature Cooperative Encoding: A Benchmark Dataset and Method}, 
  year={2024},
  volume={33},
  pages={2238-2251},
  doi={10.1109/TIP.2024.3374100}}
