We observe that 3D medical images contain consistent geometric context, i.e., consistent geometric relations between different organs, which offers a promising way to learn consistent representations. Motivated by this, we propose a simple yet effective Volume Contrast (VoCo) framework that leverages geometric context priors for self-supervision. Given an input volume, we extract base crops from different regions to construct positive and negative pairs for contrastive learning. We then predict the contextual position of a random crop by contrasting its similarity to the base crops. In this way, VoCo implicitly encodes the inherent geometric context into model representations, facilitating high-level semantic learning without annotations.
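The position-prediction objective described above can be sketched in a few lines of PyTorch. The snippet below is a minimal illustration of the idea, not the repository's implementation; `encoder`, the crop shapes, and `overlap_labels` (the fraction of the random crop falling inside each base region) are assumptions made for the example.

```python
# Minimal sketch of the VoCo idea (illustrative only, not the official code).
import torch.nn.functional as F


def voco_position_loss(encoder, base_crops, random_crop, overlap_labels):
    """Predict the contextual position of `random_crop` by contrasting its
    similarity to the non-overlapping `base_crops`.

    base_crops:     (K, C, D, H, W) base crops tiling the input volume
    random_crop:    (1, C, D, H, W) randomly sampled crop
    overlap_labels: (K,) ground-truth fraction of the random crop that falls
                    inside each base region (assumption: sums to 1)
    """
    base_feats = F.normalize(encoder(base_crops), dim=-1)   # (K, E)
    rand_feat = F.normalize(encoder(random_crop), dim=-1)   # (1, E)

    # Similarity of the random crop to every base crop, clipped to [0, 1].
    pred = (rand_feat @ base_feats.T).squeeze(0).clamp(min=0)  # (K,)

    # The predicted similarities should match the overlap proportions,
    # i.e., the crop's contextual position within the volume.
    return F.mse_loss(pred, overlap_labels)
```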
| Model | Params | Checkpoint |
|---|---|---|
| VoCo_B_SSL_head | 53M | Download |
| VoCo_L_SSL_head | 206M | Download |
| VoCo_H_SSL_head | 818M | Download |
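After downloading a checkpoint, it can be loaded with standard PyTorch utilities. The snippet below is only a sketch: the checkpoint filename, the placeholder backbone, and the state-dict key layout are assumptions, so inspect the downloaded file and use your actual downstream network.

```python
import torch
import torch.nn as nn

# Placeholder backbone: replace with the encoder you fine-tune downstream.
backbone = nn.Sequential(nn.Conv3d(1, 48, kernel_size=3, padding=1))

# Assumption: the downloaded file is a plain state dict; inspect it to confirm.
state = torch.load("VoCo_B_SSL_head.pt", map_location="cpu")
missing, unexpected = backbone.load_state_dict(state, strict=False)
print(f"missing keys: {len(missing)}, unexpected keys: {len(unexpected)}")
```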
Please refer to the Acknowledgment section. Download our PreCT-160K dataset for pre-training.
The path of PreCT-160K should be organized as:
# or you can modify it in 'utils/data_utils*.py'
├── data
│   ├── BTCV
│   ├── TCIAcovid19
│   ├── Luna16-jx
│   ├── ...
│   └── cache
WARNING:
- Storing the original datasets requires 22.6 TB of space. Pre-training requires an additional 30 TB to cache the data; otherwise, pre-training will be very slow. Please store the data on an SSD.
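If your datasets live elsewhere, the data root is resolved inside `utils/data_utils*.py`. The lines below are a purely illustrative sketch of the kind of edit to make there; the variable names are assumptions and may not match the actual file.

```python
# Illustrative only: variable names are assumptions,
# check utils/data_utils*.py for the actual configuration used by this repo.
data_root = "/path/to/PreCT-160K/data"  # folder containing BTCV, TCIAcovid19, Luna16-jx, ...
cache_dir = "/path/to/ssd/cache"        # large, fast SSD storage (~30 TB) for cached data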
cd Self-supervised
source activate YOUR-CONDA-ENVIRONMENT
# single GPU, if you don't have enough GPU resources
sh single_train
# multi-GPU
sh dist_B.sh
sh dist_L.sh
sh dist_H.sh
NOTE THAT we are not the authors of these datasets. Although all of these datasets are publicly available for academic research, you need to cite the original works as listed in our paper. Certain datasets (e.g., WORD) require approval from their authors, so you need to download them from the original links.
If you find this repo useful for your research, please consider citing our papers as follows:
@article{wu2024large,
  title={Large-Scale 3D Medical Image Pre-training with Geometric Context Priors},
  author={Wu, Linshan and Zhuang, Jiaxin and Chen, Hao},
  journal={arXiv preprint arXiv:2410.09890},
  year={2024}
}

@InProceedings{voco-v1,
  author={Wu, Linshan and Zhuang, Jiaxin and Chen, Hao},
  title={VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis},
  booktitle={CVPR},
  month={June},
  year={2024},
  pages={22873-22882}
}