Yet another PyTorch implementation of the ACM Multimedia 2018 paper on Semantic Human Matting.
All dependencies are listed in the Pipfile. You can install them using pipenv:

```sh
$ pipenv install
```
This repository depends on the PSPNet implementation from https://github.com/hszhao/semseg. You will need to download the ResNet model `resnet50_v2.pth` from the `initmodel` directory from this google drive link and place it in `data/models`.
This repository expects training data in the form of raw images and alpha mattes placed in the `data/images` and `data/mattes` folders respectively.
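Assuming matte filenames mirror the image filenames (an assumption — check the data loader for the exact convention), the expected layout would look like:

```
data/
├── images/   # raw training images
├── mattes/   # corresponding ground-truth alpha mattes
└── models/   # pretrained weights, e.g. resnet50_v2.pth
```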
If you're considering pre-training the TNet separately, you will need target trimaps for training. To generate them, simply run the `generate_trimap.py` script located in the `data` directory, with a list of all files to be converted in `images.txt`. This will create trimaps in `data/trimaps` which can be used while pre-training the model.
```sh
$ cd data
$ python3 generate_trimap.py
```
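The exact procedure used by `generate_trimap.py` is not reproduced here, but a common way to derive a trimap from an alpha matte is morphological dilation and erosion around the mask boundary: the eroded core becomes definite foreground, the dilated band becomes the unknown region. A minimal NumPy sketch (the function names and the `iterations` parameter are illustrative, not the script's actual interface):

```python
import numpy as np

def _dilate(mask, iterations):
    """Binary dilation with a 4-connected structuring element."""
    for _ in range(iterations):
        p = np.pad(mask, 1)  # outside the image counts as background
        mask = (p[1:-1, 1:-1] | p[:-2, 1:-1] | p[2:, 1:-1]
                | p[1:-1, :-2] | p[1:-1, 2:])
    return mask

def _erode(mask, iterations):
    """Binary erosion with a 4-connected structuring element."""
    for _ in range(iterations):
        p = np.pad(mask, 1)
        mask = (p[1:-1, 1:-1] & p[:-2, 1:-1] & p[2:, 1:-1]
                & p[1:-1, :-2] & p[1:-1, 2:])
    return mask

def make_trimap(alpha, iterations=10):
    """Turn an alpha matte (uint8, 0-255) into a trimap:
    255 = definite foreground, 0 = background, 128 = unknown band."""
    fg = alpha > 127
    trimap = np.zeros_like(alpha)
    trimap[_dilate(fg, iterations)] = 128  # widened boundary is "unknown"
    trimap[_erode(fg, iterations)] = 255   # shrunken core is definite fg
    return trimap
```

Running something like this over every file listed in `images.txt` and writing the results to `data/trimaps` matches the layout described above.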
This repository currently assumes that the final mattes in `data/mattes` are also the ground truths for pre-training the MNet. There is no support for using a separate ground truth as of now.
To train the image matting pipeline end-to-end, simply run the `train.py` script:

```sh
$ python3 train.py
```
The training script also supports pre-training of the TNet and MNet. This can easily be done using the `--mode` flag:

```sh
# Pre-train TNet
$ python3 train.py --mode pretrain_tnet

# Pre-train MNet
$ python3 train.py --mode pretrain_mnet
```
For additional options, such as changing hyperparameters or using a GPU, please use the `--help` flag.
To run inference with a trained model, use the `test.py` script. This will automatically choose the best model available:

```sh
$ python3 test.py
```

For additional options, please see the `--help` flag.
Although there are several implementations available for this paper, here are a few key reasons why you might want to consider this repository.

- Minimal dependencies: The only dependencies are `torch` and `torchvision`.
- Correct loss computation: Most other implementations use the L2 loss even though the paper specifically calls for the L1 loss.
- Based on official repositories: The code is based on the official implementations of PSPNet and DIMNet.
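The L1 loss mentioned above is typically implemented, as in Deep Image Matting, with a smooth Charbonnier-style approximation so it remains differentiable at zero. A minimal PyTorch sketch (the function name and `eps` default are illustrative, not this repository's actual API):

```python
import torch

def alpha_prediction_loss(pred, target, eps=1e-6):
    """Differentiable approximation of the L1 alpha loss:
    sqrt(d^2 + eps^2) ~= |d|, but smooth at d = 0."""
    return torch.sqrt((pred - target) ** 2 + eps ** 2).mean()
```

Swapping this in for `torch.nn.MSELoss` is the difference the bullet point above refers to.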
This repository is primarily based on the official implementations of PSPNet (https://github.com/hszhao/semseg) and DIMNet (https://github.com/foamliu/Deep-Image-Matting-PyTorch). Any other attributions are noted in comments at the top of the individual files.