The official PyTorch implementation of the paper
SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes
SoundSil-DS is a deep learning model for noise reduction in sound-field images with object silhouettes measured by optical methods, such as interferometry and holography. It is a continuation of our Deep Sound-Field Denoiser. The model treats the complex-valued amplitude of the sound field in the frequency domain as a 2-channel image consisting of real and imaginary parts, and performs noise reduction and object-silhouette segmentation using a network based on CascadedGaze. The network was trained on a sound-field image dataset we created with 2D acoustic simulations; the dataset includes noisy data with additive Gaussian white noise.
Our code is based on CascadedGaze.
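As a rough illustration of this representation (not the repository's actual preprocessing code), the sketch below stacks a complex-valued sound field into a 2-channel real/imaginary tensor and corrupts it with additive Gaussian white noise; the array size and noise level are arbitrary assumptions.

```python
# Sketch only: complex sound field -> 2-channel (real, imag) image,
# plus additive Gaussian white noise. Shapes and noise_std are arbitrary.
import torch

# Complex-valued amplitude of the sound field in the frequency domain.
field = torch.randn(128, 128, dtype=torch.complex64)

# Stack real and imaginary parts into a 2-channel image of shape (2, H, W).
field_2ch = torch.stack([field.real, field.imag], dim=0)

# Additive Gaussian white noise applied to both channels.
noise_std = 0.05
noisy_2ch = field_2ch + noise_std * torch.randn_like(field_2ch)
```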
- Download the trained weights and place them in the 'trained_weights' directory.
- Download the dataset and place it in the 'dataset' directory. Running evaluate.py requires the 'evaluation' directory from the dataset; running train.py requires the 'training' and 'validation' directories.
- Install the dependencies:
  pip install -r requirements.txt
demo.ipynb provides a simple demo that loads a sound field from the evaluation dataset, performs denoising and segmentation with the pretrained weights, and displays the true, noisy, and denoised images.
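The rough sketch below mirrors that flow; it is not the notebook's actual code. nn.Identity stands in for the pretrained network (the real model also outputs a silhouette segmentation map), and a synthetic field replaces a sample from the evaluation dataset.

```python
# Minimal, runnable sketch of the demo flow; the model and data here are
# stand-ins, not the repository's actual API or dataset format.
import torch
import torch.nn as nn
import matplotlib.pyplot as plt

model = nn.Identity()  # placeholder for the CascadedGaze-based network

# Synthetic 2-channel (real, imag) field and a noisy observation of it.
true_field = torch.randn(1, 2, 128, 128)
noisy_field = true_field + 0.1 * torch.randn_like(true_field)

with torch.no_grad():
    denoised = model(noisy_field)  # the real model would denoise here

# Show the real-part channel of the true, noisy, and "denoised" fields.
fig, axes = plt.subplots(1, 3, figsize=(9, 3))
for ax, img, title in zip(axes, [true_field, noisy_field, denoised],
                          ["true", "noisy", "denoised"]):
    ax.imshow(img[0, 0].numpy())
    ax.set_title(title)
    ax.axis("off")
plt.show()
```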
To compute the evaluation metrics and save the denoised data for the evaluation dataset, run
python evaluate.py --config config.yml
You can specify the evaluation parameters under the 'eval' section in config.yml. The results are saved into the directory given by 'save_dir' in the 'evaluation' section of the YAML file; a subdirectory named with the timestamp of the run is generated automatically.
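As a reference for that behavior, the sketch below reads 'save_dir' from config.yml and creates a timestamped subdirectory; the key names are assumptions based on the description above, not verified against the actual config file.

```python
# Sketch of the save-directory handling described above; the keys
# 'evaluation' and 'save_dir' are assumptions about config.yml.
import os
from datetime import datetime
import yaml

with open("config.yml") as f:
    config = yaml.safe_load(f)

save_root = config["evaluation"]["save_dir"]
run_dir = os.path.join(save_root, datetime.now().strftime("%Y%m%d_%H%M%S"))
os.makedirs(run_dir, exist_ok=True)
print(f"Evaluation results will be saved under {run_dir}")
```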
To train your model, run
python train.py --config config.yml
You can specify the training parameters under the 'train' and 'validation' sections in config.yml.
Read the NTTSoftwareLicenseAgreement.pdf.
If you use SoundSil-DS or this codebase in your work, please consider citing it:
@misc{tanigawa2024soundsilds,
  title={SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes},
  author={Risako Tanigawa and Kenji Ishikawa and Noboru Harada and Yasuhiro Oikawa},
  year={2024},
  eprint={2411.07517},
  archivePrefix={arXiv},
  primaryClass={eess.SP},
  url={https://arxiv.org/abs/2411.07517},
}