This is the implementation repository of our ESEC/FSE 2020 paper: Deep Learning Library Testing via Effective Model Generation.

LEMON is a novel approach to testing DL libraries by generating effective DL models via guided mutation. The goal is to test DL libraries as sufficiently as possible by exploring unused library code or different usages of library code. We further propose a heuristic strategy in LEMON to guide the process of model generation, so as to generate models that amplify the inconsistency degrees caused by real bugs. In this way, it is easier to distinguish real bugs from uncertain impacts in DL libraries. We conducted an empirical study to evaluate the effectiveness of LEMON based on 20 release versions of TensorFlow, Theano, CNTK, and MXNet. LEMON detected 24 new bugs in the latest release versions of these libraries. The results also demonstrate that the models generated by LEMON outperform existing models and the models generated without guidance in terms of the number of unique bugs/inconsistencies and the achieved inconsistency degrees.
We used 12 popular DL models based on 6 datasets (covering both image and sequential data) as the initial seed models in LEMON; these models have been widely used in many existing studies.
Model | Dataset | Related link [1] |
---|---|---|
AlexNet | CIFAR-10 | alexnet-cifar-10-keras-jupyter |
LeNet5 | Fashion-MNIST | fashion_mnist_keras |
LeNet5 | MNIST | lenet5-mnist |
LSTM-1 | Sine-Wave | LSTM-Neural-Network-for-Time-Series-Prediction |
LSTM-2 | Stock-Price | StockPricesPredictionProject |
ResNet50 | ImageNet [2] | Keras applications tutorial [3] |
MobileNetV1 | ImageNet [2] | Keras applications tutorial [3] |
InceptionV3 | ImageNet [2] | Keras applications tutorial [3] |
DenseNet121 | ImageNet [2] | Keras applications tutorial [3] |
VGG16 | ImageNet [2] | Keras applications tutorial [3] |
VGG19 | ImageNet [2] | Keras applications tutorial [3] |
Xception | ImageNet [2] | Keras applications tutorial [3] |
[1]: The first 5 models are trained using existing repositories, while the last 7 models (on ImageNet) are obtained directly via the API provided by Keras.
[2]: We sampled 1500 images from ImageNet; you can obtain them from sampled_imagenet-1500.npz. You can also sample your own images from the ImageNet validation dataset (a minimal sampling sketch is given after these notes).
[3]: The Keras applications tutorial can be found at: https://keras.io/api/applications/
[4]: All model files, the two regression datasets, and the sampled ImageNet dataset can be accessed on OneDrive.

NOTE: LEMON uses 6 datasets; 3 of them (CIFAR-10, Fashion-MNIST, MNIST) can be obtained directly from their homepages or accessed via the Keras API. We only upload the ImageNet dataset sampled by LEMON and the two datasets collected from GitHub.
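As mentioned in note [2], you can sample your own ImageNet images. The snippet below is a minimal sketch of one way to do that; the validation-set directory, the 224x224 ResNet50-style preprocessing, and the array name stored in the .npz are illustrative assumptions, not necessarily the exact format LEMON expects.

```python
# Illustrative sketch: sample 1500 ImageNet validation images into an .npz archive.
# The directory path, preprocessing choice, and array name are assumptions.
import os
import random
import numpy as np
from keras.preprocessing import image
from keras.applications.resnet50 import preprocess_input

val_dir = "/your/local/path/imagenet_val"   # hypothetical folder of validation images
random.seed(0)
sampled = random.sample(sorted(os.listdir(val_dir)), 1500)

xs = []
for name in sampled:
    img = image.load_img(os.path.join(val_dir, name), target_size=(224, 224))
    xs.append(image.img_to_array(img))
x_test = preprocess_input(np.array(xs))     # shape (1500, 224, 224, 3)

# Ground-truth labels must be collected separately from the validation annotations.
np.savez("sampled_imagenet-1500.npz", x_test=x_test)
```

Note that input size and preprocessing are model-specific (e.g. InceptionV3 and Xception expect 299x299 inputs), so adjust the sketch to the models you test.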
In dataset.zip, there are 3 files:

- sinewave.csv: dataset of LSTM-1
- DIS.csv: dataset of LSTM-2
- sampled_imagenet-1500.npz: dataset sampled by LEMON from ImageNet

In origin_model.zip, there are the 12 models used in LEMON.
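After unzipping, a quick sanity check is to load one model and the sampled ImageNet archive; the file names below follow the NetworkName-DatasetName_origin.h5 convention described later and are assumptions about the exact names in the archives.

```python
# Sanity-check sketch; the exact file names inside the archives are assumptions.
import numpy as np
from keras.models import load_model

model = load_model("/your/local/path/origin_model/vgg16-imagenet_origin.h5")
print(model.input_shape, model.output_shape)   # e.g. (None, 224, 224, 3) and (None, 1000)

data = np.load("/your/local/path/dataset/sampled_imagenet-1500.npz")
print(data.files)                              # inspect the array names stored in the archive
```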
We used 20 release versions of 4 widely-used DL libraries, i.e., TensorFlow, CNTK, Theano, and MXNet, as subjects, and constructed five experiments (indexed E1 to E5 in the table below) to conduct differential testing. We share the link of each library and the docker image used in LEMON.
Experiment ID | TensorFlow | Theano | CNTK | MXNet | CUDA |
---|---|---|---|---|---|
E1 | 1.14.0 | 1.0.4 | 2.7.0 | 1.5.1 | 10.1 |
E2 | 1.13.1 | 1.0.3 | 2.6.0 | 1.4.1 | 10.0 |
E3 | 1.12.0 | 1.0.2 | 2.5.1 | 1.3.1 | 9.0 |
E4 | 1.11.0 | 1.0.1 | 2.4.0 | 1.2.1 | 9.0 |
E5 | 1.10.0 | 1.0.0 | 2.3.1 | 1.1.0 | 9.0 |
* All libraries should be the GPU-supported versions.
We conducted 5 experiments in LEMON; the library and CUDA versions for each are listed above. To make it easier for other researchers to reproduce LEMON, we provide a docker image for the E1 experiment. It can be obtained with the commands below. (Note: nvidia-docker2 is required!)
Step 0: Install nvidia-docker2. You can use this instruction to install it.
Step 1: Clone the repository, then download the dataset and models from OneDrive. Save the code and unzip the datasets and models to /your/local/path/, e.g. /your/local/path/origin_model and /your/local/path/dataset. (/your/local/path/ should be an absolute path on your server, e.g. /home/user_xxx/.)
Step 2: Use the following commands to pull the docker image we released for E1 and create a container from it.
docker pull yenming1227/lemon:latest
docker run --runtime=nvidia -it -v /your/local/path/:/data yenming1227/lemon:latest /bin/bash
Then you will enter a container. (Remember to change /your/local/path/ to the real path! ^_^)
Note: If your server uses an HTTP proxy, you should configure the proxy in the container just as you did on your server.
Step 3: Create five virtual environments in your docker container, as shown below.
Note: Please copy the installation commands line by line to prevent some commands from being skipped.
# tensorflow
conda create -n tensorflow python=3.6
source activate tensorflow
pip install -r lemon_requirements.txt
pip install keras==2.2.4
pip install tensorflow-gpu==1.14.0
source deactivate
# theano
conda create -n theano python=3.6
source activate theano
pip install -r lemon_requirements.txt
conda install pygpu=0.7.6
pip install keras==2.2.4
pip install theano==1.0.4
source deactivate
# cntk
conda create -n cntk python=3.6
source activate cntk
pip install -r lemon_requirements.txt
pip install keras==2.2.4
pip install cntk-gpu==2.7
source deactivate
# mxnet
conda create -n mxnet python=3.6
source activate mxnet
pip install -r lemon_requirements.txt
pip install keras-mxnet==2.2.4.2
pip install mxnet-cu101==1.5.1.post0
source deactivate
# default lemon python
conda create -n lemon python=3.6
source activate lemon
pip install -r lemon_requirements.txt
pip install keras==2.2.4
pip install tensorflow-gpu==1.14.0
source deactivate
LEMON uses Redis to store intermediate outputs and exchange data between different processes. We have installed Redis in the docker image; you can start it with the following commands:
cd /root/redis-4.0.8/src
./redis-server ../redis.conf
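For context, the sketch below illustrates how prediction outputs can be exchanged between per-backend processes through Redis; the key names and pickle-based serialization are illustrative assumptions, not LEMON's actual storage schema.

```python
# Illustrative sketch only; key names and serialization are assumptions,
# not the schema LEMON actually uses.
import pickle
import numpy as np
import redis

r = redis.Redis(host="localhost", port=6379, db=0)

# A backend-specific process stores its predictions for one mutant...
preds = np.random.rand(1500, 10)                     # placeholder predictions
r.set("prediction:tensorflow:mutant_0", pickle.dumps(preds))

# ...and another process later reads them back to compute inconsistencies.
restored = pickle.loads(r.get("prediction:tensorflow:mutant_0"))
print(restored.shape)                                # (1500, 10)
```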
The LEMON artifacts are well organized, and researchers can simply run LEMON with the following commands.

Note: We conducted five large-scale experiments (generating 100 mutants for each of the 12 initial models, analyzing inconsistencies on 1500 inputs, and locating bugs), and you can reproduce the bugs reported in LEMON by running the tool with experiments.conf. However, that cannot be completed within 48 hours. Therefore, we provide a demo run, which can be completed within 1 hour if you run the tool with demo.conf.
cd /LEMON
git pull
source activate lemon
We strongly recommend using git pull to update LEMON to the latest version.
Mutation:
python -u -m run.mutation_executor demo.conf
The above command shows how to generate mutants and calculate inconsistencies in LEMON. demo.conf is the configuration file we provide for the demo run.
Localization:
python -u -m run.localization_executor demo.conf
This command shows how to perform localization in LEMON. The final bug reports will be stored in /data/lemon_outputs/bug_list.txt.
LEMON also supports switching to other models and datasets. You only need to modify the data processing code in DataUtils.get_data_by_exp in scripts/tools/utils.py; an illustrative example is given after the snippet below.
# TODO: Add your own data preprocessing here
# Note: The returned inputs should already be preprocessed, and the labels should be encoded as
# one-hot vectors that can be fed directly into the model. Both should be returned in batch form,
# e.g. with shapes like (1500, 28, 28, 1) and (1500, 10).
#
# elif 'xxx' in exp:
#     x_test, y_test = get_your_data(dataset_dir)
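For illustration, a branch for a hypothetical experiment could look like the sketch below; the experiment name 'mymnist', the use of Keras' built-in MNIST loader, and the helper name get_your_data are assumptions made for the example, while the returned shapes and one-hot labels follow the note above.

```python
# Hypothetical example for DataUtils.get_data_by_exp; names and data source are illustrative.
import keras
import numpy as np

def get_your_data(dataset_dir, num_samples=1500):
    # Keras' built-in MNIST stands in for a custom dataset stored under dataset_dir.
    (_, _), (x_test, y_test) = keras.datasets.mnist.load_data()
    x_test = x_test[:num_samples].astype("float32") / 255.0
    x_test = x_test.reshape(-1, 28, 28, 1)                          # shape (1500, 28, 28, 1)
    y_test = keras.utils.to_categorical(y_test[:num_samples], 10)   # one-hot, shape (1500, 10)
    return x_test, y_test

# The corresponding branch inside DataUtils.get_data_by_exp would then be:
# elif 'mymnist' in exp:
#     x_test, y_test = get_your_data(dataset_dir)
```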
Besides, you should name your model file in the format NetworkName-DatasetName_origin.h5, e.g. mobilenet.1.00.224-imagenet_origin.h5.

Note: _ and - must not appear in NetworkName; replace them with a dot (.). For example, change mobilenet_1.00_224-imagenet_origin.h5 to mobilenet.1.00.224-imagenet_origin.h5.
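A small sketch of this renaming rule is shown below; the helper name to_lemon_model_name is hypothetical, and only the NetworkName part is rewritten, as described above.

```python
# Hypothetical helper for the naming rule above: '_' and '-' inside NetworkName
# are replaced with '.', and the '-DatasetName_origin.h5' suffix is appended.
def to_lemon_model_name(network_name, dataset_name):
    clean = network_name.replace("_", ".").replace("-", ".")
    return "{}-{}_origin.h5".format(clean, dataset_name)

print(to_lemon_model_name("mobilenet_1.00_224", "imagenet"))
# -> mobilenet.1.00.224-imagenet_origin.h5
```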
Please cite our paper if this work is helpful to you.
@inproceedings{DBLP:conf/sigsoft/WangYCLZ20,
author = {Zan Wang and
Ming Yan and
Junjie Chen and
Shuang Liu and
Dongdi Zhang},
title = {Deep learning library testing via effective model generation},
booktitle = {{ESEC/SIGSOFT} {FSE}},
pages = {788--799},
publisher = {{ACM}},
year = {2020}
}
Authors information:
Name | Email Address | GitHub ID |
---|---|---|
Zan Wang | [email protected] | tjuwangzan |
Ming Yan | [email protected] | Jacob-yen |
Junjie Chen * | [email protected] | JunjieChen |
Shuang Liu | [email protected] | AbigailLiu |
Dongdi Zhang | [email protected] | Dandy-John |
* corresponding author