
OE-SafetyEval

A unified repository for efficient safety evaluation of LLMs.

Installation

conda create -n safetyeval python=3.10
conda activate safetyeval
pip install vllm==0.3.2
pip install openai==0.28.0
pip install datasets tenacity
pip install scipy scikit-learn google-api-python-client
pip install ai2-olmo
pip install accelerate
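
As an optional sanity check, you can verify that the pinned packages import cleanly in the new environment before running any scripts:

# Quick sanity check inside the safetyeval environment.
python -c "import vllm, openai, datasets, tenacity, accelerate; print('imports ok')"
pip show vllm openai | grep -E 'Name|Version'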

How to add a new model to the benchmark

1. Models supported by vLLM

You can use the files under the scripts folder as references when adding a new model to the benchmark. For example, to add zephyr-7b-beta, follow these steps (a sketch of the resulting script appears after the list):

  1. Create a script named "zephyr-7b-beta.sh" under the scripts folder.
  2. Copy the contents of the most similar existing script into it. For example, Mistral-7B-Instruct-v0.1.sh is the closest match for zephyr-7b-beta.sh.
  3. Change model_name and model_pretty_name to HuggingFaceH4/zephyr-7b-beta and zephyr-7b-beta respectively. Make sure that model_name matches the model name on the Hugging Face model hub, and that model_pretty_name matches the script name without the .sh extension.
  4. Specify the conversation template for this model by modifying the code in src/fastchat_conversation.py.
  5. Run your script to make sure it works: bash scripts/zephyr-7b-beta.sh from the root folder.
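
For reference, a minimal sketch of what scripts/zephyr-7b-beta.sh could look like is shown below. The inference entry point and flag names are placeholders for illustration; copy the real ones from scripts/Mistral-7B-Instruct-v0.1.sh rather than from this sketch.

# Sketch of scripts/zephyr-7b-beta.sh. The entry point and flag names below
# are placeholders; the existing scripts define the real ones.
model_name="HuggingFaceH4/zephyr-7b-beta"   # must match the Hugging Face hub id
model_pretty_name="zephyr-7b-beta"          # must match the script name without .sh
output_dir="results/${model_pretty_name}"   # placeholder output location

python src/unified_infer.py \
    --engine vllm \
    --model_name ${model_name} \
    --model_pretty_name ${model_pretty_name} \
    --output_dir ${output_dir}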

2. Models that are only supported by native Hugging Face API

Some new models may not be supported by vLLM yet. You can follow the same steps as above but use --engine hf in the script instead, and then test your script. Note that some models may need more specific configurations, and you will need to read the code and modify it accordingly. In these cases, add model-name checks so that the model-specific changes are applied only to that specific model.
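
Concretely, the only change relative to the vLLM sketch above is the engine flag (again, treat the entry point and flag names as placeholders and copy the real ones from an existing HF-based script):

# Same placeholder invocation as above, switched to the native Hugging Face backend.
python src/unified_infer.py \
    --engine hf \
    --model_name ${model_name} \
    --model_pretty_name ${model_pretty_name} \
    --output_dir ${output_dir}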

3. Private API-based Models

You will need to change the code to add these APIs, for example gemini, cohere, claude, and reka. You can refer to the --engine openai logic in the existing scripts to add your own API-based models. Please make sure that you do not expose your API keys in the code.
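
One way to keep keys out of the code is to export them as environment variables in your shell and have the script read them from the environment; the variable and script names below are only examples, not part of the repository:

# Export the key in your shell; never hard-code or commit it.
export OPENAI_API_KEY="..."        # or the corresponding variable for gemini/cohere/claude/reka
bash scripts/your-api-model.sh     # example script name; it should read the key from the environment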

How to run the benchmark

You can run the benchmark by running the scripts under the scripts folder, for example bash scripts/zephyr-7b-beta.sh. This will generate a single result file in the specified output_dir. Note that if you use the shard mode, each subprocess writes its own partial results, and the script merges them into a single file once all subprocesses are finished.
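
For example, a sharded run could look roughly like the following, with one subprocess per shard and a merge once all of them finish; the shard arguments here are hypothetical, so check an existing script for how sharding is actually configured:

# Hypothetical sharded run: one subprocess per GPU shard, merged afterwards.
num_shards=4
for shard_id in $(seq 0 $((num_shards - 1))); do
    CUDA_VISIBLE_DEVICES=${shard_id} bash scripts/zephyr-7b-beta.sh ${shard_id} ${num_shards} &
done
wait   # the results are merged into a single file once all subprocesses have finished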

After you have tested your script, you can create a PR, and we will run your script on our side and merge it into the benchmark. (Note that we may need to modify the script to make it work on our side, and we will let you know if we do. Also, due to the non-deterministic nature of LLMs, our results may not be exactly the same as yours.)
