runpod-playground

Steps

The API health check is enabled by default; it sends a message to the vLLM server at a fixed interval.
- To disable the health check, set ENABLE_HEALTH_CHECK=0 in the .env file.
To send health check failure messages to Microsoft Teams, set TEAMS_WEBHOOK_URL in the .env file.
- Example: TEAMS_WEBHOOK_URL=https://outlook.office.com/webhook/...
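
The settings above can be pictured with a minimal sketch of a single health check pass. This is not the repository's actual implementation: the function names, the VLLM_URL variable, the use of the server's /health route as the probe, and the simple {"text": ...} webhook body are all assumptions made for illustration.

```shell
# Assumed base URL; matches the request examples in this README.
VLLM_URL="${VLLM_URL:-http://0.0.0.0:8000}"

# Hypothetical helper: build a JSON body for a Teams incoming webhook.
# The message is inserted verbatim, so it must not contain double quotes.
teams_payload() {
  printf '{"text": "%s"}' "$1"
}

# Hypothetical helper: run one probe and notify Teams on failure.
health_check_once() {
  # ENABLE_HEALTH_CHECK=0 disables the check; it is on by default.
  if [ "${ENABLE_HEALTH_CHECK:-1}" = "0" ]; then
    return 0
  fi
  # Assumption: probe the /health route instead of sending a chat message.
  if curl --silent --fail --max-time 10 "$VLLM_URL/health" > /dev/null; then
    return 0
  fi
  # Only notify when a webhook URL has been configured.
  if [ -n "${TEAMS_WEBHOOK_URL:-}" ]; then
    curl --silent --request POST \
      --header "Content-Type: application/json" \
      --data "$(teams_payload "vLLM health check failed at $VLLM_URL")" \
      "$TEAMS_WEBHOOK_URL" > /dev/null
  fi
  return 1
}
```

Calling health_check_once from a loop with a sleep between passes would give the fixed-interval behaviour described above.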

cd /workspace
git clone https://github.com/ilkersigirci/runpod-playground.git
cd /workspace/runpod-playground

# Initial dependency install
make install

# Download model
make download-model

# Start vllm
make start-vllm

# See vllm logs
make log-vllm

# Restart vllm
make restart-vllm

# Start the simple gui
make gui
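
Loading a large model can take a while after make start-vllm, so a script may want to wait until the server answers before sending requests. The following is a sketch only: wait_until is a hypothetical helper, not part of this repository, and the address is assumed from the request examples further down.

```shell
# Hypothetical helper: retry a command until it succeeds.
# Usage: wait_until ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
# Note: plain POSIX sh, so the loop variables are global.
wait_until() {
  attempts="$1"; delay="$2"; shift 2
  i=1
  while [ "$i" -le "$attempts" ]; do
    if "$@" > /dev/null 2>&1; then
      return 0
    fi
    i=$((i + 1))
    # Skip the sleep after the final failed attempt.
    if [ "$i" -le "$attempts" ]; then
      sleep "$delay"
    fi
  done
  return 1
}

# Example (assumed address): poll the model list every 5 seconds, up to 60 times.
# wait_until 60 5 curl --silent --fail http://0.0.0.0:8000/v1/models
```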

To deploy a different model, change the HF_MODEL_NAME variable in the .env file to the model you want, following the Hugging Face repository ID convention (owner/model-name).
You can also change SERVED_MODEL_NAME to set the model name that clients use in requests.
MAX_CONTEXT_LEN can likewise be changed to the desired context length.
Example: Change the default model to CohereForAI/c4ai-command-r-plus-GPTQ and its context length to 40000

make change-model-env HF_MODEL_NAME=CohereForAI/c4ai-command-r-plus-GPTQ
make change-max-context-len-env MAX_CONTEXT_LEN=40000
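
For variables without a dedicated make target, such as SERVED_MODEL_NAME, the .env file can be edited by hand or with a small helper. The sketch below is an assumption about how such an update could be scripted, not a description of what the Makefile does; set_env_var is a hypothetical name.

```shell
# Hypothetical helper: set KEY=VALUE in an env file.
# Replaces an existing KEY= line, or appends one if the key is missing.
# Usage: set_env_var KEY VALUE [FILE]   (FILE defaults to .env)
# Note: VALUE must not contain "|" or "&", which are special to sed here.
set_env_var() {
  key="$1"; value="$2"; file="${3:-.env}"
  if grep -q "^${key}=" "$file" 2>/dev/null; then
    # "|" is the sed delimiter so that "/" in model names needs no escaping.
    sed "s|^${key}=.*|${key}=${value}|" "$file" > "${file}.tmp" \
      && mv "${file}.tmp" "$file"
  else
    printf '%s=%s\n' "$key" "$value" >> "$file"
  fi
}

# Example:
# set_env_var SERVED_MODEL_NAME c4ai-command-r-plus-GPTQ
# make restart-vllm
```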

Example request with system message

curl --request POST \
    --url http://0.0.0.0:8000/v1/chat/completions \
    --header "Content-Type: application/json" \
    --data '{
  "model": "c4ai-command-r-plus-GPTQ",
  "messages": [
    {
      "role": "system",
      "content": "You are a helpful virtual assistant trained by OpenAI."
    },
    {
      "role": "user",
      "content": "Who are you?"
    }
  ],
  "temperature": 0.8,
  "stream": false
}'

Example request without system message

curl --request POST \
    --url http://0.0.0.0:8000/v1/chat/completions \
    --header "Content-Type: application/json" \
    --data '{
  "model": "c4ai-command-r-plus-GPTQ",
  "messages": [
    {
      "role": "user",
      "content": "Who are you?"
    }
  ],
  "temperature": 0.8,
  "stream": false
}'
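
The two requests differ only in whether a system message is present, so the request body can be generated by one function. This is a sketch under stated assumptions: chat_payload is a hypothetical helper, and the temperature and stream values are simply copied from the examples above.

```shell
# Hypothetical helper: print a chat completions request body.
# Arguments: served model name, user message, optional system message.
# Note: arguments are inserted verbatim, so they must not contain
# double quotes, backslashes, or newlines.
chat_payload() {
  model="$1"; user="$2"; system="${3:-}"
  messages="{\"role\": \"user\", \"content\": \"$user\"}"
  # Put the system message first when one is given.
  if [ -n "$system" ]; then
    messages="{\"role\": \"system\", \"content\": \"$system\"}, $messages"
  fi
  printf '{"model": "%s", "messages": [%s], "temperature": 0.8, "stream": false}' \
    "$model" "$messages"
}

# Example:
# curl --request POST \
#     --url http://0.0.0.0:8000/v1/chat/completions \
#     --header "Content-Type: application/json" \
#     --data "$(chat_payload c4ai-command-r-plus-GPTQ 'Who are you?')"
```

The model argument should match SERVED_MODEL_NAME, since that is the name the server expects in requests.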
