Skip to content

ilkersigirci/runpod-playground

Repository files navigation

runpod-playground

Steps

  • Api healthcheck is enabled by default, which sends a message to the vllm server in fixed period of time.
    • To disable healthcheck, ENABLE_HEALTH_CHECK=0 should be set in .env file.
  • To send the healthcheck failure message to Microsoft Teams, TEAMS_WEBHOOK_URL should be set in .env file.
    • Example: TEAMS_WEBHOOK_URL=https://outlook.office.com/webhook/...
cd /workspace
git clone https://github.com/ilkersigirci/runpod-playground.git
cd /workspace/runpod-playground

# Initial dependency install
make install

# Download model
make download-model

# Start vllm
make start-vllm

# See vllm logs
make log-vllm

# Restart vllm
make restart-vllm

# Start the simple gui
make gui
  • To deploy different model, in .env file, change HF_MODEL_NAME variable to the model name you want to deploy by following hunggingface repository id convention.
  • Also you can change SERVED_MODEL_NAME to specify model name for requests.
  • One can also change MAX_CONTEXT_LEN variable to the desired context length.
  • Example: Change default model and its context length to CohereForAI/c4ai-command-r-plus-GPTQ
make change-model-env HF_MODEL_NAME=CohereForAI/c4ai-command-r-plus-GPTQ
make change-max-context-len-env MAX_CONTEXT_LEN=40000
  • Example request with system message
curl --request POST \
    --url http://0.0.0.0:8000/v1/chat/completions \
    --header "Content-Type: application/json" \
    --data '{
  "model": "c4ai-command-r-plus-GPTQ",
  "messages": [
  {
      "role": "system",
      "content": "You are a helpful virtual assistant trained by OpenAI."
  },
  {
    "role": "user",
    "content": "Who are you?"
  }
  ], 
  "temperature": 0.8,
  "stream": false
}'
  • Example request without system message
curl --request POST \
    --url http://0.0.0.0:8000/v1/chat/completions \
    --header "Content-Type: application/json" \
    --data '{
  "model": "c4ai-command-r-plus-GPTQ",
  "messages": [
  {
    "role": "user",
    "content": "Who are you?"
  }
  ], 
  "temperature": 0.8,
  "stream": false
}'

TabbyAPI Prompt Templates

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published