Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Config hidden layer number to run in 1 lazy graph #451

Open
wants to merge 2 commits into
base: habana_main
Choose a base branch
from

Commits on Nov 1, 2024

  1. Add VLLM_CONFIG_HIDDEN_LAYERS to config hidden layers to run in one l…

    …azy graph.
    
    When batch size can't go high due to TPOT limiation, instead of run 1 layer in each graph,
    run more layer helps with performance.
    libinta committed Nov 1, 2024
    Configuration menu
    Copy the full SHA
    5cb7128 View commit details
    Browse the repository at this point in the history
  2. Fix miss spell

    libinta committed Nov 1, 2024
    Configuration menu
    Copy the full SHA
    ec0f44e View commit details
    Browse the repository at this point in the history