Fix performance of top_p and top_k calculations #449

kdamaszk · 2024-10-30T15:34:14Z

This change fixes the performance issue I introduced in PR #414 (HabanaAI#414): because `torch.where` was used to choose between them, both functions were always called. Now only the selected one runs.
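For readers unfamiliar with the pitfall, here is a minimal sketch of the pattern described above. It is not the code from this PR: the helpers `apply_top_k` and `apply_top_p`, the `calls` counter, and both `select_*` functions are hypothetical names made up for illustration. It only shows that the arguments of `torch.where` are ordinary Python expressions, so both are computed before the condition is looked at, whereas a plain `if` runs just one.

```python
import torch

# Hypothetical call counter, used only to make the eager evaluation visible.
calls = {"top_k": 0, "top_p": 0}


def apply_top_k(logits: torch.Tensor, k: int) -> torch.Tensor:
    """Hypothetical helper: mask every logit below the k-th largest one."""
    calls["top_k"] += 1
    kth = torch.topk(logits, k, dim=-1).values[..., -1:]
    return logits.masked_fill(logits < kth, float("-inf"))


def apply_top_p(logits: torch.Tensor, p: float) -> torch.Tensor:
    """Hypothetical helper: keep the smallest prefix of tokens whose mass reaches p."""
    calls["top_p"] += 1
    sorted_logits, sorted_idx = torch.sort(logits, dim=-1, descending=True)
    probs = torch.softmax(sorted_logits, dim=-1)
    cumulative = torch.cumsum(probs, dim=-1)
    # Drop a token when the probability mass before it already reaches p.
    remove = (cumulative - probs) >= p
    sorted_logits = sorted_logits.masked_fill(remove, float("-inf"))
    # Undo the sort so the result lines up with the input positions.
    return torch.empty_like(logits).scatter(-1, sorted_idx, sorted_logits)


def select_eager(logits: torch.Tensor, use_top_k: bool, k: int, p: float) -> torch.Tensor:
    # Both helpers run here, because their results are needed as arguments
    # before torch.where can pick between them.
    cond = torch.tensor(use_top_k)
    return torch.where(cond, apply_top_k(logits, k), apply_top_p(logits, p))


def select_branch(logits: torch.Tensor, use_top_k: bool, k: int, p: float) -> torch.Tensor:
    # Only the selected helper runs.
    if use_top_k:
        return apply_top_k(logits, k)
    return apply_top_p(logits, p)
```

Calling `select_eager` increments both counters in `calls`, while `select_branch` increments only the one for the chosen path; the returned tensors are the same. The branch version assumes the condition is available as a Python `bool` on the host, which may not hold in every sampler.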

Fix performance of top_p and top_k calculations

bc6e304

kdamaszk requested review from michalkuligowski and mswiniarsk October 30, 2024 15:34

format.sh

4f7dca9

michalkuligowski approved these changes Oct 30, 2024

michalkuligowski merged commit d3257b2 into habana_main Oct 30, 2024
19 checks passed

michalkuligowski deleted the dev/kdamaszke/fix-topp-topk-sampler branch October 30, 2024 15:58
