[Misc] Enhance ppc64le Dockerfile with UBI9 Integration and Non-Root User Context #133

varad-ahirwadkar · 2024-08-26T05:20:29Z

Enhance ppc64le Dockerfile:

Adding UBI9 as base image for final stage
Adding Non-Root User Context and setting environment variables for OCP Cluster Compatibility

BEFORE SUBMITTING, PLEASE READ THE CHECKLIST BELOW AND FILL IN THE DESCRIPTION ABOVE

PR Checklist (Click to Expand)

Thank you for your contribution to vLLM! Before submitting the pull request, please ensure the PR meets the following criteria. This helps vLLM maintain the code quality and improve the efficiency of the review process.

PR Title and Classification

Only specific types of PRs will be reviewed. The PR title is prefixed appropriately to indicate the type of change. Please use one of the following:

[Bugfix] for bug fixes.
[CI/Build] for build or continuous integration improvements.
[Doc] for documentation fixes and improvements.
[Model] for adding a new model or improving an existing model. Model name should appear in the title.
[Frontend] For changes on the vLLM frontend (e.g., OpenAI API server, LLM class, etc.)
[Kernel] for changes affecting CUDA kernels or other compute kernels.
[Core] for changes in the core vLLM logic (e.g., LLMEngine, AsyncLLMEngine, Scheduler, etc.)
[Hardware][Vendor] for hardware-specific changes. Vendor name should appear in the prefix (e.g., [Hardware][AMD]).
[Misc] for PRs that do not fit the above categories. Please use this sparingly.

Note: If the PR spans more than one category, please include all relevant prefixes.

Code Quality

The PR need to meet the following code quality standards:

We adhere to Google Python style guide and Google C++ style guide.
Pass all linter checks. Please use format.sh to format your code.
The code need to be well-documented to ensure future contributors can easily understand the code.
Include sufficient tests to ensure the project to stay correct and robust. This includes both unit tests and integration tests.
Please add documentation to docs/source/ if the PR modifies the user-facing behaviors of vLLM. It helps vLLM user understand and utilize the new features or changes.

Notes for Large Changes

Please keep the changes as concise as possible. For major architectural changes (>500 LOC excluding kernel/data/config/test), we would expect a GitHub issue (RFC) discussing the technical design and justification. Otherwise, we will tag it with rfc-required and might not go through the PR.

What to Expect for the Reviews

The goal of the vLLM team is to be a transparent reviewing machine. We would like to make the review process transparent and efficient and make sure no contributor feel confused or frustrated. However, the vLLM team is small, so we need to prioritize some PRs over others. Here is what you can expect from the review process:

After the PR is submitted, the PR will be assigned to a reviewer. Every reviewer will pick up the PRs based on their expertise and availability.
After the PR is assigned, the reviewer will provide status update every 2-3 days. If the PR is not reviewed within 7 days, please feel free to ping the reviewer or the vLLM team.
After the review, the reviewer will put an action-required label on the PR if there are changes required. The contributor should address the comments and ping the reviewer to re-review the PR.
Please respond to all comments within a reasonable time frame. If a comment isn't clear or you disagree with a suggestion, feel free to ask for clarification or discuss the suggestion.

Thank You

Finally, thank you for taking the time to read these guidelines and for your interest in contributing to vLLM. Your contributions make vLLM a great tool for everyone!

…ntext Signed-off-by: Varad Ahirwadkar <[email protected]>

openshift-ci · 2024-08-26T05:20:34Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: varad-ahirwadkar
Once this PR has been reviewed and has the lgtm label, please assign dtrifiro for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci · 2024-08-26T05:20:40Z

Hi @varad-ahirwadkar. Thanks for your PR.

I'm waiting for a opendatahub-io member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

vaibhavjainwiz · 2024-08-26T08:53:45Z

/ok-to-test

varad-ahirwadkar · 2024-08-27T08:24:10Z

WIP

openshift-ci · 2024-09-03T22:15:59Z

@varad-ahirwadkar: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/smoke-test	`6205a7b`	link	true	`/test smoke-test`
ci/prow/rocm-pr-image-mirror	`6205a7b`	link	true	`/test rocm-pr-image-mirror`

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

dtrifiro · 2024-09-12T09:31:49Z

Hi @varad-ahirwadkar. Thanks for the contribution, but I think it's more appropriate to propose this upstream instead (vllm-project/vllm)

* formatting fixes * Upstream CR update

* tightened atol for custom PA; enable supported head size, block sizes in testing * update num_blocks and num_iters in benchmark PA to realistic settings * move to generic b16 type * bf16 first port * enabled all bf16 tests, set atol for bf16 * enable custom PA for bf16 as well as block size 32 and head size 64 * fix cast to zero in custom PA reduce * py linter fixes * clang format fixes * div round up clang-format --------- Co-authored-by: Charlie Fu <[email protected]> Co-authored-by: Gregory Shtrasberg <[email protected]>

Enhance ppc64le Dockerfile with UBI9 Integration and Non-Root User Co…

6205a7b

…ntext Signed-off-by: Varad Ahirwadkar <[email protected]>

openshift-ci bot requested review from terrytangyuan and vaibhavjainwiz August 26, 2024 05:20

openshift-ci bot added the needs-ok-to-test label Aug 26, 2024

openshift-ci bot added ok-to-test and removed needs-ok-to-test labels Aug 26, 2024

dtrifiro closed this Sep 12, 2024

Xaenalt pushed a commit that referenced this pull request Sep 18, 2024

Address upstream PR code review comments (#133)

a0646da

* formatting fixes * Upstream CR update

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Misc] Enhance ppc64le Dockerfile with UBI9 Integration and Non-Root User Context #133

[Misc] Enhance ppc64le Dockerfile with UBI9 Integration and Non-Root User Context #133

varad-ahirwadkar commented Aug 26, 2024

openshift-ci bot commented Aug 26, 2024

openshift-ci bot commented Aug 26, 2024

vaibhavjainwiz commented Aug 26, 2024

varad-ahirwadkar commented Aug 27, 2024

openshift-ci bot commented Sep 3, 2024

dtrifiro commented Sep 12, 2024

[Misc] Enhance ppc64le Dockerfile with UBI9 Integration and Non-Root User Context #133

[Misc] Enhance ppc64le Dockerfile with UBI9 Integration and Non-Root User Context #133

Conversation

varad-ahirwadkar commented Aug 26, 2024

PR Title and Classification

Code Quality

Notes for Large Changes

What to Expect for the Reviews

Thank You

openshift-ci bot commented Aug 26, 2024

openshift-ci bot commented Aug 26, 2024

vaibhavjainwiz commented Aug 26, 2024

varad-ahirwadkar commented Aug 27, 2024

openshift-ci bot commented Sep 3, 2024

dtrifiro commented Sep 12, 2024