Add in Dockerfile.hpu.ubi #602

Xaenalt · 2024-12-09T17:51:18Z

Creates the 1.18.0 version of the UBI dockerfile for use with Red Hat OpenShift AI

Dockerfile.hpu.ubi

uMartinXu · 2024-12-12T20:52:10Z

Dockerfile.hpu.ubi

+    DISABLE_LOGPROBS_DURING_SPEC_DECODING=false
+
+USER 2000
+ENTRYPOINT ["python3", "-m", "vllm_tgis_adapter", "--uvicorn-log-level=warning"]


The PR looks good.
My question is how could 1.19.0 and the future releases also has the feature. Should the PR be submitted to main branch?

Yes, the main part that needs updating release-to-release is L7: ENV HABANA_SOFTWARE_VERSION="1.18.0-524"

Dockerfile.hpu.ubi

vllm gaudi ubi image based on PR HabanaAI/vllm-fork#602 Signed-off-by: vbedida79 <[email protected]>

uMartinXu · 2024-12-17T00:02:46Z

we have verified the PR in https://github.com/intel/intel-technology-enabling-for-openshift/tree/main/tests/gaudi/l2#vllm
It works fine, I think with some modification in the comments in the PR the PR is good to merge.
Thanks!

Xaenalt · 2024-12-17T15:17:21Z

What other changes are needed?

uMartinXu · 2024-12-17T17:36:38Z

What other changes are needed?

The PR itself is good.
I think the PR should be merged into main branch so the following 1.19.0,1.20.0 releases can have the openShift and RHEL support.
Besides the related README file also needs to be updated. see https://github.com/HabanaAI/vllm-fork/blob/habana_main/README_GAUDI.md
So users can know how to build and run vLLM Gaudi support on RHEL and OpenShift.

Xaenalt mentioned this pull request Dec 10, 2024

[build] Changes for RH build #190

Closed

uMartinXu reviewed Dec 12, 2024

View reviewed changes

uMartinXu reviewed Dec 13, 2024

View reviewed changes

Dockerfile.hpu.ubi Show resolved Hide resolved

uMartinXu mentioned this pull request Dec 16, 2024

tests_gaudi: Added L2 vllm workload intel/intel-technology-enabling-for-openshift#329

Merged

Add in Dockerfile.hpu.ubi

c55246a

Xaenalt force-pushed the gaudi-1.18.0-main branch from 1262b71 to c55246a Compare December 16, 2024 19:10

vbedida79 added a commit to vbedida79/intel-technology-enabling-for-openshift that referenced this pull request Dec 16, 2024

tests_gaudi: Added L2 vllm workload

5f35eab

vllm gaudi ubi image based on PR HabanaAI/vllm-fork#602 Signed-off-by: vbedida79 <[email protected]>

vbedida79 added a commit to vbedida79/intel-technology-enabling-for-openshift that referenced this pull request Dec 16, 2024

tests_gaudi: Added L2 vllm workload

4e68146

vllm gaudi ubi image based on PR HabanaAI/vllm-fork#602 Signed-off-by: vbedida79 <[email protected]>

vbedida79 added a commit to vbedida79/intel-technology-enabling-for-openshift that referenced this pull request Dec 16, 2024

tests_gaudi: Added L2 vllm workload

dd2a16c

vllm gaudi ubi image based on PR HabanaAI/vllm-fork#602 Signed-off-by: vbedida79 <[email protected]>

vbedida79 added a commit to vbedida79/intel-technology-enabling-for-openshift that referenced this pull request Dec 16, 2024

tests_gaudi: Added L2 vllm workload

9a963b3

vllm gaudi ubi image based on PR HabanaAI/vllm-fork#602 Signed-off-by: vbedida79 <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add in Dockerfile.hpu.ubi #602

Add in Dockerfile.hpu.ubi #602

Xaenalt commented Dec 9, 2024

uMartinXu Dec 12, 2024

Xaenalt Dec 16, 2024

uMartinXu commented Dec 17, 2024

Xaenalt commented Dec 17, 2024

uMartinXu commented Dec 17, 2024 •

edited

Loading

Add in Dockerfile.hpu.ubi #602

Are you sure you want to change the base?

Add in Dockerfile.hpu.ubi #602

Conversation

Xaenalt commented Dec 9, 2024

uMartinXu Dec 12, 2024

Choose a reason for hiding this comment

Xaenalt Dec 16, 2024

Choose a reason for hiding this comment

uMartinXu commented Dec 17, 2024

Xaenalt commented Dec 17, 2024

uMartinXu commented Dec 17, 2024 • edited Loading

uMartinXu commented Dec 17, 2024 •

edited

Loading