Sync #231
* Bump golang.org/x/crypto to 0.19.0
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump knative.dev/serving to 0.40.1
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* go mod tidy
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
---------
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Some of the E2E test code assumes the environment is GitHub, because it refers to GitHub-specific variables. This PR focuses on references to the `kserve/image-transformer` image. This image is built in the CI flow and made available to the runner, so a pull from an external registry is not needed. The references to this image are changed to an environment variable that is more agnostic to the runner, in an effort to make the E2Es more compatible with other environments. The e2e-test.yml is modified to declare the new variable.
Signed-off-by: Edgar Hernández <[email protected]>
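To illustrate the idea described above, here is a minimal sketch of how a test helper might resolve the image reference from an environment variable instead of a GitHub-specific value. The variable name `IMAGE_TRANSFORMER_IMG`, the default tag, and the function `resolve_transformer_image` are hypothetical and chosen only for illustration; the text above does not state the names actually used in the PR.

```python
import os

# Hypothetical names: the PR description does not state the actual
# variable name or default image reference.
IMAGE_ENV_VAR = "IMAGE_TRANSFORMER_IMG"
DEFAULT_IMAGE = "kserve/image-transformer:latest"


def resolve_transformer_image(env=None):
    """Return the image reference the E2E test should deploy.

    Reads the runner-agnostic environment variable and falls back to a
    default when the variable is unset or blank.
    """
    env = os.environ if env is None else env
    value = env.get(IMAGE_ENV_VAR, "").strip()
    return value or DEFAULT_IMAGE
```

Under this sketch, a CI workflow would export the variable once, and any runner (GitHub or otherwise) could point the tests at a locally built image by setting the same variable.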
* Initial commit to support vllm as alternative backend
  Signed-off-by: Gavrish Prabhu <[email protected]>
* include minor fixes and readme changes
  Signed-off-by: Gavrish Prabhu <[email protected]>
* fix poetry lock issues
  Signed-off-by: Gavrish Prabhu <[email protected]>
* fix lint issues
  Signed-off-by: Gavrish Prabhu <[email protected]>
* use_vllm support True as default
  Signed-off-by: Gavrish Prabhu <[email protected]>
* refactor code and fix review comments
  Signed-off-by: Gavrish Prabhu <[email protected]>
* build failure - fix tests and install vllm part of dockerfile
  Signed-off-by: Gavrish Prabhu <[email protected]>
* fix poetry lock issue
  Signed-off-by: Gavrish Prabhu <[email protected]>
* include string constants
  Signed-off-by: Gavrish Prabhu <[email protected]>
* linting fix
  Signed-off-by: Gavrish Prabhu <[email protected]>
* fix review comments
  Signed-off-by: Gavrish Prabhu <[email protected]>
* fix tests
  Signed-off-by: Gavrish Prabhu <[email protected]>
* fix review comments
  Signed-off-by: Gavrish Prabhu <[email protected]>
* add support in vllm for locally downloaded models
  Signed-off-by: Gavrish Prabhu <[email protected]>
* Update Readme
  Signed-off-by: Gavrish Prabhu <[email protected]>
* Update Readme
  Signed-off-by: Gavrish Prabhu <[email protected]>
* Update python/huggingfaceserver/README.md
  Signed-off-by: Dan Sun <[email protected]>
---------
Signed-off-by: Gavrish Prabhu <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
Co-authored-by: Dan Sun <[email protected]>
Expose qpext aggregate metrics port Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump paddlepaddle to 2.6.0
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump transformers to version 4.37.2
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump cryptography to version 42.0.2
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump fastapi to version 0.109.2
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump pillow to version 10.2.0
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump aiohttp to version 3.9.3
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Revert fastapi bump
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Bump ray serve to 2.9.2
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Pin alibi to >=0.9.4
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
* Fix alibi version
  Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
---------
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
…ve#3436)
* feat: Allow to disable ingress creation for raw deployment mode
  Signed-off-by: Yuan Tang <[email protected]>
* test
  Signed-off-by: Yuan Tang <[email protected]>
* Add configs
  Signed-off-by: Yuan Tang <[email protected]>
* codegen
  Signed-off-by: Yuan Tang <[email protected]>
* python docs
  Signed-off-by: Yuan Tang <[email protected]>
* update helm docs
  Signed-off-by: Yuan Tang <[email protected]>
* undo changes in pkg/apis/serving/v1beta1/zz_generated.deepcopy.go
  Signed-off-by: Yuan Tang <[email protected]>
* remove unnecessary changes
  Signed-off-by: Yuan Tang <[email protected]>
* Empty-Commit
  Signed-off-by: Yuan Tang <[email protected]>
* remove hf change
  Signed-off-by: Yuan Tang <[email protected]>
* Update config/configmap
  Signed-off-by: Yuan Tang <[email protected]>
* fix chart syntax
  Signed-off-by: Yuan Tang <[email protected]>
* Empty-Commit
  Signed-off-by: Yuan Tang <[email protected]>
---------
Signed-off-by: Yuan Tang <[email protected]>
* build: Skip already downloaded Istio in quick_install.sh
  Signed-off-by: Yuan Tang <[email protected]>
* Update quick_install.sh
  Signed-off-by: Yuan Tang <[email protected]>
---------
Signed-off-by: Yuan Tang <[email protected]>
…rver (kserve#3449)
* fix: HuggingFace predictor should not be recognized as multi-model server
  Signed-off-by: Yuan Tang <[email protected]>
* add
  Signed-off-by: Yuan Tang <[email protected]>
---------
Signed-off-by: Yuan Tang <[email protected]>
* Do not inject model agent for huggingface runtime
  Signed-off-by: Dan Sun <[email protected]>
* Add option to return token type ids
  Signed-off-by: Dan Sun <[email protected]>
* Add option to return token type ids
  Signed-off-by: Dan Sun <[email protected]>
* Default model dir
  Signed-off-by: Dan Sun <[email protected]>
---------
Signed-off-by: Dan Sun <[email protected]>
Add padding and truncation in huggingface tokenizer Signed-off-by: Kevin Mingtarja <[email protected]> Signed-off-by: Kevin Mingtarja <[email protected]>
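For readers unfamiliar with the two terms in the commit above, the following is a small pure-Python sketch of what padding and truncation do to a batch of token-ID sequences. It is not the Hugging Face tokenizer API and not the code from this commit; the function name `pad_and_truncate` and the parameter `pad_id` are assumptions made for illustration.

```python
def pad_and_truncate(batch, max_length, pad_id=0):
    """Truncate each sequence to max_length, then right-pad to a common width.

    Returns (input_ids, attention_mask); the mask holds 1 for real tokens
    and 0 for padding positions.
    """
    # Truncation: drop tokens beyond max_length.
    truncated = [seq[:max_length] for seq in batch]
    # Padding target: the longest sequence left after truncation.
    width = max((len(seq) for seq in truncated), default=0)
    input_ids = [seq + [pad_id] * (width - len(seq)) for seq in truncated]
    attention_mask = [
        [1] * len(seq) + [0] * (width - len(seq)) for seq in truncated
    ]
    return input_ids, attention_mask
```

Truncation bounds the sequence length a model receives, while padding gives every sequence in the batch the same length so the batch can be stacked into one rectangular tensor.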
kserve#3456)
* Fix model dir does not work with vllm
  Signed-off-by: Dan Sun <[email protected]>
* Fix extra comma
  Signed-off-by: Dan Sun <[email protected]>
---------
Signed-off-by: Dan Sun <[email protected]>
…et (kserve#3390)
* Fix azure workload identity federation by excluding azure client secret
  Signed-off-by: Robbert van der Gugten <[email protected]>
* comment code
  Signed-off-by: Robbert van der Gugten <[email protected]>
---------
Signed-off-by: Robbert van der Gugten <[email protected]>
Co-authored-by: Robbert van der Gugten <[email protected]>
Signed-off-by: Dan Sun <[email protected]>
Signed-off-by: Sivanantham Chinnaiyan <[email protected]>
…kserve#3463)
* change certificate field for s3 storageSecret in python
  Signed-off-by: jooho <[email protected]>
* change the field name to ca_bundle that is the same pattern with aws
  Signed-off-by: jooho <[email protected]>
---------
Signed-off-by: jooho <[email protected]>
Please enter a commit message to explain why this merge is necessary,
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: spolti
The full list of commands accepted by this bot can be found here.
The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@spolti: The following tests failed, say
/retest
What this PR does / why we need it:
Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged):
Fixes #
Type of changes
Please delete options that are not relevant.
Feature/Issue validation/testing:
Please describe the tests that you ran to verify your changes and relevant result summary. Provide instructions so it can be reproduced.
Please also list any relevant details for your test configuration.
Test A
Test B
Logs
Special notes for your reviewer:
Checklist:
Release note: