-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix router bugs on max_new_tokens and dataprep gaudi yaml file #273
Conversation
Signed-off-by: zhlsunshine <[email protected]>
Signed-off-by: zhlsunshine <[email protected]>
Signed-off-by: zhlsunshine <[email protected]>
} | ||
// Merge init request into respReq | ||
for key, value := range initReqData[Parameters].(map[string]interface{}) { | ||
/*if _, exists := respReqData[key]; !exists { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove this if not needed
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @KfreeZ, just let's keep it there, because let's see if they need to keep the response instead overwriting by initial request.
@@ -198,6 +199,32 @@ func executeStep( | |||
return callService(step, serviceURL, input, headers) | |||
} | |||
|
|||
func mergeRequests(respReq []byte, initReqData map[string]interface{}) []byte { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just curious, why the handleEnsemblePipeline does not need this mergeRequest?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @KfreeZ, based on current implementation, EnsemblePipeline have no response case, so just let it there.
Description
Fix router bugs on max_new_tokens and dataprep gaudi yaml file. And remove
^M
from dataprep xeon yaml file.This max_new_tokens bug was found when doing performance test by @leslieluyu
Issues
#272
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
n/a
.Tests
Set max_new_tokens=17
Set max_new_tokens=50