Add missing scores parameter in MLPutTrainedModelVocabularyRequest #2321

Merged
dolaru merged 4 commits into main from ml-put-vocab-scores on Oct 31, 2023

Conversation

dolaru
Member

@dolaru dolaru commented Oct 31, 2023

This adds the missing `scores` parameter to `MLPutTrainedModelVocabularyRequest`. The parameter was introduced in v8.9.0.

Docs say:

> (Optional, array) Vocabulary value scores used by sentence-piece tokenization. Must have the same length as vocabulary. Required for unigram sentence-piece tokenized models like XLMRoberta and T5.
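To make the length constraint in the quoted docs concrete, here is a minimal sketch of a request body shape with an optional `scores` array, plus a client-side check that `scores` and `vocabulary` have the same length. The interface name `PutVocabularyBody` and the helper `validateVocabularyBody` are hypothetical and exist only for illustration. They are not the definitions added by this PR, and the numeric element type of `scores` is an assumption.

```typescript
// Hypothetical body shape for illustration only; not the spec definition from this PR.
interface PutVocabularyBody {
  // Vocabulary entries for the tokenizer.
  vocabulary: string[]
  // Optional per-entry scores (assumed numeric) used by sentence-piece tokenization.
  scores?: number[]
}

// Returns a list of problems; an empty list means the body passes this check.
function validateVocabularyBody(body: PutVocabularyBody): string[] {
  const errors: string[] = []
  // The docs state that scores must have the same length as vocabulary.
  if (body.scores !== undefined && body.scores.length !== body.vocabulary.length) {
    errors.push(
      `scores has ${body.scores.length} entries but vocabulary has ${body.vocabulary.length}`
    )
  }
  return errors
}

// Usage: one score is missing, so the check reports a single problem.
const problems = validateVocabularyBody({
  vocabulary: ['<s>', '</s>', 'hello'],
  scores: [0.0, 0.0]
})
console.log(problems)
```

Because `scores` is optional, a body that omits it passes this check. Whether a given model actually requires scores (as the docs say for unigram sentence-piece models) is outside what this sketch verifies.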

@github-actions
Contributor

Below are the validation results for the API you have changed.

| API | Status | Request | Response |
| --- | --- | --- | --- |
| ml.put_trained_model_vocabulary | | Missing test | Missing test |

You can validate this API yourself by using the make validate target.

Member

@davidkyle davidkyle left a comment

LGTM

@dolaru dolaru merged commit e279583 into main Oct 31, 2023
14 of 15 checks passed
@dolaru dolaru deleted the ml-put-vocab-scores branch October 31, 2023 12:39
github-actions bot pushed a commit that referenced this pull request Oct 31, 2023
#2321)

This adds the missing scores parameter that was introduced in v8.9.0.

[Docs](https://www.elastic.co/guide/en/elasticsearch/reference/8.9/put-trained-model-vocabulary.html#ml-put-trained-model-vocabulary-request-body) say:

> (Optional, array) Vocabulary value scores used by sentence-piece tokenization. Must have the same length as vocabulary. Required for unigram sentence-piece tokenized models like XLMRoberta and T5.

(cherry picked from commit e279583)
dolaru added a commit that referenced this pull request Oct 31, 2023
#2321) (#2324)

Co-authored-by: David Olaru <[email protected]>
dolaru added a commit that referenced this pull request Oct 31, 2023
#2321) (#2323)

dolaru added a commit that referenced this pull request Oct 31, 2023
#2321) (#2322)

2 participants