
gte-Qwen2-7b-instruct and related Qwen2 models have incorrect max_token size #6

Open
tosaddler opened this issue Jul 10, 2024 · 4 comments · Fixed by #7 or #13

Comments

@tosaddler

Not sure where the info is pulled from, but it is showing 131072 rather than 32768.

[screenshots showing the reported max_token value]
@orionw orionw mentioned this issue Jul 11, 2024
@orionw
Collaborator

orionw commented Jul 11, 2024

Thanks @tosaddler for noticing, will fix!

@orionw orionw closed this as completed in #7 Jul 11, 2024
@orionw
Collaborator

orionw commented Jul 11, 2024

Seems like it may be re-appearing. Will take a deeper look later.

@orionw orionw reopened this Jul 11, 2024
@orionw
Collaborator

orionw commented Jul 11, 2024

I can make a manual fix for this but would have to write it in.

FWIW, this is caused by extracting it from their config.json file. I think it's a bug on their part. We can either provide some code to override the extraction or we can leave it.

Thoughts @KennethEnevoldsen?

@KennethEnevoldsen
Contributor

Ahh that is frustrating - I would probably create an overwrite on our end (create some sort of file which takes priority).
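A minimal sketch of what such an override could look like: a dictionary of manual values that takes priority over whatever is extracted from the model's config.json. All names here (`MAX_TOKENS_OVERRIDES`, `get_max_tokens`, the model identifier) are assumptions for illustration, not the repository's actual code.

```python
import json

# Hypothetical manual-override table that takes priority over config.json.
# The Qwen2 GTE config.json reports max_position_embeddings = 131072,
# while the intended max_token size is 32768.
MAX_TOKENS_OVERRIDES = {
    "Alibaba-NLP/gte-Qwen2-7B-instruct": 32768,
}

def get_max_tokens(model_name: str, config_path: str) -> int:
    """Return the max token size for a model, preferring a manual
    override over the value extracted from its config.json."""
    if model_name in MAX_TOKENS_OVERRIDES:
        return MAX_TOKENS_OVERRIDES[model_name]
    with open(config_path) as f:
        config = json.load(f)
    # Fall back to the extracted value, which may be wrong upstream.
    return config["max_position_embeddings"]
```

Models without an entry in the table keep the extracted value, so only known-bad configs need to be listed.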
