You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was doing some code completion evaluation using codefuseEval, on the Qwen2.5-Coder base model. When I ran a Java evaluation, I saw a lot of <|fim_prefix|> and code> markups in the generated code. So I followed the issue #99 and added the special tokens to the tokenizer, as follows:
Then, when I ran the evaluation again after modifying the evaluation code, adding the lines above, the <|fim_prefix|> markups are gone, but the code> markups are still there.
What do I need to do in order to get rid of the code> markups?
The text was updated successfully, but these errors were encountered:
ytxmobile98
changed the title
Why do I have a lot of code> in generated Java code?
Why do I have a lot of code> in generated Java code? What should I do to get rid of them?
Oct 25, 2024
I was doing some code completion evaluation using codefuseEval, on the Qwen2.5-Coder base model. When I ran a Java evaluation, I saw a lot of
<|fim_prefix|>
andcode>
markups in the generated code. So I followed the issue #99 and added the special tokens to the tokenizer, as follows:Then, when I ran the evaluation again after modifying the evaluation code, adding the lines above, the
<|fim_prefix|>
markups are gone, but thecode>
markups are still there.What do I need to do in order to get rid of the
code>
markups?The text was updated successfully, but these errors were encountered: