Hello, I am trying to fine-tune TrOCR to recognize LaTeX math expressions on my custom dataset.
Do I need to train a custom tokenizer for the LaTeX format?
Or is the pretrained tokenizer included with TrOCR good enough?
Any pointers or help would be greatly appreciated.
Thank you!
Yes, my results using the pre-trained tokenizer were not satisfactory. I believe it's necessary to train a dedicated tokenizer based on mathematical symbols. Perhaps one already exists.
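To make the idea of a tokenizer "based on mathematical symbols" concrete, here is a minimal sketch in plain Python. It is not TrOCR's tokenizer or any library's API: the regex, the special-token names, and the helpers `tokenize_latex`, `build_vocab`, and `encode` are all assumptions made for illustration. It treats each LaTeX command (such as `\frac`) as a single token and every other non-space character as its own token, then builds a vocabulary from your training labels.

```python
import re
from collections import Counter

# Assumed token pattern (illustrative, not from TrOCR):
#   \\[A-Za-z]+  a LaTeX command such as \frac or \alpha
#   \\.          an escaped character such as \{ or \\
#   \S           any other single non-whitespace character
LATEX_TOKEN = re.compile(r"\\[A-Za-z]+|\\.|\S")

# Hypothetical special tokens; a real setup would match the decoder's own.
SPECIAL_TOKENS = ["<pad>", "<s>", "</s>", "<unk>"]


def tokenize_latex(expr):
    """Split a LaTeX string into commands and single symbols."""
    return LATEX_TOKEN.findall(expr)


def build_vocab(expressions, min_count=1):
    """Map each token seen at least min_count times to an integer id."""
    counts = Counter(tok for e in expressions for tok in tokenize_latex(e))
    tokens = sorted(t for t, c in counts.items() if c >= min_count)
    return {tok: i for i, tok in enumerate(SPECIAL_TOKENS + tokens)}


def encode(expr, vocab):
    """Convert a LaTeX string to ids, wrapped in start and end markers."""
    unk = vocab["<unk>"]
    body = [vocab.get(t, unk) for t in tokenize_latex(expr)]
    return [vocab["<s>"]] + body + [vocab["</s>"]]


if __name__ == "__main__":
    labels = [r"x^2", r"\frac{x}{y}"]  # stand-in for your dataset's labels
    vocab = build_vocab(labels)
    print(tokenize_latex(r"\frac{x}{y}"))
    print(encode(r"x^2", vocab))
```

This only covers the vocabulary side. Using such a vocabulary with TrOCR would also mean making the decoder's embedding and output layers match the new vocabulary size, which is not shown here.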