Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some labels are missing #52

Open
alireza-hariri opened this issue Nov 7, 2024 · 1 comment
Open

some labels are missing #52

alireza-hariri opened this issue Nov 7, 2024 · 1 comment

Comments

@alireza-hariri
Copy link

alireza-hariri commented Nov 7, 2024

image
I just noticed that some words in the cover image are missing.

I couldn't find any code for generating this dataset from the original docs to suggest an edit.

Note: The second error in the image is the word "second" which splited with a dash. This err makes sense but I couldn't reason about the first error.

@alireza-hariri
Copy link
Author

alireza-hariri commented Nov 7, 2024

after more inspection i found some other problems

but there are some other problems with box sizes:

  1. There are a lot of boxes with zero width or height (even when the label is "paragraph" and the token doesn't include "Line##" )
  2. There are a lot of boxes (with paragraph label) that are too tall (see the image)

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant