
PDFFigure2 might do a better job of separating figures than the current model #322

Open
X-Bruce-Y opened this issue Oct 28, 2024 · 1 comment

Comments

@X-Bruce-Y

Hi VikParuchuri, I'm really grateful that you shared such a nice tool. In one case I found that PDFFigure2 does a better job of separating figures on the same PDF: https://www.thelancet.com/journals/ebiom/article/PIIS2352-3964(22)00586-2/fulltext (open access, downloadable). Specifically, Figure 5 is chunked differently.


marker (split into 2 pieces, with one part of the original figure missing overall)

image

and

image

PDFFigure2

image


I'm no expert; I used PDFFigure2 through https://github.com/MuiseDestiny/zotero-figure.

I hope you find something useful there.

Cheers,
Bruce

@X-Bruce-Y

Forgot to mention: marker also extracts tiny icons in this example, which is undesirable. It would be better to have a flag for filtering out images (based on size, etc.).
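To make the request concrete, here is a rough sketch of the kind of size-based filter I have in mind. This is not marker's API: `ExtractedImage`, `filter_small_images`, and the threshold parameters are hypothetical names and values chosen only for illustration, and a real implementation would read the dimensions from the extracted image files.

```python
from typing import List, NamedTuple


class ExtractedImage(NamedTuple):
    # Hypothetical record for an extracted image; not a marker type.
    name: str
    width: int   # pixels
    height: int  # pixels


def filter_small_images(
    images: List[ExtractedImage],
    min_width: int = 100,   # assumed default, tune per document
    min_height: int = 100,  # assumed default, tune per document
    min_area: int = 0,
) -> List[ExtractedImage]:
    """Keep only images that meet every minimum size threshold."""
    return [
        img
        for img in images
        if img.width >= min_width
        and img.height >= min_height
        and img.width * img.height >= min_area
    ]


images = [
    ExtractedImage("figure5.png", 1200, 900),
    ExtractedImage("icon.png", 24, 24),
]
kept = filter_small_images(images)
print([img.name for img in kept])
```

With thresholds like these, the icon would be dropped and the full figure kept; a command-line flag could simply expose the minimum width and height.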
