Documenting edge-case-y inspection PDFs for parser testing #22

Closed
jsvine opened this issue Dec 21, 2022 · 7 comments
Labels
documentation Improvements or additions to documentation

Comments

jsvine commented Dec 21, 2022

As preparation for a more comprehensive parsing of the inspection reports, I think it'll be helpful to document some of the quirks we're seeing in the PDFs. Here's a start:

jsvine added the documentation label Dec 21, 2022

jsvine commented Jan 12, 2023

Here's another, where the species names span multiple lines: 185280e3821720b9 (uploaded)

Screen Shot
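
One way a parser might cope with wrapped species names is to fold continuation rows back into the row above. This is only a rough sketch: it assumes the table has already been extracted into `(count, name)` string pairs and that a wrapped line arrives with an empty count cell. Both the input shape and the `merge_wrapped_species` helper are hypothetical, not taken from the project's parser.

```python
def merge_wrapped_species(rows):
    """Fold continuation rows (empty count cell) into the preceding row.

    `rows` is a hypothetical list of (count, name) string pairs, one per
    extracted table line. This shape is an assumption for illustration.
    """
    merged = []
    for count, name in rows:
        count, name = count.strip(), name.strip()
        if not count and merged:
            # No count on this line: treat it as the tail of the previous name.
            prev_count, prev_name = merged[-1]
            merged[-1] = (prev_count, f"{prev_name} {name}".strip())
        elif count or name:
            # A normal row (or a leading orphan line, kept as-is).
            merged.append((count, name))
    return merged
```

If the real PDFs put the count on a different line of the wrapped row, the merge direction would need to change.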

jsvine commented Jan 12, 2023

Another, where the species list is blank, but there's still a "Total" row: ccda727387d4c850 (uploaded)

Screen Shot
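
For the blank-list case, a parser probably wants to treat the "Total" row separately from the species rows, so that an empty list comes back as empty rather than as an error. A minimal sketch, assuming each row is a line of text that starts with a numeric count; the exact row layout, the regexes, and the `parse_species_block` name are all assumptions for illustration.

```python
import re

# Assumed row shapes: "<count> Total" and "<count> <species text>".
TOTAL_RE = re.compile(r"^\s*(\d+)\s+Total\s*$", re.IGNORECASE)
ROW_RE = re.compile(r"^\s*(\d+)\s+(.+?)\s*$")


def parse_species_block(lines):
    """Return (species_rows, total); species_rows may be empty."""
    species = []
    total = None
    for line in lines:
        if not line.strip():
            continue  # skip blank lines inside the block
        m = TOTAL_RE.match(line)
        if m:
            total = int(m.group(1))
            continue
        m = ROW_RE.match(line)
        if m:
            species.append((int(m.group(1)), m.group(2)))
    return species, total
```

Checking the "Total" pattern first keeps that row from being mistaken for a species named "Total".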

jsvine commented Jan 12, 2023

Here's a fun one — "Page {cp} of 1": 22c3072fd5740ef1 (uploaded)

Screen Shot
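
A footer matcher could tolerate the unfilled `{cp}` placeholder by accepting it where the current page number normally goes. A small sketch, assuming the footer otherwise reads "Page N of M"; the `parse_page_footer` helper is hypothetical.

```python
import re

# Accept either a real page number or the literal, unfilled "{cp}" placeholder.
PAGE_RE = re.compile(r"Page\s+(\d+|\{cp\})\s+of\s+(\d+)")


def parse_page_footer(text):
    """Return (current_page or None, total_pages), or None if no footer is found."""
    m = PAGE_RE.search(text)
    if m is None:
        return None
    current, total = m.group(1), int(m.group(2))
    # The placeholder carries no page number, so report it as None.
    return (int(current) if current.isdigit() else None, total)
```

Returning `None` for the current page leaves it to the caller to decide whether to infer the page number from position in the document.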

mbpell commented Jan 12, 2023 via email

jsvine commented Jan 12, 2023

> A zoo?

Indeed, lots of zoos in the data!

jsvine commented Feb 10, 2023

Closing this issue since the core related tasks are done, but will pin it for future reference.

jsvine closed this as completed Feb 10, 2023
jsvine pinned this issue Feb 10, 2023

jsvine commented Apr 6, 2023

Here's something that looks like a violation heading, but (a) does have an actual statute citation, and (b) appears, on cross-referencing with the web portal metadata, not actually to be a violation that APHIS is counting — 0db69ec135a5b244:

Screenshot
