Skip to content

Commit

Permalink
Merge pull request #40 from goldmermaid/0722
Browse files Browse the repository at this point in the history
add pdf to markdown sample notebook
  • Loading branch information
Cambio ML authored Jul 22, 2024
2 parents 241308f + d7d2bdb commit 89579a3
Show file tree
Hide file tree
Showing 3 changed files with 324 additions and 2 deletions.
7 changes: 5 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -38,8 +38,11 @@ bash parse.sh gl************************************** /path/to/your/file.pdf "
AnyParser can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run AnyParser within 10 lines of code!
### [Extract a Table from PDF into Excel](https://github.com/CambioML/any-parser/blob/main/demo/pdf_to_html_to_excel.ipynb)
Do you want to extract a complicated table from a financial report (PDF) into Excel spread sheet? Check out this notebook (3-min read)!
### [Extract all text and layout from PDF into Markdown Format](https://github.com/CambioML/any-parser/blob/main/examples/pdf_to_markdown.ipynb)
Are you an AI engineer who need to ACCURATELY extract both the text and its layout (e.g. table of content or markdown headers hierarchy) from a PDF. Check out this notebook demo (3-min read)!
### [Extract a Table from an Image into Markdown Format](https://github.com/CambioML/any-parser/blob/main/examples/extract_table_from_image_to_markdown.ipynb)
Are you a financial analyst who need to extract ACCURATE number from a table in an image or a PDF. Check out this notebook (3-min read)!
### [Extract a Table from PDF into Excel](https://github.com/CambioML/any-parser/blob/main/examples/pdf_to_html_to_excel.ipynb)
Do you want to extract a complicated table from a financial report (PDF) into Excel spread sheet? Check out this notebook (3-min read)!
319 changes: 319 additions & 0 deletions examples/pdf_to_markdown.ipynb

Large diffs are not rendered by default.

Binary file added examples/sample_data/stoxx_index_guide_0003.pdf
Binary file not shown.

0 comments on commit 89579a3

Please sign in to comment.