-
Notifications
You must be signed in to change notification settings - Fork 8
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
1c06041
commit fc6dbc6
Showing
2 changed files
with
2 additions
and
14 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -4,7 +4,7 @@ AnyParser provides an API to accurately extract your unstructured data (e.g. PDF | |
|
||
## :seedling: Set up your AnyParser API key | ||
|
||
AnyParser is still in private beta. If you are interested in testing our document models, please reach out at [email protected] for a FREE testing API key. | ||
You can generate your keys at the [Playground Account Page](https://www.cambioml.com/account) with up to 2 keys and 100 total free pages per account. If you're interested in more AnyParser usage and applications, please reach out at [email protected] for details. | ||
|
||
|
||
To set up your API key `CAMBIO_API_KEY`, you will need to : | ||
|
@@ -38,17 +38,6 @@ If you want to run pdf_to_markdown.ipynb, install the following: | |
choco install poppler | ||
``` | ||
## bashfile usage | ||
To use AnyParser via `curl` requests, you can run the following bash command from the root folder of this repository: | ||
``` | ||
bash parse.sh <your apiKey> <file path> <prompt for parse (optional, default="")> | ||
``` | ||
For example, to extract a table from a PDF file, you can run the following command: | ||
``` | ||
bash parse.sh gl************************************** /path/to/your/file.pdf "Return the table in a JSON format with each box's key and value." | ||
``` | ||
## :scroll: Examples | ||
AnyParser can extract text, numbers and symbols from PDF, images, etc. Check out each notebook below to run AnyParser within 10 lines of code! | ||
|
@@ -59,5 +48,3 @@ Are you an AI engineer who need to ACCURATELY extract both the text and its layo | |
### [Extract a Table from an Image into Markdown Format](https://github.com/CambioML/any-parser/blob/main/examples/extract_table_from_image_to_markdown.ipynb) | ||
Are you a financial analyst who need to extract ACCURATE number from a table in an image or a PDF. Check out this notebook (3-min read)! | ||
### [Extract a Table from PDF into Excel](https://github.com/CambioML/any-parser/blob/main/examples/pdf_to_html_to_excel.ipynb) | ||
Do you want to extract a complicated table from a financial report (PDF) into Excel spread sheet? Check out this notebook (3-min read)! |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -9,6 +9,7 @@ | |
|
||
from any_parser import AnyParser # noqa: E402 | ||
|
||
|
||
if __name__ == "__main__": | ||
load_dotenv() | ||
|
||
|