GitHub - asif-faizan/PDF_to_Excel1: pdf to excel

README - PDF to Excel Conversion with OCR

This Python script converts a PDF document into an Excel file using Adobe PDF Services API. It supports OCR (Optical Character Recognition) for scanned PDFs to extract text.

Features:
- Converts PDFs to Excel format (.xlsx).
- Supports OCR for extracting text from scanned PDFs.
- Logs all operations to a log file with the current date.
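The dated log file mentioned in the features could be set up roughly as follows. This is a minimal sketch using only the standard library, not the script's actual code: the function name `make_dated_logger`, the `pdf_to_excel_<date>.log` file name pattern, and the log line format are all assumptions made for illustration.

```python
import logging
from datetime import date
from pathlib import Path
from typing import Optional, Tuple


def make_dated_logger(log_dir: str, today: Optional[date] = None) -> Tuple[logging.Logger, Path]:
    """Return a logger writing to <log_dir>/pdf_to_excel_<YYYY-MM-DD>.log.

    The file name pattern is hypothetical; the README only says the log
    file carries the current date.
    """
    day = today or date.today()
    directory = Path(log_dir)
    directory.mkdir(parents=True, exist_ok=True)
    log_path = directory / f"pdf_to_excel_{day.isoformat()}.log"

    logger = logging.getLogger("pdf_to_excel")
    logger.setLevel(logging.INFO)
    if not logger.handlers:  # avoid attaching a second handler on repeated calls
        handler = logging.FileHandler(log_path, encoding="utf-8")
        handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
        logger.addHandler(handler)
    return logger, log_path
```

Calling `logger.info(...)` and `logger.error(...)` on the returned logger would then append operations and failures to that day's file.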

Requirements:
- adobe-pdfservices-sdk
- python-dotenv
- Valid Adobe PDF Services credentials.

Usage:

Set up a .env file with your Adobe API credentials.

Run the script using:

python script.py --file <path_to_pdf> --output <output_directory>

If --output is not specified, the Excel file is saved in the input file’s directory.
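The command-line behaviour described above, including the fallback to the input file's directory, might be handled along these lines. This is a sketch under stated assumptions rather than the script's real implementation: the helper names `parse_args` and `resolve_output_path` are hypothetical, and so is the choice to reuse the PDF's base name for the .xlsx file.

```python
import argparse
from pathlib import Path
from typing import List, Optional


def parse_args(argv: Optional[List[str]] = None) -> argparse.Namespace:
    """Parse the --file and --output options named in the usage section."""
    parser = argparse.ArgumentParser(description="Convert a PDF to Excel.")
    parser.add_argument("--file", required=True, help="Path to the input PDF")
    parser.add_argument("--output", default=None, help="Output directory (optional)")
    return parser.parse_args(argv)


def resolve_output_path(pdf_file: str, output_dir: Optional[str] = None) -> Path:
    """Pick where the .xlsx goes: --output if given, else the PDF's own directory."""
    pdf_path = Path(pdf_file)
    target_dir = Path(output_dir) if output_dir else pdf_path.parent
    # Assumption: the Excel file keeps the PDF's base name.
    return target_dir / (pdf_path.stem + ".xlsx")
```

With this sketch, `--file reports/july.pdf` alone would resolve to `reports/july.xlsx`, while adding `--output out` would resolve to `out/july.xlsx`.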

Environment Variables: Set the following in your .env file:

PDF_SERVICES_CLIENT_ID=<your_client_id>
PDF_SERVICES_CLIENT_SECRET=<your_client_secret>

This script logs all actions and errors to a log file located in the specified directory.
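Reading the two environment variables listed above, and failing early when one is missing, could look something like this. It is a minimal sketch: `load_env_file` and `read_credentials` are hypothetical helper names, and only `load_dotenv` from python-dotenv is a real library call. How the script actually passes the values to the Adobe SDK is not shown here.

```python
import os
from typing import Dict, Mapping, Optional

REQUIRED_VARS = ("PDF_SERVICES_CLIENT_ID", "PDF_SERVICES_CLIENT_SECRET")


def load_env_file() -> None:
    """Load .env into os.environ when python-dotenv is installed."""
    try:
        from dotenv import load_dotenv  # provided by the python-dotenv package
    except ImportError:
        return
    load_dotenv()  # looks for a .env file and exports its KEY=value pairs


def read_credentials(env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Return the Adobe credentials, raising if any variable is unset or empty."""
    source = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not source.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: source[name] for name in REQUIRED_VARS}
```

A script would typically call `load_env_file()` once at startup and then `read_credentials()` before contacting the API, so a misconfigured .env file is reported before any conversion is attempted.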

