Releases: tylerdq/pdfca

2.3.0

19 Sep 19:57

New feature, improved functionality, and bugfixes/improvements:

  • New option: "Extract" command now includes a "walk" option that scans recursively through a directory tree
  • "Search" command indexing and drill-down now use numerical keys for simplicity
  • Minor bugfixes and improvements
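
As an illustration of what a recursive "walk" scan involves, here is a minimal sketch using only the Python standard library. The function name find_pdfs and its walk parameter are hypothetical and are not taken from pdfca's code; the actual "Extract" implementation may differ.

```python
from pathlib import Path


def find_pdfs(root, walk=False):
    """Return a sorted list of PDF paths under root.

    Hypothetical helper, not pdfca's API: with walk=False only the top
    level of root is scanned; with walk=True the whole tree is scanned.
    """
    root = Path(root)
    # rglob("*") descends into subdirectories; iterdir() stays at the top level.
    candidates = root.rglob("*") if walk else root.iterdir()
    # Compare the suffix case-insensitively so ".PDF" files are included.
    return sorted(
        p for p in candidates if p.is_file() and p.suffix.lower() == ".pdf"
    )
```

Calling find_pdfs("papers", walk=True) would also return PDFs in nested folders, while the default call lists only the files directly inside "papers".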

2.2.1

16 Aug 13:32

This is a bugfix release:

  • Fixed critical error in file listing that broke most extraction scenarios
  • Dataframe saves now run in a separate thread to prevent data corruption if the program is aborted
  • Added more detailed feedback to the "Extract" command
  • Some minor fixes to documentation
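
The release notes do not say how the threaded save is implemented. One common pattern, sketched below under that assumption, is to perform the write on a non-daemon worker thread: in CPython, Ctrl-C raises KeyboardInterrupt only in the main thread, so the worker can finish writing before the interpreter exits. The name save_in_thread is hypothetical, and raw bytes stand in for a serialized dataframe.

```python
import threading
from pathlib import Path


def save_in_thread(data: bytes, destination):
    """Write data to destination on a worker thread and wait for it.

    Hypothetical sketch, not pdfca's code. If the main thread is
    interrupted while waiting in join(), the non-daemon worker still
    runs to completion before the interpreter shuts down.
    """
    destination = Path(destination)

    def _write():
        destination.write_bytes(data)

    worker = threading.Thread(target=_write)  # non-daemon by default
    worker.start()
    worker.join()
```

This guards against an interrupted main thread, not against the process being killed outright or a power loss.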

2.2.0

13 Aug 21:46

New features, improved functionality, and bugfixes:

  • New command: "Convert" now allows for converting binaries between ".feather" and ".parquet" formats
  • Updated command: "Search" now asks to drill down into a more detailed single-PDF view after displaying results
  • Updated command: "Search" option "truncate" is now "number" and requires an integer
  • Updated command: "Search" option "group" has been temporarily removed until more attributes are added
  • Updated command: "Extract" now has a "report" option for viewing overall status or saving detailed status to .csv
  • Updated command: "Extract" now has an "incremental" option for saving the dataframe after processing each file
  • Various bugfixes and improvements

2.1.1

12 Aug 18:13

This is a bugfix update:

  • Fixed critical file-saving error when running as a package
  • Updated a docstring to match readme

2.1.0

12 Aug 17:37

Added new features and improved stability, reliability, ease of installation, and user feedback.

  • Added support to install package from local clone using pip
  • New command: "Summarize" (provides summary statistics, as the "View" command previously did)
  • Updated command: "View" (now provides the default Pandas view into the dataframe)
  • Added new options to "View" command to support better data introspection
  • Rewrote "Init" command confirmation message to be clearer
  • Made the output of the "Cut" command more declarative and clear
  • Added a .yml file to support conda users (a conda package may be coming)
  • Improved exception handling in various cases
  • Improved binary verification logic
  • Updated internal and external documentation
  • Fixed issue with unreliable term matching
  • Fixed bug with binary verification exception handling
  • Various minor code optimizations and improvements

2.0.0

21 Jun 20:31

Refocused, rebuilt, and refactored program from the ground up.

1.0.0

05 Dec 15:29

Script functionality working as intended.