1.3.0
Release Notes for Loghi-HTR Version 1.3.0
Date: 2023-11-14
Overview
In version 1.3.0, we've introduced significant improvements, including enhanced normalization features for CER and CER lower, a simplified confidence interval, and various API enhancements. Fixes have been made to the ResidualBlock implementation and freezing mechanism, and models now automatically save in the new .keras file format. Several changes have also been made to the API to improve usability and performance.
New Features
- Normalization for CER and CER Lower: Added functionality to normalize for Character Error Rate (CER) and its lower case version using the
--normalization_file
argument. This update also displays the ground truth and prediction in a normalized form. - Simplified Confidence Interval: Introduced a more straightforward method for calculating confidence intervals.
Enhancements
- Model File Format: Models now automatically get saved in the new
.keras
file format, while still supporting loading of both.pb
and.keras
files.
Bug Fixes
- ResidualBlock Implementation Fix: Addressed an issue where saving a model and then continuing training was not working properly.
- ResidualBlock Freezing Fix: Corrected the freezing of convolutional layers in the residual blocks with
--freeze_conv_layers
.
API Specific Changes
- Environment Variable Simplifications: Removed the necessity of
LOGHI_INPUT_CHANNELS
andLOGHI_CHARLIST_PATH
environment variables, which are now read directly from the model'sconfig.json
andcharlist.txt
respectively. - Reduced OOM Errors: Enhanced batch processing to split recursively on Out-Of-Memory (OOM) errors, failing only the problematic image instead of the entire batch.
- Improved Image Padding: Adjusted image padding during processing for better alignment with training, marginally improving confidence and output.
- Dynamic Model Switching in API: Introduced the ability to switch models during an API call using the "model" field, though it's advised to use caution as it can slow down inference.
- Error Output for Failed Predictions: Text line images that fail during prediction are now outputted to
LOGHI_OUTPUT_PATH/group_id/identifier.error
with the specific error message.
Contributors
- @Thelukepet: Major contributions to normalization for CER and CER lower, and the simplified confidence interval.
- @TimKoornstra: Significant contributions across various aspects including bug fixes, API enhancements, and overall improvements.
Full changelog: 1.2.10...1.3.0