tesseract: 5.3.4 -> 5.4.1 #317800

Draft
wants to merge 1 commit into base: master


@r-ryantm (Contributor) commented Jun 6, 2024

Automatic update generated by nixpkgs-update tools. This update was made based on information from https://github.com/tesseract-ocr/tesseract/releases.

meta.description for tesseract is: OCR engine

meta.homepage for tesseract is: https://github.com/tesseract-ocr/tesseract

Updates performed
  • Version update
To inspect upstream changes
Impact

Checks done


  • built on NixOS
  • The tests defined in passthru.tests, if any, passed
  • found 5.4.0 with grep in /nix/store/h0cxkwwqg8kklsi2q2ricraakp0hm0hn-tesseract-5.4.0
  • found 5.4.0 in filename of file in /nix/store/h0cxkwwqg8kklsi2q2ricraakp0hm0hn-tesseract-5.4.0

Rebuild report (if merged into master)
58 total rebuild path(s)

58 package rebuild(s)

First fifty rebuilds by attrpath

almanah
arcan
arcan-all-wrapped
arcan-wrapped
browsr
cat9-wrapped
durden-wrapped
evolution
evolution-ews
evolutionWithPlugins
gImageReader
gnome-frog
gscan2pdf
invoice2data
kdePackages.skanpage
khoj
libsForQt5.mauikit-imagetools
libsForQt5.pix
manga-cli
mcomix
obs-studio-plugins.advanced-scene-switcher
ocrmypdf
paperless-ngx
pdfsandwich
perl536Packages.ImageOCRTesseract
perl538Packages.ImageOCRTesseract
pipeworld-wrapped
plasma5Packages.mauikit-imagetools
plasma5Packages.pix
prio-wrapped
private-gpt
python311Packages.layoutparser
python311Packages.llama-index
python311Packages.llama-index-readers-file
python311Packages.llama-index-readers-s3
python311Packages.ocrmypdf
python311Packages.pdf2docx
python311Packages.private-gpt
python311Packages.pymupdf
python311Packages.pytesseract
python311Packages.pytikz-allefeld
python311Packages.videocr
python312Packages.layoutparser
python312Packages.ocrmypdf
python312Packages.pdf2docx
python312Packages.pymupdf
python312Packages.pytesseract
python312Packages.pytikz-allefeld
python312Packages.videocr
Instructions to test this update

Either download from Cachix:

nix-store -r /nix/store/h0cxkwwqg8kklsi2q2ricraakp0hm0hn-tesseract-5.4.0 \
  --option binary-caches 'https://cache.nixos.org/ https://nix-community.cachix.org/' \
  --option trusted-public-keys '
  nix-community.cachix.org-1:mB9FSh9qf2dCimDSUo8Zy7bkq5CX+/rkCWyvRCYg3Fs=
  cache.nixos.org-1:6NCHdD59X431o0gWypbMrAURkbJ16ZPMQFGspcDShjY=
  '

(The Cachix cache is only trusted for this store-path realization.)
For the Cachix download to work, your user must be in the trusted-users list or you can use sudo since root is effectively trusted.

Or, build yourself:

nix-build -A tesseract https://github.com/r-ryantm/nixpkgs/archive/b452039b94dc42b34a1d035d0aff3e95fc129eca.tar.gz

Or:

nix build github:r-ryantm/nixpkgs/b452039b94dc42b34a1d035d0aff3e95fc129eca#tesseract

After you've downloaded or built it, look at the files and, if there are any binaries, run them:

ls -la /nix/store/h0cxkwwqg8kklsi2q2ricraakp0hm0hn-tesseract-5.4.0
ls -la /nix/store/h0cxkwwqg8kklsi2q2ricraakp0hm0hn-tesseract-5.4.0/bin


Pre-merge build results

We have automatically built all packages that will get rebuilt due to
this change.

This gives evidence on whether the upgrade will break dependent packages.
Note sometimes packages show up as failed to build independent of the
change, simply because they are already broken on the target branch.

Result of nixpkgs-review --extra-nixpkgs-config '{ allowInsecurePredicate = x: true; }' run on x86_64-linux

2 packages marked as broken and skipped:
  • khoj
  • khoj.dist
29 packages failed to build:
  • gnome-frog
  • gscan2pdf
  • gscan2pdf.man
  • libsForQt5.pix
  • ocrmypdf (python311Packages.ocrmypdf)
  • ocrmypdf.dist (python311Packages.ocrmypdf.dist)
  • paperless-ngx
  • perl536Packages.ImageOCRTesseract
  • perl536Packages.ImageOCRTesseract.devdoc
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.ocrmypdf
  • python312Packages.ocrmypdf.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
53 packages built:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • browsr
  • browsr.dist
  • cat9-wrapped
  • durden-wrapped
  • evolution
  • evolution-ews
  • evolutionWithPlugins
  • gImageReader
  • invoice2data
  • invoice2data.dist
  • kdePackages.skanpage
  • kdePackages.skanpage.debug
  • kdePackages.skanpage.dev
  • kdePackages.skanpage.devtools
  • libsForQt5.mauikit-imagetools
  • manga-cli
  • mcomix
  • mcomix.dist
  • obs-studio-plugins.advanced-scene-switcher
  • pdfsandwich
  • pipeworld-wrapped
  • prio-wrapped
  • private-gpt
  • private-gpt.dist
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • spamassassin
  • spamassassin.devdoc
  • termpdfpy
  • termpdfpy.dist
  • tesseract
  • textsnatcher
  • vimPlugins.openscad-nvim
  • xarcan
  • zathura

Maintainer pings

cc @schuelermine for testing.

@r-ryantm changed the title from "tesseract: 5.3.4 -> 5.4.0" to "tesseract: 5.3.4 -> 5.4.1" on Jun 13, 2024

@r-ryantm (Contributor, Author) commented:

Automatic update generated by nixpkgs-update tools. This update was made based on information from https://github.com/tesseract-ocr/tesseract/releases.

meta.description for tesseract is: OCR engine

meta.homepage for tesseract is: https://github.com/tesseract-ocr/tesseract

Updates performed
  • Version update
To inspect upstream changes
Impact

Checks done


  • built on NixOS
  • The tests defined in passthru.tests, if any, passed
  • found 5.4.1 with grep in /nix/store/v1ch5fknvgasmfzlry870h0m060isafg-tesseract-5.4.1
  • found 5.4.1 in filename of file in /nix/store/v1ch5fknvgasmfzlry870h0m060isafg-tesseract-5.4.1

Rebuild report (if merged into master)
59 total rebuild path(s)

59 package rebuild(s)

First fifty rebuilds by attrpath

almanah
arcan
arcan-all-wrapped
arcan-wrapped
browsr
cat9-wrapped
durden-wrapped
evolution
evolution-ews
evolutionWithPlugins
gImageReader
gnome-frog
gscan2pdf
invoice2data
k2pdfopt
kdePackages.skanpage
khoj
libsForQt5.mauikit-imagetools
libsForQt5.pix
manga-cli
mcomix
obs-studio-plugins.advanced-scene-switcher
ocrmypdf
paperless-ngx
pdfsandwich
perl536Packages.ImageOCRTesseract
perl538Packages.ImageOCRTesseract
pipeworld-wrapped
plasma5Packages.mauikit-imagetools
plasma5Packages.pix
prio-wrapped
private-gpt
python311Packages.layoutparser
python311Packages.llama-index
python311Packages.llama-index-readers-file
python311Packages.llama-index-readers-s3
python311Packages.ocrmypdf
python311Packages.pdf2docx
python311Packages.private-gpt
python311Packages.pymupdf
python311Packages.pytesseract
python311Packages.pytikz-allefeld
python311Packages.videocr
python312Packages.layoutparser
python312Packages.ocrmypdf
python312Packages.pdf2docx
python312Packages.pymupdf
python312Packages.pytesseract
python312Packages.pytikz-allefeld
Instructions to test this update

Either download from Cachix:

nix-store -r /nix/store/v1ch5fknvgasmfzlry870h0m060isafg-tesseract-5.4.1 \
  --option binary-caches 'https://cache.nixos.org/ https://nix-community.cachix.org/' \
  --option trusted-public-keys '
  nix-community.cachix.org-1:mB9FSh9qf2dCimDSUo8Zy7bkq5CX+/rkCWyvRCYg3Fs=
  cache.nixos.org-1:6NCHdD59X431o0gWypbMrAURkbJ16ZPMQFGspcDShjY=
  '

(The Cachix cache is only trusted for this store-path realization.)
For the Cachix download to work, your user must be in the trusted-users list or you can use sudo since root is effectively trusted.

Or, build yourself:

nix-build -A tesseract https://github.com/r-ryantm/nixpkgs/archive/fe48a643d5a8b85232dcbc3347c5f7053c72f2c5.tar.gz

Or:

nix build github:r-ryantm/nixpkgs/fe48a643d5a8b85232dcbc3347c5f7053c72f2c5#tesseract

After you've downloaded or built it, look at the files and, if there are any binaries, run them:

ls -la /nix/store/v1ch5fknvgasmfzlry870h0m060isafg-tesseract-5.4.1
ls -la /nix/store/v1ch5fknvgasmfzlry870h0m060isafg-tesseract-5.4.1/bin
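
Beyond listing the files, a quick way to "run the binaries" is to ask the built tesseract for its version and compare it with the version in the PR title. The following is only a sketch: `check_version` is a hypothetical helper written for this illustration (it is not part of nixpkgs-update), and it assumes the store path has already been realized or built with one of the commands above.

```shell
#!/bin/sh
# Store path copied from the 5.4.1 report above.
out=/nix/store/v1ch5fknvgasmfzlry870h0m060isafg-tesseract-5.4.1

# check_version is a hypothetical helper, not part of nixpkgs-update.
# Returns 0 if "$1/bin/tesseract --version" mentions version "$2",
# 1 on a version mismatch, and 2 if the binary is missing.
check_version() {
  bin="$1/bin/tesseract"
  if [ ! -x "$bin" ]; then
    echo "not found: $bin (realize or build the store path first)" >&2
    return 2
  fi
  # Merge stderr into stdout in case the version banner is printed there.
  if "$bin" --version 2>&1 | grep -qF "tesseract $2"; then
    echo "ok: $bin reports $2"
  else
    echo "mismatch: $bin does not report $2" >&2
    return 1
  fi
}

check_version "$out" 5.4.1 || echo "check failed with status $?"
```

If the store path is not present on the machine, the helper reports that instead of failing silently, so it is safe to run before or after the download step.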


Pre-merge build results

We have automatically built all packages that will get rebuilt due to
this change.

This gives evidence on whether the upgrade will break dependent packages.
Note sometimes packages show up as failed to build independent of the
change, simply because they are already broken on the target branch.

Result of nixpkgs-review --extra-nixpkgs-config '{ allowInsecurePredicate = x: true; }' run on x86_64-linux

2 packages marked as broken and skipped:
  • khoj
  • khoj.dist
12 packages failed to build:
  • gscan2pdf
  • gscan2pdf.man
  • k2pdfopt
  • libsForQt5.pix
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
71 packages built:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • browsr
  • browsr.dist
  • cat9-wrapped
  • durden-wrapped
  • evolution
  • evolution-ews
  • evolutionWithPlugins
  • gImageReader
  • gnome-frog
  • invoice2data
  • invoice2data.dist
  • kdePackages.skanpage
  • kdePackages.skanpage.debug
  • kdePackages.skanpage.dev
  • kdePackages.skanpage.devtools
  • libsForQt5.mauikit-imagetools
  • manga-cli
  • mcomix
  • mcomix.dist
  • obs-studio-plugins.advanced-scene-switcher
  • ocrmypdf (python311Packages.ocrmypdf)
  • ocrmypdf.dist (python311Packages.ocrmypdf.dist)
  • paperless-ngx
  • pdfsandwich
  • perl536Packages.ImageOCRTesseract
  • perl536Packages.ImageOCRTesseract.devdoc
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • private-gpt
  • private-gpt.dist
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.ocrmypdf
  • python312Packages.ocrmypdf.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
  • spamassassin
  • spamassassin.devdoc
  • termpdfpy
  • termpdfpy.dist
  • tesseract
  • textsnatcher
  • vimPlugins.openscad-nvim
  • xarcan
  • zathura
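
To see how the failures changed between the 5.4.0 and 5.4.1 reports, the two "failed to build" lists can be compared with `comm`. This is a sketch only: the attrpaths are copied from the two reports in this PR with output suffixes (`.dist`, `.man`, `.devdoc`) dropped for brevity, and the file and variable names are made up for the illustration. The two runs were made against different nixpkgs commits, so a change in status is not necessarily caused by tesseract itself.

```shell
#!/bin/sh
# Compare the failed attrpaths of the two nixpkgs-review runs in this PR.
tmp=$(mktemp -d)
trap 'rm -rf "$tmp"' EXIT

# Failures from the 5.4.0 report (output suffixes dropped).
cat > "$tmp/failed-5.4.0" <<'EOF'
gnome-frog
gscan2pdf
libsForQt5.pix
ocrmypdf
paperless-ngx
perl536Packages.ImageOCRTesseract
perl538Packages.ImageOCRTesseract
python311Packages.layoutparser
python311Packages.pdf2docx
python311Packages.pytesseract
python311Packages.videocr
python312Packages.layoutparser
python312Packages.ocrmypdf
python312Packages.pdf2docx
python312Packages.pytesseract
python312Packages.videocr
EOF

# Failures from the 5.4.1 report (output suffixes dropped).
cat > "$tmp/failed-5.4.1" <<'EOF'
gscan2pdf
k2pdfopt
libsForQt5.pix
python311Packages.layoutparser
python311Packages.pdf2docx
python312Packages.layoutparser
python312Packages.pdf2docx
EOF

# comm needs both inputs sorted under the same collation order.
LC_ALL=C sort -u "$tmp/failed-5.4.0" > "$tmp/old"
LC_ALL=C sort -u "$tmp/failed-5.4.1" > "$tmp/new"

newly_failing=$(LC_ALL=C comm -13 "$tmp/old" "$tmp/new")      # only in 5.4.1
still_failing=$(LC_ALL=C comm -12 "$tmp/old" "$tmp/new")      # in both runs
no_longer_failing=$(LC_ALL=C comm -23 "$tmp/old" "$tmp/new")  # only in 5.4.0

printf 'Newly failing:\n%s\n\n' "$newly_failing"
printf 'Still failing:\n%s\n\n' "$still_failing"
printf 'No longer failing:\n%s\n' "$no_longer_failing"
```

With the lists as given in the two reports, k2pdfopt is the only attrpath that fails in the 5.4.1 run but not in the 5.4.0 run.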

Maintainer pings

cc @schuelermine for testing.

@stweil (Contributor) left a comment

The links in the first PR message still refer to release 5.4.0. Maybe it would be better to close outdated pull requests and to create a new one instead of force pushing to existing pull requests without updating the links.

@wegank added the label "12.approvals: 1" (This PR was reviewed and approved by one reputable person) on Jun 20, 2024

@SuperSandro2000 (Member) commented:

@bosu @danielfullmer can you please provide a patch for k2pdfopt?

@wegank marked this pull request as draft on July 7, 2024, 21:49

@sivizius (Contributor) commented:

Obsoleted by #353902?

Labels: 10.rebuild-darwin: 11-100, 10.rebuild-linux: 11-100, 12.approvals: 1 (This PR was reviewed and approved by one reputable person)
Projects: None yet
5 participants