Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tesseract: 5.3.4 -> 5.5.0 #353902

Open
wants to merge 4 commits into
base: master
Choose a base branch
from
Open

Conversation

PatrickDaG
Copy link
Contributor

tesseract-ocr/tesseract@5.3.4...5.4.1

Added update script and myself as maintainer.

Things done

  • Built on platform(s)
    • x86_64-linux
    • aarch64-linux
    • x86_64-darwin
    • aarch64-darwin
  • For non-Linux: Is sandboxing enabled in nix.conf? (See Nix manual)
    • sandbox = relaxed
    • sandbox = true
  • Tested, as applicable:
  • Tested compilation of all packages that depend on this change using nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD". Note: all changes have to be committed, also see nixpkgs-review usage
  • Tested basic functionality of all binary files (usually in ./result/bin/)
  • 24.11 Release Notes (or backporting 23.11 and 24.05 Release notes)
    • (Package updates) Added a release notes entry if the change is major or breaking
    • (Module updates) Added a release notes entry if the change is significant
    • (Module addition) Added a release notes entry if adding a new NixOS module
  • Fits CONTRIBUTING.md.

Add a 👍 reaction to pull requests you find important.

@ofborg ofborg bot added 11.by: package-maintainer This PR was created by the maintainer of the package it changes 10.rebuild-darwin: 11-100 10.rebuild-linux: 11-100 labels Nov 6, 2024
@PatrickDaG PatrickDaG force-pushed the update-tesseract branch 2 times, most recently from 36efdf4 to 23db679 Compare November 15, 2024 17:03
@khaneliman
Copy link
Contributor

nixpkgs-review result

Generated using nixpkgs-review.

Command: nixpkgs-review pr 353902


x86_64-linux

⏩ 2 packages marked as broken and skipped:
  • khoj
  • khoj.dist
❌ 7 packages failed to build:
  • almanah
  • evolution
  • evolution-ews
  • evolutionWithPlugins
  • k2pdfopt
  • spamassassin
  • spamassassin.devdoc
✅ 88 packages built:
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • arcan.dev
  • arcan.lib
  • arcan.man
  • browsr
  • browsr.dist
  • cat9-wrapped
  • ccextractor
  • durden-wrapped
  • gImageReader
  • gnome-frog
  • gscan2pdf
  • gscan2pdf.man
  • invoice2data
  • invoice2data.dist
  • kdePackages.skanpage
  • kdePackages.skanpage.debug
  • kdePackages.skanpage.dev
  • kdePackages.skanpage.devtools
  • libsForQt5.mauikit-imagetools (plasma5Packages.mauikit-imagetools)
  • libsForQt5.pix (plasma5Packages.pix)
  • manga-cli
  • mcomix
  • mcomix.dist
  • ocrmypdf (python312Packages.ocrmypdf)
  • ocrmypdf.dist (python312Packages.ocrmypdf.dist)
  • paperless-ngx
  • pdfsandwich
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • perl540Packages.ImageOCRTesseract
  • perl540Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • private-gpt
  • private-gpt.dist
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.ocrmypdf
  • python311Packages.ocrmypdf.dist
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.llama-index
  • python312Packages.llama-index-readers-file
  • python312Packages.llama-index-readers-file.dist
  • python312Packages.llama-index-readers-s3
  • python312Packages.llama-index-readers-s3.dist
  • python312Packages.llama-index.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.private-gpt
  • python312Packages.private-gpt.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
  • termpdfpy
  • termpdfpy.dist
  • tesseract (tesseract5)
  • textsnatcher
  • tika
  • vimPlugins.openscad-nvim
  • xarcan
  • zathura

aarch64-linux

⏩ 12 packages marked as broken and skipped:
  • khoj
  • khoj.dist
  • private-gpt
  • private-gpt.dist
  • python311Packages.llama-index
  • python311Packages.llama-index.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python312Packages.llama-index
  • python312Packages.llama-index.dist
  • python312Packages.private-gpt
  • python312Packages.private-gpt.dist
❌ 1 package failed to build:
  • k2pdfopt
✅ 84 packages built:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • arcan.dev
  • arcan.lib
  • arcan.man
  • browsr
  • browsr.dist
  • cat9-wrapped
  • ccextractor
  • durden-wrapped
  • evolution
  • evolution-ews
  • evolutionWithPlugins
  • gImageReader
  • gnome-frog
  • gscan2pdf
  • gscan2pdf.man
  • invoice2data
  • invoice2data.dist
  • kdePackages.skanpage
  • kdePackages.skanpage.debug
  • kdePackages.skanpage.dev
  • kdePackages.skanpage.devtools
  • libsForQt5.mauikit-imagetools (plasma5Packages.mauikit-imagetools)
  • libsForQt5.pix (plasma5Packages.pix)
  • manga-cli
  • mcomix
  • mcomix.dist
  • ocrmypdf (python312Packages.ocrmypdf)
  • ocrmypdf.dist (python312Packages.ocrmypdf.dist)
  • paperless-ngx
  • pdfsandwich
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • perl540Packages.ImageOCRTesseract
  • perl540Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.ocrmypdf
  • python311Packages.ocrmypdf.dist
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.llama-index-readers-file
  • python312Packages.llama-index-readers-file.dist
  • python312Packages.llama-index-readers-s3
  • python312Packages.llama-index-readers-s3.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
  • spamassassin
  • spamassassin.devdoc
  • termpdfpy
  • termpdfpy.dist
  • tesseract (tesseract5)
  • textsnatcher
  • tika
  • vimPlugins.openscad-nvim
  • xarcan
  • zathura

x86_64-darwin

⏩ 35 packages marked as broken and skipped:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • arcan.dev
  • arcan.lib
  • arcan.man
  • cat9-wrapped
  • durden-wrapped
  • evolutionWithPlugins
  • gscan2pdf
  • gscan2pdf.man
  • khoj
  • khoj.dist
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • perl540Packages.ImageOCRTesseract
  • perl540Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • private-gpt
  • private-gpt.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python312Packages.llama-index
  • python312Packages.llama-index-readers-file
  • python312Packages.llama-index-readers-file.dist
  • python312Packages.llama-index-readers-s3
  • python312Packages.llama-index-readers-s3.dist
  • python312Packages.llama-index.dist
  • python312Packages.private-gpt
  • python312Packages.private-gpt.dist
  • spamassassin
  • spamassassin.devdoc
  • xarcan
❌ 3 packages failed to build:
  • paperless-ngx
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
✅ 45 packages built:
  • browsr
  • browsr.dist
  • ccextractor
  • invoice2data
  • invoice2data.dist
  • manga-cli
  • mcomix
  • mcomix.dist
  • ocrmypdf (python312Packages.ocrmypdf)
  • ocrmypdf.dist (python312Packages.ocrmypdf.dist)
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.ocrmypdf
  • python311Packages.ocrmypdf.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
  • termpdfpy
  • termpdfpy.dist
  • tesseract (tesseract5)
  • vimPlugins.openscad-nvim
  • zathura

aarch64-darwin

⏩ 41 packages marked as broken and skipped:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • arcan.dev
  • arcan.lib
  • arcan.man
  • cat9-wrapped
  • durden-wrapped
  • evolutionWithPlugins
  • gscan2pdf
  • gscan2pdf.man
  • khoj
  • khoj.dist
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • perl540Packages.ImageOCRTesseract
  • perl540Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • private-gpt
  • private-gpt.dist
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python312Packages.llama-index
  • python312Packages.llama-index-readers-file
  • python312Packages.llama-index-readers-file.dist
  • python312Packages.llama-index-readers-s3
  • python312Packages.llama-index-readers-s3.dist
  • python312Packages.llama-index.dist
  • python312Packages.private-gpt
  • python312Packages.private-gpt.dist
  • spamassassin
  • spamassassin.devdoc
  • xarcan
❌ 19 packages failed to build:
  • browsr
  • browsr.dist
  • mcomix
  • mcomix.dist
  • paperless-ngx
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • termpdfpy
  • termpdfpy.dist
✅ 23 packages built:
  • ccextractor
  • invoice2data
  • invoice2data.dist
  • manga-cli
  • ocrmypdf (python312Packages.ocrmypdf)
  • ocrmypdf.dist (python312Packages.ocrmypdf.dist)
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.ocrmypdf
  • python311Packages.ocrmypdf.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
  • tesseract (tesseract5)
  • vimPlugins.openscad-nvim
  • zathura

@PatrickDaG PatrickDaG changed the title tesseract: 5.3.4 -> 5.4.1 tesseract: 5.3.4 -> 5.5.0 Nov 20, 2024
@PatrickDaG
Copy link
Contributor Author

PatrickDaG commented Nov 20, 2024

nixpkgs-review result

Generated using nixpkgs-review.

Command: nixpkgs-review pr 353902


x86_64-linux

⏩ 2 packages marked as broken and skipped:
  • khoj
  • khoj.dist
❌ 24 packages failed to build:
  • gnome-frog
  • k2pdfopt
  • private-gpt
  • private-gpt.dist
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.llama-index
  • python312Packages.llama-index-readers-file
  • python312Packages.llama-index-readers-file.dist
  • python312Packages.llama-index-readers-s3
  • python312Packages.llama-index-readers-s3.dist
  • python312Packages.llama-index.dist
  • python312Packages.private-gpt
  • python312Packages.private-gpt.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
✅ 71 packages built:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • arcan.dev
  • arcan.lib
  • arcan.man
  • browsr
  • browsr.dist
  • cat9-wrapped
  • ccextractor
  • durden-wrapped
  • evolution
  • evolution-ews
  • evolutionWithPlugins
  • gImageReader
  • gscan2pdf
  • gscan2pdf.man
  • invoice2data
  • invoice2data.dist
  • kdePackages.skanpage
  • kdePackages.skanpage.debug
  • kdePackages.skanpage.dev
  • kdePackages.skanpage.devtools
  • libsForQt5.mauikit-imagetools
  • libsForQt5.pix
  • manga-cli
  • mcomix
  • mcomix.dist
  • ocrmypdf (python312Packages.ocrmypdf)
  • ocrmypdf.dist (python312Packages.ocrmypdf.dist)
  • paperless-ngx
  • pdfsandwich
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • perl540Packages.ImageOCRTesseract
  • perl540Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.ocrmypdf
  • python311Packages.ocrmypdf.dist
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • spamassassin
  • spamassassin.devdoc
  • termpdfpy
  • termpdfpy.dist
  • tesseract
  • textsnatcher
  • tika
  • vimPlugins.openscad-nvim
  • xarcan
  • zathura

Quite a lot of the failures are related to pytesseract, where the bug is upstream. Hopefully that gets merged soon, then I can see what's left and try and fix those.

@PatrickDaG
Copy link
Contributor Author

Since pytesseract will not be fixed upstream in the near future I've disabled the failing tests for now.

@PatrickDaG
Copy link
Contributor Author

nixpkgs-review result

Generated using nixpkgs-review.

Command: nixpkgs-review pr 353902


x86_64-linux

⏩ 2 packages marked as broken and skipped:
  • khoj
  • khoj.dist
❌ 3 packages failed to build:
  • gscan2pdf
  • gscan2pdf.man
  • k2pdfopt
✅ 92 packages built:
  • almanah
  • arcan
  • arcan-all-wrapped
  • arcan-wrapped
  • arcan.dev
  • arcan.lib
  • arcan.man
  • browsr
  • browsr.dist
  • cat9-wrapped
  • ccextractor
  • durden-wrapped
  • evolution
  • evolution-ews
  • evolutionWithPlugins
  • gImageReader
  • gnome-frog
  • invoice2data
  • invoice2data.dist
  • kdePackages.skanpage
  • kdePackages.skanpage.debug
  • kdePackages.skanpage.dev
  • kdePackages.skanpage.devtools
  • libsForQt5.mauikit-imagetools
  • libsForQt5.pix
  • manga-cli
  • mcomix
  • mcomix.dist
  • ocrmypdf (python312Packages.ocrmypdf)
  • ocrmypdf.dist (python312Packages.ocrmypdf.dist)
  • paperless-ngx
  • pdfsandwich
  • perl538Packages.ImageOCRTesseract
  • perl538Packages.ImageOCRTesseract.devdoc
  • perl540Packages.ImageOCRTesseract
  • perl540Packages.ImageOCRTesseract.devdoc
  • pipeworld-wrapped
  • prio-wrapped
  • private-gpt
  • private-gpt.dist
  • python311Packages.layoutparser
  • python311Packages.layoutparser.dist
  • python311Packages.llama-index
  • python311Packages.llama-index-readers-file
  • python311Packages.llama-index-readers-file.dist
  • python311Packages.llama-index-readers-s3
  • python311Packages.llama-index-readers-s3.dist
  • python311Packages.llama-index.dist
  • python311Packages.ocrmypdf
  • python311Packages.ocrmypdf.dist
  • python311Packages.pdf2docx
  • python311Packages.pdf2docx.dist
  • python311Packages.private-gpt
  • python311Packages.private-gpt.dist
  • python311Packages.pymupdf
  • python311Packages.pymupdf.dist
  • python311Packages.pytesseract
  • python311Packages.pytesseract.dist
  • python311Packages.pytikz-allefeld
  • python311Packages.pytikz-allefeld.dist
  • python311Packages.videocr
  • python311Packages.videocr.dist
  • python312Packages.layoutparser
  • python312Packages.layoutparser.dist
  • python312Packages.llama-index
  • python312Packages.llama-index-readers-file
  • python312Packages.llama-index-readers-file.dist
  • python312Packages.llama-index-readers-s3
  • python312Packages.llama-index-readers-s3.dist
  • python312Packages.llama-index.dist
  • python312Packages.pdf2docx
  • python312Packages.pdf2docx.dist
  • python312Packages.private-gpt
  • python312Packages.private-gpt.dist
  • python312Packages.pymupdf
  • python312Packages.pymupdf.dist
  • python312Packages.pytesseract
  • python312Packages.pytesseract.dist
  • python312Packages.pytikz-allefeld
  • python312Packages.pytikz-allefeld.dist
  • python312Packages.videocr
  • python312Packages.videocr.dist
  • spamassassin
  • spamassassin.devdoc
  • termpdfpy
  • termpdfpy.dist
  • tesseract
  • textsnatcher
  • tika
  • vimPlugins.openscad-nvim
  • xarcan
  • zathura

@PatrickDaG
Copy link
Contributor Author

waiting on #358158 and #357698

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants