Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docling crashes when using EasyOCR on Windows 11 #318

Open
cau-git opened this issue Nov 12, 2024 · 2 comments
Open

Docling crashes when using EasyOCR on Windows 11 #318

cau-git opened this issue Nov 12, 2024 · 2 comments
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@cau-git
Copy link
Contributor

cau-git commented Nov 12, 2024

Bug

On a Windows 11 installation using ARM64 CPU (UTM virtual machine on macOS host), docling silently crashes without generating output when a document is converted with OCR enabled (using default EasyOCR).

❓ Everyone in the community, can you please help us reproducing this on a Windows x86 (64bit) machine and report if this is broken? 🙏
⚠️ Note: This is observed with the native Windows docling package, not in WSL.

Steps to reproduce

  1. Install windows-native Python (3.12) through Microsoft Store
  2. Set up a fresh virtualenv on a Windows system with python 3.12 and activate
    python -m venv venv
    venv\Scripts\activate.bat
    
  3. Install docling through pip install docling
  4. Run docling on a command prompt with: docling https://arxiv.org/pdf/2206.01062.pdf, observe silent crash and no output.
  5. Couter-check: Running docling https://arxiv.org/pdf/2206.01062.pdf --no-ocr should work fine.

Docling version

Docling version: 2.5.0
Docling Core version: 2.3.2
Docling IBM Models version: 2.0.3
Docling Parse version: 2.0.3

Python version

Python 3.12.7
@cau-git cau-git added bug Something isn't working help wanted Extra attention is needed labels Nov 12, 2024
@maxmnemonic
Copy link
Contributor

I checked on Windows 11 Pro 64bit x86 bare metal machine.
Both commands (with OCR and without OCR) worked fine, and produced MD files

@jokus-pokus
Copy link

jokus-pokus commented Nov 14, 2024

Hey @cau-git have you checked if your machine runs out of resources maybe?
I had issues with vision models on an arm machine as well and never found the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

3 participants