Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OCR模型应该下载哪个文件 ,我在文档里没有找到 #197

Open
lemsn opened this issue Dec 25, 2024 · 1 comment
Open

OCR模型应该下载哪个文件 ,我在文档里没有找到 #197

lemsn opened this issue Dec 25, 2024 · 1 comment

Comments

@lemsn
Copy link

lemsn commented Dec 25, 2024

OCR模型应该下载哪个文件 ,我在文档里没有找到, 只有:
det_model_dir: models/OCR/PaddleOCR/det/ch_PP-OCRv4_det
rec_model_dir: models/OCR/PaddleOCR/rec/ch_PP-OCRv4_rec

ModelScope也没有找到对应的模型,所以好像也用不了snapshot_download

官方下载是这两个:
ch_PP-OCRv4_rec_infer.tar
ch_PP-OCRv4_det_infer.tar
解压后,是各有3个文件, 是解压到放到这里吗?
models/OCR/PaddleOCR/rec/ch_PP-OCRv4_rec

请问我应该如何做?

@wufan-tb
Copy link
Collaborator

wufan-tb commented Jan 3, 2025

huggingface仓库是有模型的,https://huggingface.co/opendatalab/PDF-Extract-Kit-1.0/tree/main/models/OCR/PaddleOCR
另外配置文件里设置好det_model_dir和rec_model_dir,运行的时候应该会自动检查该路径是否有模型,没有的话会自动下载

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants