
Repository with all models in ONNX format #786

Open
Kromtar opened this issue Jul 17, 2022 · 19 comments


Kromtar commented Jul 17, 2022

In this Drive folder you can find all EasyOCR models in ONNX format covering all currently available languages. Look at the text file in the folder to see which one corresponds to the language you are using.

ONNX is a framework-agnostic, standardized format for storing models. Models in ONNX format can be imported and used by several runtimes and frameworks, including PyTorch, TensorFlow and OpenVINO. The ONNX team also provides its own runtime, ONNX Runtime, which supports development across several programming languages, operating systems and acceleration hardware. With the EasyOCR models available in ONNX, it becomes possible to cover new scenarios where using Python and/or PyTorch is not practical.
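For example, the exported models can be run from plain Python with ONNX Runtime and without any PyTorch dependency. A minimal sketch (the file name is illustrative, and the actual tensor names should be checked with get_inputs()/get_outputs()):

# Minimal sketch: run one of the exported models with ONNX Runtime.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("detection_model.onnx", providers=["CPUExecutionProvider"])
print([i.name for i in session.get_inputs()], [o.name for o in session.get_outputs()])

# Dummy pre-processed NCHW float32 image batch (1 x 3 x H x W).
image = np.random.rand(1, 3, 480, 640).astype(np.float32)
outputs = session.run(None, {session.get_inputs()[0].name: image})
print(outputs[0].shape)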

In my use case, I am using the EasyOCR models with ONNX Runtime to develop C++ applications. This makes it much easier to bring EasyOCR functionality into production.

In this issue you can find a general guide on how I managed to modify EasyOCR to export the models, originally implemented in PyTorch, to ONNX. Note that the export process explained in the guide requires CUDA acceleration. Since not everyone has access to such hardware, I have decided to share a copy of the already-exported models.
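For reference, the core of the export is a call to torch.onnx.export with a dynamic width axis for the recognition model. A simplified sketch, where the stand-in module, the opset and the axis names are placeholders rather than the exact code from the guide:

# Simplified sketch of exporting a recognition module to ONNX.
import torch

# Toy stand-in so this snippet runs; replace with the loaded EasyOCR recognizer
# (some recognizer variants also take a text tensor, in which case pass a tuple of inputs).
recognition_net = torch.nn.Conv2d(1, 97, kernel_size=3, padding=1)

dummy_image = torch.randn(1, 1, 64, 320)  # (batch, channel, height, width)

torch.onnx.export(
    recognition_net,
    dummy_image,
    "recognition_model.onnx",
    export_params=True,
    opset_version=12,
    input_names=["input1"],
    output_names=["output"],
    dynamic_axes={"input1": {0: "batch", 3: "width"},
                  "output": {0: "batch", 1: "seq_len"}},
)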

The models that I am making available are a first version; I may optimize the exported models in the coming weeks.


RaiAmanRai commented Jul 27, 2022

Hi @Kromtar, I am trying to convert these ONNX models to TensorRT format using NVIDIA TensorRT containers.

Everything works fine with the detection model, but when I export the recognition model, it fails:

Error[4]: [graphShapeAnalyzer.cpp::processCheck::581] Error Code 4: Internal Error (Conv_16: spatial dimension of convolution output cannot be negative (build-time output dimension of axis 3 is -1))

However, I suspect this is limited to the smaller models of approx. 15 MB. I didn't try every one of them, but I did manage to convert 10_recognition_model.onnx to TensorRT format without any problem.

Do you suspect anything that is going wrong?

Also, could you please clarify whether you are using two different model architectures?
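A workaround I still want to try (not verified yet) is either giving trtexec an optimization profile via --minShapes/--optShapes/--maxShapes, or pinning the dynamic width axis in the ONNX file before building the engine, roughly like this:

# Tentative workaround: pin the recognition model's dynamic width axis to a
# fixed value before handing it to TensorRT. The axis index (3) and the
# width (320) are guesses on my side.
import onnx

model = onnx.load("recognition_model.onnx")
width_dim = model.graph.input[0].type.tensor_type.shape.dim[3]  # N, C, H, W
width_dim.dim_value = 320  # replaces the symbolic width with a fixed size

onnx.checker.check_model(model)
onnx.save(model, "recognition_model_w320.onnx")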


DLumi commented Nov 8, 2022

Hi @Kromtar.
Any luck in reworking the original codebase?
I tried to mess around with your ONNX models, but it didn't get me far without proper pre/postprocessing steps. I tried to replicate them in numpy, but since the original code is rather messy and tangled, I wasn't able to get this thing working properly as I've probably missed something.
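For what it's worth, this is roughly how far I got with a numpy/OpenCV version of the recognition pre-processing; the resize and normalization details are my reading of the EasyOCR code and may well be where I went wrong:

# Rough re-implementation of the recognition pre-processing: grayscale ->
# resize to height 64 keeping aspect ratio -> scale to [-1, 1] -> (1, 1, 64, W).
# Treat the exact resize/padding behaviour as an assumption to verify.
import cv2
import numpy as np

def preprocess_crop(bgr_crop: np.ndarray, target_height: int = 64) -> np.ndarray:
    gray = cv2.cvtColor(bgr_crop, cv2.COLOR_BGR2GRAY)
    scale = target_height / gray.shape[0]
    new_width = max(1, int(round(gray.shape[1] * scale)))
    resized = cv2.resize(gray, (new_width, target_height), interpolation=cv2.INTER_CUBIC)
    normalized = (resized.astype(np.float32) / 255.0 - 0.5) / 0.5  # matches ToTensor + Normalize(0.5, 0.5)
    return normalized[np.newaxis, np.newaxis, :, :]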


alib022 commented Nov 23, 2022

Hi @Kromtar
I managed to load your ONNX models into the Triton Inference Server. I reworked the whole code and everything runs end to end. However, the results are not right!
For a speed limit of 35 I am getting:

[([[6, 4], [83, 4], [83, 59], [6, 59]], 'ISAP7', 0.07690704349016575),
([[6, 46], [82, 46], [82, 101], [6, 101]], '351', 0.9431633933333133)]

whereas it is supposed to be 35.

The result from EasyOCR itself:
([[6, 4], [81, 4], [81, 58], [6, 58]], '[serp]', 0.024114014123917345) ([[8, 48], [76, 48], [76, 98], [8, 98]], '35', 0.9999992413568111)

And the bounding boxes are all correct compared to running the same image with EasyOCR.
However, it seems like this implementation has a different configuration or something else that I can't figure out!
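For anyone hitting the same mismatch, a minimal greedy CTC decoder over the (1, T, 97) recognition output would look roughly like this, assuming index 0 is the blank and the remaining 96 classes map onto the language's character list in the order EasyOCR uses; a wrong or re-ordered character list would garble the text in exactly this way:

# Minimal greedy CTC decoder for a (T, 97) matrix of per-timestep scores.
import numpy as np

def ctc_greedy_decode(scores: np.ndarray, charset: str) -> str:
    # Assumes class 0 is the CTC blank and charset[i - 1] is the character for class i.
    best_path = scores.argmax(axis=1)
    decoded = []
    previous = -1
    for class_id in best_path:
        if class_id != previous and class_id != 0:
            decoded.append(charset[class_id - 1])
        previous = class_id
    return "".join(decoded)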

@nissansz

How do I use these ONNX models? Is it possible to use them without EasyOCR?
Is any character dictionary needed?


light42 commented Feb 13, 2023

@Kromtar not only has conversion to ONNX been discussed many times over the past two years, EasyOCR already provides a script to export the model to ONNX format.

What's the difference between your approach and the one provided officially by JaidedAI?


light42 commented Feb 17, 2023

This is what ChatGPT spews out when I asked it to also consider the dict and configs:

import json

import onnx
import torch

# Assumes `model`, `input_data`, `dictionary` and `config` are already defined.

# Define the output ONNX file path
output_path = 'model.onnx'

# Convert the PyTorch model to ONNX format
torch.onnx.export(model, input_data, output_path, export_params=True, opset_version=11, input_names=['input'], output_names=['output'], dynamic_axes={'input': {0: 'batch_size'}, 'output': {0: 'batch_size'}})

# Keep the batch dimension symbolic in the exported graph
model_proto = onnx.load(output_path)
model_proto.graph.input[0].type.tensor_type.shape.dim[0].dim_param = 'batch_size'
model_proto.graph.output[0].type.tensor_type.shape.dim[0].dim_param = 'batch_size'

# Add the dictionary and configuration as string tensors (ONNX string tensors take bytes)
dict_proto = onnx.helper.make_tensor("dictionary", onnx.TensorProto.STRING, [len(dictionary)], [c.encode("utf-8") for c in dictionary])
config_proto = onnx.helper.make_tensor("config", onnx.TensorProto.STRING, [1], [json.dumps(config).encode("utf-8")])
model_proto.graph.initializer.extend([dict_proto, config_proto])

# Save the updated model
onnx.save(model_proto, output_path)

I haven't checked the validity of this script yet, but I hope it gives you some inspiration, @Kromtar.


kadmor commented Mar 2, 2023

Hi @Kromtar, I managed to load your ONNX models into the Triton Inference Server. I reworked the whole code and everything runs end to end. However, the results are not right! For a speed limit of 35 I am getting

[([[6, 4], [83, 4], [83, 59], [6, 59]], 'ISAP7', 0.07690704349016575), ([[6, 46], [82, 46], [82, 101], [6, 101]], '351', 0.9431633933333133)]

whereas it is supposed to be 35.

The result from EasyOCR itself: ([[6, 4], [81, 4], [81, 58], [6, 58]], '[serp]', 0.024114014123917345) ([[8, 48], [76, 48], [76, 98], [8, 98]], '35', 0.9999992413568111)

And the bounding boxes are all correct compared to running the same image with EasyOCR. However, it seems like this implementation has a different configuration or something else that I can't figure out!

Hello @alib022, did you manage to solve this problem? Can you show your config.pbtxt for the detector.onnx and recognition.onnx models?


alib022 commented Mar 2, 2023

Right now I have these in the dev environment:

name: "ocr-detection-onnx"
platform: "onnxruntime_onnx"
max_batch_size : 0
input [
{
name: "input1"
data_type: TYPE_FP32
format: FORMAT_NCHW
dims: [ 3, -1, -1 ]
reshape { shape: [ 1, 3, -1, -1 ] }
}
]

output [
{
name: "output"
data_type: TYPE_FP32
dims: [1,-1,-1,2]

}
]
instance_group [
{
count: 1
kind: KIND_GPU

}
]
name: "ocr-recognition-onnx"
platform: "onnxruntime_onnx"
max_batch_size : 0
input [
{
name: "input1"
data_type: TYPE_FP32
dims: [1,1,64,-1]

}
]
output [
{
name: "output"
data_type: TYPE_FP32
dims: [1,-1,97]

}
]

instance_group [
{
count: 1
kind: KIND_GPU

}
]


kadmor commented Mar 3, 2023


@alib022 Thank you very much!


nissansz commented Apr 4, 2023

Are there any newer models in ONNX format?


masoudMZB commented Jun 28, 2023

@light42 you said:

@Kromtar not only has conversion to ONNX been discussed many times over the past two years, EasyOCR already provides a script to export the model to ONNX format.

Where is the script to convert my text recognition model to ONNX?
There is an export.py in EasyOCR, but it is only for text detection. Is there any code for recognition?


as1605 commented Sep 2, 2023

Hi @Kromtar @alib022, I am trying to run EasyOCR on NVIDIA Triton with ONNX.

Currently, the ONNX model has loaded as:

"input": [
    {
      "name": "input1",
      "data_type": "TYPE_FP32",
      "format": "FORMAT_NONE",
      "dims": [1, 1, 64, -1],
      "is_shape_tensor": false,
      "allow_ragged_batch": false,
      "optional": false
    }
  ],
  "output": [
    {
      "name": "output",
      "data_type": "TYPE_FP32",
      "dims": [1, -1, 97],
      "label_filename": "",
      "is_shape_tensor": false
    }
  ],

Can anyone help me with how to go about making inferences and making sense of the output? What do the 64 and 97 in the dimensions stand for?


alib022 commented Sep 2, 2023

Hi,
EasyOCR is a four-step process: 1) pre-processing, 2) detection, 3) classification, 4) post-processing.
I moved steps 2 and 3 to Triton, and steps 1 and 4 are left on the client. I will try to open a public repo in a few weeks with all the code for everyone to use.
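For reference, the client-side detection call looks roughly like this (model and tensor names taken from the config.pbtxt I posted above; the rest is a sketch rather than the exact production code):

# Sketch of the client-side detection call; pre-processing (step 1) produces
# the CHW float image and post-processing (step 4) consumes the score maps.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

image_chw = np.random.rand(3, 480, 640).astype(np.float32)  # placeholder for a pre-processed image

detect_input = httpclient.InferInput("input1", list(image_chw.shape), "FP32")
detect_input.set_data_from_numpy(image_chw)

response = client.infer(
    model_name="ocr-detection-onnx",
    inputs=[detect_input],
    outputs=[httpclient.InferRequestedOutput("output")],
)
score_maps = response.as_numpy("output")  # handed to the client-side post-processing
print(score_maps.shape)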


as1605 commented Sep 2, 2023

Hi, EasyOCR is a four-step process: 1) pre-processing, 2) detection, 3) classification, 4) post-processing. I moved steps 2 and 3 to Triton, and steps 1 and 4 are left on the client. I will try to open a public repo in a few weeks with all the code for everyone to use.

@alib022
Thanks, this is exactly what I need. Actually, I need it a bit earlier, so could you please share a preview with me, or maybe I could help in making the repo public?


arcb01 commented Oct 4, 2023

Hi,
EasyOCR is a four-step process: 1) pre-processing, 2) detection, 3) classification, 4) post-processing.
I moved steps 2 and 3 to Triton, and steps 1 and 4 are left on the client. I will try to open a public repo in a few weeks with all the code for everyone to use.

Looking forward to it!

@brooksyott

Hi, EasyOCR is a four-step process: 1) pre-processing, 2) detection, 3) classification, 4) post-processing. I moved steps 2 and 3 to Triton, and steps 1 and 4 are left on the client. I will try to open a public repo in a few weeks with all the code for everyone to use.

Wondering if you are able to share your work?


namogg commented Jan 4, 2024

For the recognition network, I'm getting this error:
RuntimeError: Only tuples, lists and Variables are supported as JIT inputs/outputs. Dictionaries and strings are also accepted, but their usage is not recommended. Here, received an input of unsupported type: torch._C.ScriptObject


rm-asif-amin commented Apr 17, 2024

Hi, EasyOCR is a four-step process: 1) pre-processing, 2) detection, 3) classification, 4) post-processing. I moved steps 2 and 3 to Triton, and steps 1 and 4 are left on the client. I will try to open a public repo in a few weeks with all the code for everyone to use.

@alib022 thanks for sharing your insights in this thread. Could I get access to your code, please? I have a similar use case.

@vidysqrx

I am trying to access the Google Drive link, but I get a 404 page.
Maybe the link is dead now; could you share the latest link?
