Improve performance of conversion #190

kavitharaju · 2022-11-07T05:09:42Z

The parsing with the tree-sitter module is quite fast even for large and complex usfm files. Then we do a sequential parsing of the output syntax tree to convert them to USX , JSON etc. In doing so, the performance is greatly affected. Need to look into some alternate programming methodologies like callbacks to improve this.

shadow-light · 2024-11-21T03:29:47Z

Yes, just to give some real world stats:

https://github.com/schierlm/BibleMultiConverter can do USFM->USX conversion for a whole Bible in ~6 seconds.

usfm-grammar (Node) takes 3-60 seconds per book, so probably ~2000 seconds for whole Bible. I didn't run the whole thing as might have taken half an hour.

But there's different use cases, and it looks like this could be really useful for a more feature rich converter. I'll be keen to hear if there are performance improvements.

kavitharaju · 2024-11-26T15:38:05Z

Note: In python could use https://docs.python.org/3/library/profile.html to find out where improvement is needed

kavitharaju added the enhancement label Nov 7, 2022

kavitharaju added this to the 2023 First Quarter Release milestone Nov 7, 2022

kavitharaju removed this from the 2023 First Quarter Release milestone Sep 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of conversion #190

Improve performance of conversion #190

kavitharaju commented Nov 7, 2022

shadow-light commented Nov 21, 2024

kavitharaju commented Nov 26, 2024

Improve performance of conversion #190

Improve performance of conversion #190

Comments

kavitharaju commented Nov 7, 2022

shadow-light commented Nov 21, 2024

kavitharaju commented Nov 26, 2024