
XDR processing using BytesIO class #646

Open
ReidMcLaughlinSecurrency opened this issue Oct 17, 2022 · 11 comments

@ReidMcLaughlinSecurrency

Is your feature request related to a problem? Please describe.
As a team, we process large sets of Stellar operations by decoding the XDR packages from the public dataset that Stellar has made available via BigQuery. With the current XDR unpacking routines, we are seeing long processing times for large datasets: based on our tests, the current SDK takes 100 ms to perform an extraction. We tested with 50,000,000 rows (one year of data for one of the most used tokens on Stellar) from the BigQuery table; at that rate, decoding would take over 2.5 hours.

Describe the solution you'd like
We suggest updating the SDK to use Python's BytesIO class for XDR decoding. This operates on the data entirely in memory and would dramatically increase the throughput of the XDR processing task. The change would also bring better compatibility with future Python versions.
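As a rough illustration of the idea (not the SDK's actual code), XDR primitives can be decoded straight from an in-memory stream with `io.BytesIO` and `struct`; the helper names below are hypothetical:

```python
import struct
from io import BytesIO

def read_uint(stream: BytesIO) -> int:
    # XDR encodes an unsigned int as 4 big-endian bytes
    data = stream.read(4)
    if len(data) != 4:
        raise EOFError("unexpected end of XDR stream")
    return struct.unpack(">I", data)[0]

def read_opaque(stream: BytesIO) -> bytes:
    # Variable-length opaque: 4-byte length, payload, padded to a 4-byte boundary
    length = read_uint(stream)
    data = stream.read(length)
    stream.read((4 - length % 4) % 4)  # skip zero padding
    return data

# Example: length-prefixed b"abc" plus one byte of padding
stream = BytesIO(struct.pack(">I", 3) + b"abc\x00")
print(read_opaque(stream))  # b'abc'
```

Because the whole buffer lives in memory, each field read is a cheap slice rather than repeated parsing overhead.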

Additional context
On some prototype code we have tested the processing time has been reduced to around 15 minutes for the same dataset.

@overcat overcat self-assigned this Oct 17, 2022
@overcat
Member

overcat commented Oct 17, 2022

Hi @ReidMcLaughlinSecurrency, thanks for your feedback, I will improve it later.

@leighmcculloch

What would be required to make this change? Would it break the API of the SDK, or only affect internals?

@overcat
Member

overcat commented Feb 1, 2023

Hi @ReidMcLaughlinSecurrency, could you please share a demo so I can understand how you achieved such significant performance improvements?

@GlennPrimmerSecurrency

What would be required to make this change? Would it break the API of the SDK, or only affect internals?

If implemented correctly, none of these changes would affect the rest of the code base. Keeping the signatures the same while changing the underlying implementation does not appear to cause any issues, based on our assessment of this option and how it would affect the rest of the SDK.

@overcat
Member

overcat commented Feb 15, 2023

I conducted a simple test on my MacBook Air (M1, 2020, 16GB RAM) using version 8.1.1 of the SDK. It takes about 0.18 ms to process a single row (this may of course vary depending on the data being used; I chose a more complex row). Processing 50,000,000 rows would take approximately 2.5 hours. However, if users are on a version < 7.0.3, additional checks may slow down processing significantly; we have already fixed this issue in version 7.0.3.

Could you please provide your test data, host machine info, and the SDK version used during testing?

simple test code:

import time

from stellar_sdk.xdr import TransactionMeta

xdr = "AAAAAgAAAAIAAAADAqrJHAAAAAAAAAAAPNIroxjSaQU9XQYR+lMTNrVf/RFQQPsJUs0z9bF/fuQAAAAAAbtdHQJum9kAAOpxAAAAAAAAAAAAAAAAAAAAAAEAAAAAAAAAAAAAAQAAAAAAAAAAAAAAAAAAAAAAAAACAAAAAAAAAAAAAAAAAAAAAwAAAAACqsixAAAAAGPaMCwAAAAAAAAAAQKqyRwAAAAAAAAAADzSK6MY0mkFPV0GEfpTEza1X/0RUED7CVLNM/Wxf37kAAAAAAG7XR0CbpvZAADqcgAAAAAAAAAAAAAAAAAAAAABAAAAAAAAAAAAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAgAAAAAAAAAAAAAAAAAAAAMAAAAAAqrJHAAAAABj2jKGAAAAAAAAAAEAAAAGAAAAAwKqyRoAAAACAAAAAHkvUm1dseiBJ0J794UCjBNhZKmhozlrHf+hismu6SxGAAAAAEXcZOEAAAACZUxJR0hUTklORwAAAAAAAHiC0OAPKuoe4wFyGJi8cPjZ6e7wEmThx9PmVSfjN/NdAAAAAAAAABhigcxnAI2z6XD+B2wAAAABAAAAAAAAAAAAAAABAqrJHAAAAAIAAAAAeS9SbV2x6IEnQnv3hQKME2FkqaGjOWsd/6GKya7pLEYAAAAARdxk4QAAAAJlTElHSFROSU5HAAAAAAAAeILQ4A8q6h7jAXIYmLxw+Nnp7vASZOHH0+ZVJ+M3810AAAAAAAAAGGKBzKoAQpA0NROtvAAAAAEAAAAAAAAAAAAAAAMCqskcAAAAAAAAAAB5L1JtXbHogSdCe/eFAowTYWSpoaM5ax3/oYrJruksRgAAAB4wB5lFAjzCMwAAC/0AAAJpAAAAAAAAAAAAAAAAAQAAAAAAAAAAAAABAAAAxLuXMEgAAAAYh+8owwAAAAIAAAAAAAAAAAAAAAAAAAADAAAAAAKqxS8AAAAAY9ob+AAAAAAAAAABAqrJHAAAAAAAAAAAeS9SbV2x6IEnQnv3hQKME2FkqaGjOWsd/6GKya7pLEYAAAAeMAeZRQI8wjMAAAv9AAACaQAAAAAAAAAAAAAAAAEAAAAAAAAAAAAAAQAAAMS7lzBIAAAAGIfvKMMAAAACAAAAAAAAAAAAAAAAAAAAAwAAAAACqsUvAAAAAGPaG/gAAAAAAAAAAwKqyRoAAAABAAAAAHkvUm1dseiBJ0J794UCjBNhZKmhozlrHf+hismu6SxGAAAAAmVMSUdIVE5JTkcAAAAAAAB4gtDgDyrqHuMBchiYvHD42enu8BJk4cfT5lUn4zfzXQAAABhigcy7f/////////8AAAABAAAAAQd7zisvIZSOAAAAGGKBzGcAAAAAAAAAAAAAAAECqskcAAAAAQAAAAB5L1JtXbHogSdCe/eFAowTYWSpoaM5ax3/oYrJruksRgAAAAJlTElHSFROSU5HAAAAAAAAeILQ4A8q6h7jAXIYmLxw+Nnp7vASZOHH0+ZVJ+M3810AAAAYYoHMu3//////////AAAAAQAAAAEHe84rLyGUjgAAABhigcyqAAAAAAAAAAAAAAAA"
time_start = time.time()
for _ in range(100_000):
    TransactionMeta.from_xdr(xdr)
time_end = time.time()
print(f"avg time: {(time_end - time_start) / 100_000 * 1000} ms")

Thank you.

@GlennPrimmerSecurrency

Whenever doing performance testing, it is critical to choose the right data for the tests. Your test uses what you call a "more complex" row; I would say the selected item is actually less complex than most of the data we process.

I agree it isn't a trivial sample, but the operations in this test data are mostly buy/sell offer operations. If you ran tests with samples containing other, more complex operations, you would likely see the performance numbers I stated in the ticket.

So let's discuss how tests are run to get an idea of real-world performance. We would parse the Stellar data for a day, maybe a week at most, statistically determine which operations were performed, and then find a representative example data sample. This is of course just a rough estimate, but we need something to tell us how fast or slow things really are. We would then run our tests on that sample, plus some tests on more complex transactions and a few on less complex ones, to ensure our choices are "good enough" to make a generalized statement about performance.
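The sampling step described above can be sketched roughly like this (the operation names and the extraction of them are placeholders, not real Stellar parsing):

```python
from collections import Counter

# Hypothetical: operation types extracted from one day of transactions
ops_seen = [
    "manage_buy_offer", "payment", "manage_sell_offer",
    "payment", "path_payment_strict_send", "payment",
]

# Tally the mix so benchmark samples can mirror real-world proportions
mix = Counter(ops_seen)
for op, count in mix.most_common():
    print(f"{op}: {count / len(ops_seen):.0%}")
```

A benchmark corpus built to match these proportions is more likely to predict real pipeline throughput than a single hand-picked row.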

After doing that, we implement the operations used (not optimized, of course, simply switching how things are decoded) and compare against the results and samples we took earlier.

The sample you provided is in fact one of the simplest in terms of converting XDR data; again, most of it is buy/sell offers, and those are a relatively simpler data structure than other operations.

Try your sampling against a few more items, such as the data for transaction hash 36cdf221cd46dbf3c9e0675abff53b9d93647176d8bfeded9e202e222f82da36.

This is not the most complex data item to test, but I believe it will show more of the performance issues.

Again, we cannot have a discussion about performance until we are testing the same things, on data that is indicative of real-world data.

@ReidMcLaughlinSecurrency
Author

It looks like the xdrlib module is deprecated and is going to be removed soon:
"Deprecated since version 3.11, will be removed in version 3.13: The xdrlib module is deprecated (see PEP 594 for details)."

https://docs.python.org/3/library/xdrlib.html

Any update on how this might be handled moving forward?
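For reference, the parts of `xdrlib.Unpacker` a decoder typically relies on are small enough to be replaced by `struct` over an in-memory buffer. A minimal stand-in sketch (not the SDK's actual replacement, just an illustration of the approach):

```python
import struct
from io import BytesIO

class Unpacker:
    """Minimal, hypothetical stand-in for the removed xdrlib.Unpacker."""

    def __init__(self, data: bytes):
        self._stream = BytesIO(data)

    def unpack_uint(self) -> int:
        # 4-byte big-endian unsigned int
        return struct.unpack(">I", self._stream.read(4))[0]

    def unpack_int(self) -> int:
        # 4-byte big-endian signed int
        return struct.unpack(">i", self._stream.read(4))[0]

    def unpack_hyper(self) -> int:
        # 8-byte big-endian signed int
        return struct.unpack(">q", self._stream.read(8))[0]

    def unpack_bool(self) -> bool:
        return bool(self.unpack_uint())

    def unpack_opaque(self) -> bytes:
        # Variable-length opaque: length, payload, zero padding to 4 bytes
        length = self.unpack_uint()
        data = self._stream.read(length)
        self._stream.read((4 - length % 4) % 4)
        return data

u = Unpacker(struct.pack(">Iq", 7, -1))
print(u.unpack_uint(), u.unpack_hyper())  # 7 -1
```

Keeping the method names compatible with `xdrlib` means callers would not need to change, only the import.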

@overcat
Member

overcat commented Jun 16, 2023

> It looks like the xdrlib module is deprecated and is going to be removed soon: "Deprecated since version 3.11, will be removed in version 3.13: The xdrlib module is deprecated (see PEP 594 for details)." https://docs.python.org/3/library/xdrlib.html
>
> Any update on how this might be handled moving forward?

Yes, please refer to #739

@overcat
Member

overcat commented Jun 16, 2023

> Whenever doing performance testing it is critical that we choose the right data to perform those tests. […] Again, we cannot have a discussion about performance until we are testing the same things and testing something that is indicative of the real world data.

Sorry, I missed this message. I tested the data you provided on my device (MacBook Air, M1, 2020, 16GB RAM), and it takes about 1ms, which is far from the 100ms mentioned in the issue.

@GlennPrimmerSecurrency

Sorry, I hadn't seen the info posted up here. Care to share your benchmark information? I find it hard to believe that things are 1 ms when it was said 18 ms was found for a smaller item to decode than the one I mentioned.

@GlennPrimmerSecurrency

It's fine; even 1 ms is high for simple decoding of data. Going a more performant route gets things down into the low microseconds, which is in line with estimates of how its performance should be, based on the message formats and lengths.
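As a rough sanity check of the microsecond-scale claim (synthetic data, not real Stellar XDR), decoding a small record of big-endian u32s from an in-memory buffer can be timed like this:

```python
import struct
import timeit
from io import BytesIO

# A synthetic 100-field record of big-endian unsigned ints
payload = struct.pack(">100I", *range(100))

def decode(data: bytes) -> list:
    # Decode every 4-byte field from an in-memory stream
    stream = BytesIO(data)
    return [struct.unpack(">I", stream.read(4))[0] for _ in range(100)]

assert decode(payload) == list(range(100))

# Best of three batches; on typical hardware this lands in the tens of
# microseconds per record
per_call = min(timeit.repeat(lambda: decode(payload), number=1_000, repeat=3)) / 1_000
print(f"{per_call * 1e6:.1f} µs per record")
```

Real SDK objects are of course more complex than flat u32 records, so this only bounds what the raw byte-level decoding costs, not the full object construction.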
