
Maximising performance #50

Open
wonk-andy opened this issue Sep 16, 2024 · 3 comments
@wonk-andy

I posted these questions earlier on the Community Forum, but it possibly makes more sense to ask them here.

I am trying to maximise the performance of the communication between the i.MX8M Mini Cortex-M4 (bare metal) and a QNX application running on the Cortex-A53 using the librpmsg_lite-imx library.

Some questions...

I am currently sending a 212-byte message every 10 ms from the firmware running on the Cortex-M4 (the remote end of the connection). I'm not seeing any send errors, but is that rate achievable, or could I be losing data without noticing, bearing in mind this is bare metal with no queue? What is the maximum theoretical throughput? A minimal sketch of my send loop is below.
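For reference, this is roughly what the send path looks like (simplified; SHMEM_BASE, LINK_ID, the endpoint addresses, and wait_10ms() are placeholders for the platform/application-specific values):

```c
#include "rpmsg_lite.h"

/* Placeholders -- the real values are platform/application specific. */
#define SHMEM_BASE      ((void *)0xB8000000U) /* shared-memory base (assumption) */
#define LINK_ID         (0U)                  /* RPMsg link id (assumption)      */
#define LOCAL_EPT_ADDR  (30U)                 /* M4-side endpoint (assumption)   */
#define REMOTE_EPT_ADDR (40U)                 /* QNX-side endpoint (assumption)  */

/* RX callback stub; this endpoint only sends. */
static int32_t rx_cb_stub(void *payload, uint32_t payload_len,
                          uint32_t src, void *priv)
{
    (void)payload; (void)payload_len; (void)src; (void)priv;
    return RL_RELEASE;
}

static uint8_t msg[212];
extern void wait_10ms(void); /* the application's 10 ms tick, however implemented */

void comms_task(void)
{
    struct rpmsg_lite_instance *rl =
        rpmsg_lite_remote_init(SHMEM_BASE, LINK_ID, RL_NO_FLAGS);
    while (RL_FALSE == rpmsg_lite_is_link_up(rl)) { } /* wait for the A53 side */

    struct rpmsg_lite_endpoint *ept =
        rpmsg_lite_create_ept(rl, LOCAL_EPT_ADDR, rx_cb_stub, RL_NULL);

    for (;;)
    {
        /* rpmsg_lite_send() copies the payload into a shared-memory
         * buffer; it returns RL_SUCCESS or an error code (e.g. when no
         * TX buffer is free), so a clean return should mean the message
         * was queued rather than silently dropped. */
        if (RL_SUCCESS != rpmsg_lite_send(rl, ept, REMOTE_EPT_ADDR,
                                          (char *)msg, sizeof(msg), RL_BLOCK))
        {
            /* count/log the failure here */
        }
        wait_10ms();
    }
}
```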

In the multicore examples in the MIMX8MM SDK, both of the remote examples have files called rsc_table.[c|h], which define a remote resource table. What is this used for? As I increase the number of buffers defined, do I need to increase VRING_SIZE?

To give some scope for increasing the size of the data being sent, I have RL_BUFFER_PAYLOAD_SIZE set to 496 and RL_BUFFER_COUNT set to 2048 (the overrides are shown below). Does changing these have an impact on the definition of VRING_SIZE?
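For reference, the relevant part of my rpmsg_config.h (just the two values quoted above; per the rpmsg-lite documentation the buffer count should be a power of two, and 496 is 512 minus the 16-byte rpmsg header):

```c
/* rpmsg_config.h -- project overrides */
#define RL_BUFFER_PAYLOAD_SIZE (496U)  /* payload bytes per buffer (512 - 16-byte header) */
#define RL_BUFFER_COUNT        (2048U) /* number of RPMsg buffers; power of two           */
```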

-Andy.

@MichalPrincNXP
Contributor

Hello Andy,
I am sorry for the late response ... we do not have a dedicated application for throughput measurement at the moment. Throughput depends mainly on the platform (shared-memory access) and the rpmsg-lite configuration (copy vs. no-copy mechanism). Very rough data captured on an i.MX8 board: sending and then receiving 496 bytes (request + response) takes ~0.2 ms/cycle when the copy mechanism is used, and ~0.05 ms/cycle when the no-copy mechanism is used. Considering that, sending a 212-byte message every 10 ms is far from these limits and should work fine.
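For reference, the no-copy TX path looks roughly like this (function names per the rpmsg-lite sources; rl, ept and REMOTE_EPT_ADDR as created at init, and build_message() is a hypothetical helper):

```c
/* Zero-copy send: borrow a shared-memory TX buffer, build the message
 * in place, then pass ownership back to the stack. */
uint32_t buf_size = 0U;
void *buf = rpmsg_lite_alloc_tx_buffer(rl, &buf_size, RL_BLOCK);
if (buf != RL_NULL)
{
    build_message(buf, buf_size); /* fill up to buf_size bytes in place */
    /* After a successful send_nocopy the buffer belongs to the stack
     * and must not be touched again by the application. */
    if (RL_SUCCESS != rpmsg_lite_send_nocopy(rl, ept, REMOTE_EPT_ADDR, buf, 212U))
    {
        /* handle the error */
    }
}
```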

rsc_table belongs to the Linux mechanism called remoteproc, which is used for auxiliary-core resource initialization/setup/start. I am not sure whether QNX provides that remoteproc component; probably not.
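For context, the SDK's rsc_table.c essentially publishes the vdev and vring parameters in the remoteproc resource-table format, roughly like this (a trimmed sketch based on the Linux remoteproc headers; the concrete fields in the SDK file may differ slightly):

```c
#include <stdint.h>

struct fw_rsc_vdev_vring {
    uint32_t da;       /* device address of the vring */
    uint32_t align;    /* vring alignment             */
    uint32_t num;      /* number of descriptors       */
    uint32_t notifyid; /* notification (kick) id      */
    uint32_t pa;       /* reserved                    */
};

struct remote_resource_table {
    uint32_t version;       /* table format version (1)         */
    uint32_t num;           /* number of entries (the one vdev)  */
    uint32_t reserved[2];
    uint32_t offset[1];     /* byte offset of each entry         */
    /* vdev entry */
    uint32_t type;          /* RSC_VDEV                          */
    uint32_t id;            /* VIRTIO_ID_RPMSG                   */
    uint32_t notifyid;
    uint32_t dfeatures;
    uint32_t gfeatures;
    uint32_t config_len;
    uint8_t  status;
    uint8_t  num_of_vrings; /* 2 */
    uint8_t  vdev_reserved[2];
    struct fw_rsc_vdev_vring vring0; /* M4 -> A-core */
    struct fw_rsc_vdev_vring vring1; /* A-core -> M4 */
};
```

On Linux, remoteproc reads this table to locate the vrings when it manages the M core; whether anything on the QNX side plays the remoteproc role and consumes it is exactly the open question here.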

As for VRING_SIZE, yes, it depends on RL_BUFFER_COUNT, so once you change RL_BUFFER_COUNT the preallocated VRING_SIZE should be adjusted. Please have a look here at how VRING_SIZE is calculated for some platforms.
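To make the dependency concrete, here is a back-of-the-envelope check of the space a single vring needs, following the standard virtio split-ring layout that rpmsg-lite's vrings use (a sketch; the descriptor count and alignment below are assumptions, check rpmsg_platform.h for your part):

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Virtio 1.0 split-ring components. */
struct vring_desc      { uint64_t addr; uint32_t len; uint16_t flags; uint16_t next; }; /* 16 B */
struct vring_used_elem { uint32_t id; uint32_t len; };                                  /* 8 B  */

static size_t vring_space(size_t num, size_t align)
{
    size_t sz = sizeof(struct vring_desc) * num    /* descriptor table */
              + sizeof(uint16_t) * (3U + num);     /* available ring   */
    sz = (sz + align - 1U) & ~(align - 1U);        /* align the used ring */
    return sz + sizeof(uint16_t) * 3U
              + sizeof(struct vring_used_elem) * num; /* used ring     */
}

int main(void)
{
    /* e.g. 2048 descriptors (matching RL_BUFFER_COUNT above), 4 KiB alignment */
    printf("one vring needs >= %zu bytes\n", vring_space(2048U, 0x1000U));
    return 0;
}
```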

Hope it helps
Regards
Michal

@wonk-andy
Author

Hello Michal,

Apologies for the slow response; I managed to catch COVID and wasn't capable of working.

Thanks for the information. Based on your feedback, it looks like we should be able to get the data throughput that we need.

The implementation of librpmsg_lite-imx uses the standard RPMsg-Lite library, with a slightly peculiar mechanism for overriding some of the standard files with i.MX-specific ones. Does that mean I can dispense with the rsc_table parts?

Thanks,

Andy.

@MichalPrincNXP
Contributor

@wonk-andy, I am not sure about the rsc_table part; I have asked my colleague and will get back to you once I have a response...
Michal
