
[FR][NVME-MI] Command level timeout #553

Open
drakedog2008 opened this issue Jan 9, 2023 · 10 comments
Labels: enhancement (New feature or request)

Comments

@drakedog2008
Contributor

The current timeout implementation in libnvme-mi attaches the timeout to the MCTP endpoint, i.e. all transactions on the same EP share the same timeout.

By comparison, the in-band path implements the timeout at the I/O level (i.e. per transaction).

The use case is that some time-consuming commands (e.g. FW Commit) should not share the global timeout. The difference between in-band and OOB makes it hard for a client to implement a unified solution.

@jk-ozlabs

Is it possible to implement the timeout the same way as in-band, i.e. with the timeout as part of the input arguments of each command?
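
For reference, a minimal sketch of the two models as they stand today (the libnvme-mi call and header name are assumptions based on the library headers, not a confirmed API; the in-band side uses the kernel admin passthrough ioctl):

```c
/*
 * Sketch only: contrasts the endpoint-wide timeout currently used for
 * out-of-band (libnvme-mi) with the per-command timeout available in-band.
 * nvme_mi_ep_set_timeout() and the libnvme-mi.h header are assumptions
 * taken from the library headers; treat the exact names as unverified.
 */
#include <string.h>
#include <sys/ioctl.h>
#include <linux/nvme_ioctl.h>
#include <libnvme-mi.h>

static void oob_timeout(nvme_mi_ep_t ep)
{
	/* Out-of-band: one timeout shared by every transaction on this
	 * endpoint. A slow command such as Firmware Commit cannot be given
	 * a longer timeout without changing it for the whole EP. */
	nvme_mi_ep_set_timeout(ep, 5000);	/* 5 s, EP-wide */
}

static int inband_timeout(int fd)
{
	/* In-band: each passthrough command carries its own timeout_ms,
	 * so a single long-running command can get 60 s while everything
	 * else keeps the default. */
	struct nvme_passthru_cmd cmd;

	memset(&cmd, 0, sizeof(cmd));
	cmd.opcode = 0x10;		/* Firmware Commit, for example */
	cmd.timeout_ms = 60000;		/* applies to this command only */
	return ioctl(fd, NVME_IOCTL_ADMIN_CMD, &cmd);
}
```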

@jk-ozlabs
Collaborator

Should be possible, yep.

In the case where an API call involves multiple NVMe-MI commands (ie, multiple messages sent & received): are you looking for a timeout on the whole call, or changing the timeout on individual messages of that command?

@jk-ozlabs jk-ozlabs self-assigned this Jan 10, 2023
@jk-ozlabs jk-ozlabs added the enhancement New feature or request label Jan 10, 2023
@drakedog2008
Contributor Author

Can you give an example of a function that is a multi-message command?

@drakedog2008
Contributor Author

The log page with MI chunking?

@drakedog2008
Contributor Author

If that is the case, the API should define a unified timeout for all of the MCTP transactions involved, given that the timeout is at the MCTP packet level, not the message/command level.

@jk-ozlabs
Collaborator

Can you give an example of a function that is a multi-message command?

Yep, anything requiring chunking; currently Get Log Page, but we should define a general policy.

If that is the case, the API should define a unified timeout for all of the MCTP transactions involved, given that the timeout is at the MCTP packet level, not the message/command level.

The MCTP packet level is not really suitable here: there are no timeouts on individual packets, as there are no per-packet responses (and the packetisation is not visible to the socket interface). There may be many packets in flight for each MCTP message. So, we would be defining the timeout at the NVMe-MI command level (ie, corresponding to the MCTP message level) here.

In this case: the timeout would apply separately to each NVMe-MI command + response, rather than the nvme_mi_* API call.

As an example:

  • calling a nvme_mi_admin_get_log function, specifying a timeout of 2 sec, requesting 12288 (4096 * 3) bytes of data
  • that call gets chunked into three NVMe-MI Get Log Page commands, each requesting 4096 bytes of data
  • each NVMe-MI command (consisting of 1 MCTP request message and 1 MCTP response message, themselves consisting of an arbitrary number of MCTP packets) is completed in 1 sec
  • so, total call time is 3 sec

This would not time out, because each command + response completes within the 2-sec timeout.

Is that what you're intending?
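
For illustration, a hedged sketch of that policy (no libnvme symbols are used; the chunk size, per-command duration, and helper name are assumptions made up for this example, not an existing interface):

```c
/*
 * Hypothetical sketch: models the policy described above, where the
 * caller's timeout applies to each chunked NVMe-MI command rather than to
 * the whole nvme_mi_* API call.
 */
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

#define CHUNK_LEN 4096u		/* assumed per-command transfer size */

/* Returns true when the transfer completes, i.e. no single chunked command
 * runs longer than timeout_ms; total elapsed time may still exceed it. */
static bool get_log_chunked(size_t total_len, unsigned int timeout_ms,
			    unsigned int per_cmd_ms, unsigned int *elapsed_ms)
{
	*elapsed_ms = 0;
	for (size_t off = 0; off < total_len; off += CHUNK_LEN) {
		/* Each Get Log Page command gets the full timeout,
		 * regardless of how many chunks the call needs. */
		if (per_cmd_ms > timeout_ms)
			return false;
		*elapsed_ms += per_cmd_ms;
	}
	return true;
}

int main(void)
{
	unsigned int elapsed;

	/* 12288 bytes -> three 4096-byte commands, each taking ~1000 ms
	 * against a 2000 ms timeout: total call time is ~3 s, but no single
	 * command exceeds the timeout, so the call succeeds. */
	bool ok = get_log_chunked(12288, 2000, 1000, &elapsed);
	printf("%s, elapsed %u ms\n", ok ? "completed" : "timed out", elapsed);
	return 0;
}
```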

@drakedog2008
Contributor Author

Yeah, the timeout attached at submission is at the message level, not the packet level.

And the mechanism you described works for me.

@jk-ozlabs
Collaborator

Sounds good! I'll put something together.

@igaw
Collaborator

igaw commented Jul 3, 2024

I assume we should close this one, due to no progress.

FWIW, nvme-cli learned a global --timeout command-line argument, though I don't know whether that is relevant here.

@jk-ozlabs
Collaborator

If it's okay with you, I'd like to keep this open - seems like a worthwhile feature in general.

However, if you'd prefer to keep the issues tidier, I can track elsewhere instead :)

@igaw
Collaborator

igaw commented Jul 4, 2024

I am fine with keeping it open here. I was just not sure what the status is, and also too stupid to check the label which says 'enhancement'...
