mount the dedicated storage for each function #1408

akihikokuroda · 2024-07-15T20:15:18Z

Summary

Mount the dedicated storage for each function

Details and comments

The dedicated storage area is mounted at /function_data for the function provider. When the function has no associated provider, the storage mounted at /function_data is the same storage as /data.
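To make the mount behaviour described above concrete, here is a minimal sketch of how the backing sub-path for each mount point could be chosen. The helper name `storage_subpaths` and the assumption that a provider's area is a directory named after the provider are hypothetical and not taken from this PR; only the fallback to the user's area when there is no provider comes from the description.

```python
from typing import Optional


def storage_subpaths(username: str, provider: Optional[str] = None) -> dict:
    """Return the sub-path backing each mount point (hypothetical helper)."""
    # /data always points at the user's own storage area.
    user_area = username
    # Assumption: a provider's functions get a directory named after the provider.
    # Without a provider, /function_data falls back to the same area as /data.
    function_area = provider if provider else user_area
    return {"/data": user_area, "/function_data": function_area}
```

For example, `storage_subpaths("alice")` maps both mount points to `alice`, while `storage_subpaths("alice", "acme")` keeps `/data` on `alice` and points `/function_data` at `acme`.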

akihikokuroda · 2024-07-15T20:53:52Z

Based on the current function design, when the Function is provided in a custom image, only the function code runs in the Ray nodes. User code doesn't run in the Ray nodes, so it cannot access the directory.

psschwei · 2024-07-19T16:26:39Z

charts/qiskit-serverless/charts/gateway/templates/rayclustertemplate.yaml

+              - mountPath: /function_data
+                name: user-storage
+                subPath: {{`{{ function_data }}`}}


Do we need to mount this with any kind of restricted permissions? In theory, and correct me if I'm wrong here, whoever is running the function (user or provider) would be able to access this location via the container...

My understanding is that with the current custom image function design, only the code in the custom image is executed in the Ray node. No user code is sent to or executed in the Ray node, so only the function code reads from or writes to this directory.

Users could in theory find a way around that restriction (if we misimplement something, CVEs / exploits, etc.), so it's a risk. We'd just need to determine whether it's an acceptable one.

Could we just add the path when a function is from a provider? That could solve part of the problem. And correct me if I'm wrong here @akihikokuroda, but all the providers will write here, so they could potentially see files from other providers with this approach. What would be the problem with using the provider name as the sub-path instead of function_data?

My idea was that this way we could attach the provider-specific path only when the function comes from that provider (but I don't know whether this approach has limitations).
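The proposal above could be sketched roughly as follows: build the list of volume mounts so that `/function_data` is attached only for provider functions, with the provider name as the sub-path. This is an illustration of the suggestion, not the merged implementation; `build_volume_mounts` is a hypothetical name, and the `/data` entry (its volume name and sub-path) is an assumption, since only the `/function_data` mount appears in the diff.

```python
from typing import Optional


def build_volume_mounts(username: str, provider: Optional[str] = None) -> list:
    """Build Kubernetes-style volumeMounts entries (hypothetical helper)."""
    # Assumption: /data is backed by the same volume, under the user's name.
    mounts = [
        {"mountPath": "/data", "name": "user-storage", "subPath": username},
    ]
    # Attach /function_data only when the function belongs to a provider,
    # so each provider sees only its own directory.
    if provider:
        mounts.append(
            {"mountPath": "/function_data", "name": "user-storage", "subPath": provider}
        )
    return mounts
```

With this shape, a user function gets a single `/data` mount, and two different providers never share a `/function_data` directory.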

I thought function-data was the provider space and data was for users? Since users would already go to data on their own functions.

For the provider, it may be useful to have both while they develop the function, because it is then the same environment as when the function is executed by the user.

I thought function-data was the provider space and data was for users?

yep

since users would already go to data on their own functions

Basically that's my comment: for functions that come from the user we are adding /function_data too. It's true that /function_data and /data are going to point to the user.username path, so it's not a big deal, but we can probably just avoid adding /function_data for user functions.

BTW, another answer that I would understand is: "well, this way we don't overcomplicate the template even more", and I would agree. Yep.

A Django template inside a Helm template is very tricky :-)
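A small illustration of why the nesting is tricky: the line `subPath: {{`{{ function_data }}`}}` from the diff is rendered twice. Helm first emits the backtick-quoted raw string verbatim, leaving a literal `{{ function_data }}` that a Django-style template pass can fill in later. The sketch below only mimics both stages with regular expressions; it uses neither Helm nor the Django template engine, and the context value `"acme"` is made up.

```python
import re

# The line as written in the Helm chart.
HELM_SOURCE = "subPath: {{`{{ function_data }}`}}"


def helm_stage(text: str) -> str:
    """Mimic Helm emitting a backtick-quoted raw string verbatim."""
    return re.sub(r"\{\{`(.*?)`\}\}", lambda m: m.group(1), text)


def gateway_stage(text: str, context: dict) -> str:
    """Mimic a Django-style {{ name }} substitution (not the real engine)."""
    return re.sub(
        r"\{\{\s*(\w+)\s*\}\}", lambda m: str(context[m.group(1)]), text
    )


# Stage 1 leaves a placeholder for the second template pass.
after_helm = helm_stage(HELM_SOURCE)
# Stage 2 fills the placeholder with a (made-up) sub-path value.
final = gateway_stage(after_helm, {"function_data": "acme"})
```

After the first stage the text is `subPath: {{ function_data }}`; after the second it is `subPath: acme`.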

Hahaha I agree, I agree. I'm just reviewing the rest of the code, Aki, and I will update my review 👍

akihikokuroda · 2024-07-26T16:49:52Z

@Tansito @psschwei I would like to complete this. What are the remaining concerns? Thanks!

psschwei · 2024-07-26T16:52:53Z

Good from my side. Just need to resolve David's question about mounting /data

Tansito · 2024-07-26T16:54:31Z

Sorry, I didn't read your answers yesterday.

akihikokuroda · 2024-07-26T16:59:52Z

The Function can write something that it wants to share with the user. I'm planning to provide two loggers to the function: one for private logs and the other for logs shared with the user. The latter log is written to a file in the /data directory.
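The two-logger idea could look roughly like the sketch below, built on Python's standard `logging` module. It is not the design from the follow-up PR: `make_function_loggers`, the log file names, and writing the private log under the function's own directory are all assumptions; only "the shared log goes to a file under /data" comes from the comment above.

```python
import logging
from pathlib import Path


def make_function_loggers(shared_dir: str, private_dir: str, name: str = "function"):
    """Return (private_logger, shared_logger), each writing to its own file."""

    def _file_logger(logger_name: str, path: Path) -> logging.Logger:
        logger = logging.getLogger(logger_name)
        logger.setLevel(logging.INFO)
        # Keep records out of parent loggers so private lines cannot leak.
        logger.propagate = False
        if not logger.handlers:  # avoid duplicate handlers on repeated calls
            path.parent.mkdir(parents=True, exist_ok=True)
            handler = logging.FileHandler(path)
            handler.setFormatter(
                logging.Formatter("%(asctime)s %(levelname)s %(message)s")
            )
            logger.addHandler(handler)
        return logger

    # Assumption: private logs live in the provider-only area.
    private = _file_logger(f"{name}.private", Path(private_dir) / f"{name}.log")
    # Shared logs go to the user-visible area, e.g. /data.
    shared = _file_logger(f"{name}.shared", Path(shared_dir) / f"{name}.log")
    return private, shared
```

In a deployed function this might be called as `make_function_loggers("/data", "/function_data")`, so that anything logged through the second logger ends up where the user can read it.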

Tansito

Logic LGTM Aki, thank you! BTW, something I would like to improve in the near future is logging. We need to introduce more logs in general; otherwise, debugging this in case of an error would be a nightmare 😅

akihikokuroda added 2 commits July 15, 2024 15:56

add mount /function_data space

7c89c88

add /function_data space

1e97109

akihikokuroda requested review from Tansito, IceKhan13 and psschwei July 15, 2024 20:22

psschwei reviewed Jul 15, 2024

Tansito marked this pull request as draft July 15, 2024 21:16

akihikokuroda added 4 commits July 18, 2024 14:28

update file function with provider

fad981f

client update

bcba7b7

adding test data

c273ed8

fix lint errors

4a80c3e

akihikokuroda marked this pull request as ready for review July 18, 2024 19:29

psschwei reviewed Jul 19, 2024

akihikokuroda mentioned this pull request Jul 24, 2024

Configure loggers for function #1421

Draft

Tansito approved these changes Jul 26, 2024

akihikokuroda merged commit 843ee36 into Qiskit:main Jul 26, 2024
14 checks passed

akihikokuroda deleted the functionstorage branch July 26, 2024 17:34
