
Display statistics of computational jobs together with their parent nodes #5816

Open · 4 of 5 tasks · Tracked by #1309
sanderegg opened this issue May 14, 2024 · 7 comments
Labels: a:apiserver (api-server service), a:dask-service (any of the dask services: dask-scheduler/sidecar or worker), a:director-v2 (issue related with the director-v2 service), a:resource-usage-tracker (resource usage tracker service)

Comments

sanderegg (Member) commented May 14, 2024

Context

Since the public API became available, it is possible to launch any number of computational jobs from a running dynamic service.
However, a user's Usage statistics currently display those computational jobs separately from their "parent" dynamic service.

Goal

Display the statistics of computational jobs linked to their parent service.

Needed changes

  1. Important note: we should keep backward compatibility of the API, and for sim4life.io we should also keep the option to pass the parent via metadata, at least for a while.
  2. modify the oSparc API so that computational jobs created/run from a running dynamic service carry the "parent" node ID (ideally filled in automatically; if not, the API shall be modified) - use cases: sim4life, meta-modeling, jupyterlabs, ... (see the sketch after this list)
  3. the parent node ID is passed all the way to the computational backend (already exists, needs to be modified based on 1.)
  4. using the parent node ID, the logs are sent back to the parent project/node if it exists (already exists, needs to be modified based on 1.)
  5. the resource usage tracker shall keep track of the parent node ID, if it exists
  6. the frontend shall display the usage of services together with their child jobs
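
A minimal sketch of what point 2 could look like from the caller's side. The base URL, endpoint path, header name, and auth scheme below are assumptions for illustration, not the final API:

```python
import os

import requests

# Hypothetical values for illustration only; the real endpoint, header
# name, and auth scheme are whatever the api-server ends up defining.
OSPARC_API = "https://api.osparc.io"
SOLVER_JOBS_PATH = "/v0/solvers/my-solver/releases/1.0.0/jobs"

# Inside a running dynamic service, the platform injects the service's own
# node ID; a job created from here declares that node as its parent.
parent_node_id = os.environ["OSPARC_NODE_ID"]

resp = requests.post(
    OSPARC_API + SOLVER_JOBS_PATH,
    headers={"X-Simcore-Parent-Node-Id": parent_node_id},  # assumed header name
    auth=(os.environ["OSPARC_API_KEY"], os.environ["OSPARC_API_SECRET"]),
    json={},  # job inputs elided
)
resp.raise_for_status()
print(resp.json())
```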

Tasks

  1. labels: a:dask-service, a:director-v2, a:webserver (sanderegg)
  2. 3 of 3 subtasks, labels: a:resource-usage-tracker (matusdrobuliak66, sanderegg)
  3. labels: a:director-v2 (sanderegg)
  4. 0 of 2 subtasks, labels: a:frontend (odeimaiz)
  5. 1 of 2 subtasks, labels: a:apiserver, a:webserver (bisgaard-itis, sanderegg)
sanderegg transferred this issue from ITISFoundation/osparc-issues on May 14, 2024
sanderegg added the a:apiserver, a:director-v2, a:dask-service, and a:resource-usage-tracker labels on May 14, 2024
matusdrobuliak66 modified the milestone: Leeroy Jenkins on May 14, 2024
sanderegg added this to the Leeroy Jenkins milestone on May 14, 2024
sanderegg changed the title from "pass parent node ID in a structured way to the computational backend, also for cost display, logs @sanderegg @matusdrobuliak66 @mguidon" to "Display statistics of computational jobs together with their parent nodes" on May 14, 2024
sanderegg (Member, Author) commented

After discussion with @bisgaard-itis, a proposal to modify the osparc python client:

  • modify the API call that creates a computational job to accept an optional header containing at least the parent node ID
  • based on the OSPARC_NODE_ID (and possibly OSPARC_STUDY_ID) environment variables set in the dynamic service,
  • the client can automatically fill in the headers
    -> users of the python client get that feature for free, as sketched below
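
A minimal sketch of the env-based auto-fill, assuming the header names are still free to choose (they are illustrative here, not the shipped client API):

```python
import os

def parent_headers() -> dict[str, str]:
    """Build the optional parent headers from the environment.

    OSPARC_NODE_ID (and possibly OSPARC_STUDY_ID) are set by the platform
    inside a running dynamic service; outside of one they are absent and
    the job is simply created without a parent.
    """
    headers: dict[str, str] = {}
    node_id = os.environ.get("OSPARC_NODE_ID")
    study_id = os.environ.get("OSPARC_STUDY_ID")
    if node_id:
        headers["X-Simcore-Parent-Node-Id"] = node_id  # assumed header name
    if study_id:
        headers["X-Simcore-Parent-Study-Id"] = study_id  # assumed header name
    return headers
```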

mguidon (Member) commented May 15, 2024

So this is to avoid having it in the not-validated metadata?

sanderegg (Member, Author) commented

> So this is to avoid having it in the not-validated metadata?

As discussed, no. This is to generalize this usage and to ensure we always get that information, so that the billing center looks nice.

As also discussed, both ways (the sim4life.io way and the new one) should work, at least for a while.

bisgaard-itis (Contributor) commented

After thinking a bit more about this, I have the following modified proposal. Since this approach relies on the client "picking up" the node_id and sending it to the api-server, I suggest simply overriding the create_solver_job method in the osparc python client so that it first calls the api-server endpoint that creates the job, and afterwards calls the PATCH endpoint with the metadata picked up from the environment variables (see the sketch below). That way we do not have to modify anything on the server, so all existing functionality keeps working, and we simply "package" the endpoints into user-friendly functions on the client side.
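
A rough sketch of that client-side wrapping. The method names on the generated client (`create_job`, `patch_job_metadata`) and the metadata key are assumptions about its surface, not the real osparc client API:

```python
import os

def create_solver_job_with_parent(solvers_api, solver_key: str, version: str, inputs):
    """Create a job, then attach the parent node from the environment.

    Two calls against existing endpoints, packaged on the client side:
    nothing changes on the server.
    """
    job = solvers_api.create_job(solver_key, version, inputs)  # assumed signature

    node_id = os.environ.get("OSPARC_NODE_ID")
    if node_id:
        # Second call: record the parent as plain (user-owned) job metadata,
        # the same way the sim4life.io C++ client does it today.
        solvers_api.patch_job_metadata(  # assumed endpoint wrapper
            solver_key, version, job.id, metadata={"parent_node_id": node_id}
        )
    return job
```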

sanderegg (Member, Author) commented

@bisgaard-itis OK, but will this also work if the user (as in sim4life.io) calls the PATCH endpoint themselves? Will that not overwrite whatever was in there? Also, I would prefer that the parent node ID is not just some JSON field but a defined one.

bisgaard-itis (Contributor) commented

This basically delegates all responsibility for setting the parent node_id to the client. So essentially the idea is to do in the osparc python client exactly what Manuel is already doing in the C++ client he uses from sim4life.io, and to wrap it into a user-friendly function that picks up the node_id from the environment. I am not sure I understand exactly what you mean by a "defined field". In the end, I guess it will be added to the metadata in the DB in the same way Manuel is currently doing it, no?

sanderegg (Member, Author) commented

@bisgaard-itis The project metadata that Manuel uses is metadata owned by the user; we currently hack it out in order to get the parent node ID. If your solution does not imply that the user may inadvertently remove the parent node ID by explicitly calling the endpoint, then I am OK with it.
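
The hazard in question, reduced to a self-contained toy (no real API involved): if the metadata endpoint has replace semantics, the user's next write silently drops the parent key, whereas merge semantics, or a dedicated server-owned field, keep it.

```python
# Toy metadata store illustrating replace vs. merge semantics.
store: dict[str, str] = {}

def patch_replace(new: dict[str, str]) -> None:
    store.clear()       # replace semantics: the whole document is swapped
    store.update(new)

def patch_merge(new: dict[str, str]) -> None:
    store.update(new)   # merge semantics: existing keys survive

patch_replace({"parent_node_id": "1234"})  # set by the client wrapper
patch_replace({"simulation": "run-42"})    # later, by the user's own code
print(store)  # {'simulation': 'run-42'}: parent_node_id silently lost

store.clear()
patch_merge({"parent_node_id": "1234"})
patch_merge({"simulation": "run-42"})
print(store)  # both keys kept
```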
