You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We had a corrupt SIF file, to check ran singularity run <bla.sif>, it printed could not open image <bla.sif>: SIF image <bla.sif> is corrupted: wrong partition size.
Can we make it possible to set the delay and timeout in the grpc4bmi.bmi_client_singularity.BmiClientSingularity construtor?
So we give some time for the container to die instead of immediatly checking
This could be done by passing delay in the model.setup() method or in the ewatercycle.yaml file.
The text was updated successfully, but these errors were encountered:
Do I understand correctly that you want to expose these settings through the eWaterCycle python package? Or can we just hard-code it in the model.setup() method?
Also: should we add an issue to catch this in grpc4bmi?
In grcp4bmi you can set the delay and timeout already. The problem is that the ewatercycle package uses defaults which can not be overwritten when you want to debug why a model is not launching.
The time between starting the container and checking if the container is stilll running is now set to 0 seconds. Being able to make this something like 0.1 seconds, would catch more unsuccessful container starts. Having a non-zero delay hardcoded in the model.setup() would help, but it depends on the system speed how long you must wait for the container to die prematurely. So I would rather have a way to change the delay on a system wide area like in ewatercycle.yaml then a hardcoded value or exposing it as a argument in setup().
Ah right, so in that case I'd argue for setting it in ewatercycle.yaml, with a generous default value. Since it is quite a technical thing to do, I'd rather hide it from the public API as much as possible.
We had a corrupt SIF file, to check ran
singularity run <bla.sif>
, it printedcould not open image <bla.sif>: SIF image <bla.sif> is corrupted: wrong partition size
.I expected DeadContainerException to be thrown at https://github.com/eWaterCycle/grpc4bmi/blob/d4e644a3177774e348295f78c2c4061094858256/grpc4bmi/bmi_client_singularity.py#L235, but instead the BmiClientSingularity was stuck on connecting to a grpc server that died prematurely.
Can we make it possible to set the
delay
andtimeout
in the grpc4bmi.bmi_client_singularity.BmiClientSingularity construtor?So we give some time for the container to die instead of immediatly checking
This could be done by passing
delay
in the model.setup() method or in the ewatercycle.yaml file.The text was updated successfully, but these errors were encountered: