Default atomic data distributed with Cherab #364

vsnever · 2022-03-08T22:32:14Z

While discussing where to store the Gaunt factor for the bremsstrahlung emission model in #352, @Mateasek proposed creating a subpackage cherab.default_data to store the default atomic data distributed with Cherab.

This was agreed, but the question remained whether third-party atomic data sources, like OpenADAS, should inherit from the core AtomicData interface or from the DefaultAtomicData interface.

I think that third-party data sources should inherit from DefaultAtomicData, because DefaultAtomicData may contain data not present in the third-party source. For example, DefaultAtomicData has the Gaunt factor needed for bremsstrahlung but lacks atomic rates, while OpenADAS, if inherited from the core AtomicData, would have the rates but not the Gaunt factor. It would then be impossible to simulate, for example, spectral line emission on top of the bremsstrahlung background without initializing the line emission models with OpenADAS and the bremsstrahlung model with DefaultAtomicData. Yet the most convenient way to use atomic data is to connect it to the Plasma object, so that all models use the same data source.

We may run into a case where both DefaultAtomicData and OpenADAS contain data for the same physical quantity. For such a case we could add a parameter override_default to OpenADAS: if True, the OpenADAS data is used; if False, the default data is used.
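A minimal sketch of the inheritance idea, using toy stand-in classes rather than Cherab's actual interfaces. The method names, the no-argument signatures, the string return values and the assumption that the third-party source also ships a Gaunt factor are all hypothetical and only illustrate how override_default could select between overlapping data.

```python
class AtomicData:
    """Toy stand-in for a core interface: nothing is implemented here."""

    def gaunt_factor(self):
        raise NotImplementedError("gaunt_factor is not available")

    def ionisation_rate(self):
        raise NotImplementedError("ionisation_rate is not available")


class DefaultAtomicData(AtomicData):
    """Toy default data: provides the Gaunt factor but no atomic rates."""

    def gaunt_factor(self):
        return "gaunt_factor from default data"


class OpenADAS(DefaultAtomicData):
    """Toy third-party source that inherits the default data."""

    def __init__(self, override_default=True):
        self.override_default = override_default

    def ionisation_rate(self):
        return "ionisation_rate from OpenADAS"

    def gaunt_factor(self):
        # Hypothetical overlap: assume this source also ships a Gaunt factor.
        if self.override_default:
            return "gaunt_factor from OpenADAS"
        return super().gaunt_factor()


# One object now serves both the line emission and bremsstrahlung models.
source = OpenADAS(override_default=False)
print(source.ionisation_rate())
print(source.gaunt_factor())
```

With override_default=False the single source object still answers both requests, which is what connecting one data source to the Plasma object would require.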

@Mateasek, @jacklovell, what do you think?


Mateasek · 2022-03-10T10:40:09Z

Thanks for opening an issue, @vsnever. I think that creating default_data could help in the future. In my opinion it is much better than storing any data in the core.

Another option is importing classes/functions from default_data into the "derived" data. Combining both imports and inheritance could give us enough flexibility in the future. The main point I see is to give users a flexible and simple way to build their own Cherab data source without having to copy the actual data between repositories. For example, in the future we could also have an ALADDIN data source, and users could combine data from OpenADAS, ALADDIN and default_data to form their own repository.

I would also ask for the opinion of @CnlPepper and @mattngc, because this is an important decision to take.

vsnever · 2022-03-10T10:55:28Z

> Another option is importing classes/functions from default_data into the "derived" data. Combining both imports and inheritance could give us enough flexibility in the future. The main point I see is to give users a flexible and simple way to build their own Cherab data source without having to copy the actual data between repositories. For example, in the future we could also have an ALADDIN data source, and users could combine data from OpenADAS, ALADDIN and default_data to form their own repository.

Also, we can connect a list of atomic data sources instead of a single data source to Plasma. Models will iterate over data sources until they find the first one in which the required function is implemented. In this case we can inherit all atomic data sources from the core interface.
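A rough sketch of this lookup, assuming that unimplemented functions raise NotImplementedError. The classes and the lookup helper are hypothetical stand-ins, not Cherab's API.

```python
class AtomicData:
    """Toy core interface: every method is unimplemented by default."""

    def gaunt_factor(self):
        raise NotImplementedError

    def ionisation_rate(self):
        raise NotImplementedError


class DefaultData(AtomicData):
    def gaunt_factor(self):
        return "gaunt_factor from DefaultData"


class OpenADASData(AtomicData):
    def ionisation_rate(self):
        return "ionisation_rate from OpenADASData"


def lookup(sources, method_name):
    """Return the result from the first source that implements the method."""
    for source in sources:
        try:
            return getattr(source, method_name)()
        except NotImplementedError:
            continue  # this source lacks the data, try the next one
    raise NotImplementedError(f"no source implements {method_name}")


sources = [OpenADASData(), DefaultData()]
print(lookup(sources, "ionisation_rate"))
print(lookup(sources, "gaunt_factor"))
```

In this sketch, if two sources in the list implemented the same function, the result would depend only on their order in the list.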

Mateasek · 2022-03-10T11:08:18Z

> Also, we can connect a list of atomic data sources instead of a single data source to Plasma. Models will iterate over data sources until they find the first one in which the required function is implemented. In this case we can inherit all atomic data sources from the core interface.

This could lead to undefined behaviour. What would happen if the list contained several data sources with different data for the same quantity? I can imagine that debugging would be a terrible procedure, or even worse, you could end up with wrong results without even realising it. I think that giving the possibility to prepare a custom source can prevent a lot of problems.

vsnever · 2022-03-10T11:52:32Z

> I think that giving the possibility to prepare a custom source can prevent a lot of problems.

This seems to be the correct way to solve this problem, but also complex in terms of implementation. However, the effort should pay off in the future.

CnlPepper · 2022-03-11T18:49:27Z

The way to solve this, and what Matt and I were working towards, was to give Cherab its own data repository and format. OpenADAS and any other data source would simply be used to populate the cherab repository. You can see the start of this inside the "openadas" module - the rates are "installed". You just need to expand this concept.

We never got around to doing this due to us not needing/interacting with non-openadas data. So in short, the correct approach is:

- Split the current openadas module into "atomic" and "openadas".
- The "cherab" repository moves to atomic.
- The install/conversion routines for rates etc. stay in openadas.
- Other rate/data sources simply install/convert data into an internal cherab form from now on.
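The separation described above could look roughly like the following sketch. The JSON-per-rate layout, the function names and the placeholder numbers are assumptions made for illustration, not the actual repository code: source-specific converters write into a common repository, and the reader never needs to know where the data came from.

```python
import json
import tempfile
from pathlib import Path


def install_rate(repository, rate_type, element, charge, data):
    """Write one rate into the repository in the common internal format."""
    path = Path(repository) / rate_type / element / f"{charge}.json"
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(data))


def load_rate(repository, rate_type, element, charge):
    """Read a rate back; the reader does not care which source installed it."""
    path = Path(repository) / rate_type / element / f"{charge}.json"
    return json.loads(path.read_text())


def install_from_source_a(repository):
    """Hypothetical converter for one external source (placeholder numbers)."""
    data = {"origin": "source_a", "rate": [1.0, 2.0]}
    install_rate(repository, "ionisation", "h", 0, data)


def install_from_source_b(repository):
    """Hypothetical converter for a second external source."""
    data = {"origin": "source_b", "rate": [3.0, 4.0]}
    install_rate(repository, "recombination", "h", 1, data)


with tempfile.TemporaryDirectory() as repo:
    install_from_source_a(repo)
    install_from_source_b(repo)
    print(load_rate(repo, "ionisation", "h", 0)["origin"])
    print(load_rate(repo, "recombination", "h", 1)["origin"])
```

In this layout, installing a rate for the same element and charge again simply overwrites the stored file, so a later install replaces an earlier one.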

This approach is the most scalable and, as a nice side effect, it would introduce a new, hopefully cleaner, atomic data representation to the community. The community can then chip away at the garbage (representation/API wise... looking at you, ADAS) that the current data sources are.

CnlPepper · 2022-03-11T19:38:23Z

I'd add default data to the cherab repo, much like the wavelength data. Users can override it as they want.

vsnever · 2022-03-11T21:16:53Z

Thank you very much, @CnlPepper, I think I got it.
So, the atomic data repository created in the user folder is no longer associated with a single data source exclusively. The user can populate the repository with data from multiple sources, and since multiple repositories are allowed, switching between them gives the desired flexibility.

The current format for representing atomic data in Cherab has a strong correlation with how data is provided in ADAS, and this also affects the AtomicData interfaces. But for now we can pretend that this representation is universal and improve it in the future if necessary.

vsnever added enhancement question labels Mar 8, 2022

vsnever self-assigned this Aug 18, 2022

vsnever mentioned this issue Aug 25, 2022

Split the openadas module into atomic repository and OpenADAS parser #377

Draft

jacklovell mentioned this issue Feb 17, 2023

Add Zeeman splitting and Doppler broadening to StarkBroadenedLine and move line shapes models to a dedicated submodule #400

Open

Mateasek mentioned this issue Jul 25, 2023

OpenADAS DEFAULT_REPOSITORY_PATH from environment variable #416

Open
