Adding engine definition under resources semantic conventions #1293

ivomagi · 2020-12-16T13:55:47Z

Changes

Splunk will be adding auto-instrumentation support for various Java application servers (Tomcat, Jetty, JBoss, WebLogic, WebSphere, …) over the coming months. During 2021, this support will be expanded to cover more Java application servers as well as engines for runtimes other than the JVM.

While implementing this support, we discovered that there is currently no semantic convention for conveying information such as “this span was captured from Apache Tomcat version 9.0.39”.

Capturing and exposing this information would result in:

Richer information in the service map / runtime application architecture, supporting directed troubleshooting during incident management (why is half of the production cluster still running the previous version of Apache Tomcat?)
The possibility to correlate configuration changes in version upgrades with incidents (a particular error appeared right after upgrading from WildFly 20 to WildFly 21)

With other APM vendors also capturing and exposing this information, standardizing it in OpenTelemetry is a way to guarantee vendor compatibility.

Related issues #

Related oteps #

linux-foundation-easycla · 2020-12-16T13:55:50Z

The committers are authorized under a signed CLA.

✅ ivomagi (1b533b1, aed7df8, 53a8b08)

Oberon00

I think "engine" is a bit arbitrary. IMHO what we are really missing is some concept of listing technologies used in a process and their relationships to each other (who runs on top of what, etc).

Oberon00 · 2020-12-16T13:59:51Z

specification/resource/semantic_conventions/engine.md

+
+A resource can be attributed to at most one engine. To illustrate, let's look at a Python application using Apache HTTP Server with mod_wsgi as the server and Django as the web framework. In this case:
+
+* Apache HTTP Server would be set as `process.runtime`


No, the process.runtime would describe the Python interpreter. Runtimes are always "language runtimes". We have no way to describe Apache here. Or maybe Apache would be the engine?
What about adding (parallel) array attributes e.g. process.runtime_stack.*? Then you would have the first entry for apache, the second for mod_wsgi, the 3rd for Python (Python runs inside mod_wsgi which runs on Apache).
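To make the parallel-array idea concrete, here is a minimal sketch in Python. The attribute keys `process.runtime_stack.names` and `process.runtime_stack.versions` are hypothetical (the comment above only proposes `process.runtime_stack.*` without fixing the keys), and the version numbers are purely illustrative.

```python
# Hypothetical resource attributes using two parallel arrays.
# Index 0 is the outermost layer; each later entry runs inside the previous one.
resource_attributes = {
    "process.runtime_stack.names": ["apache", "mod_wsgi", "python"],
    "process.runtime_stack.versions": ["2.4.46", "4.7.1", "3.9.1"],  # illustrative
}


def runtime_stack(attributes):
    """Pair each name with the version stored at the same index."""
    names = attributes.get("process.runtime_stack.names", [])
    versions = attributes.get("process.runtime_stack.versions", [])
    if len(names) != len(versions):
        raise ValueError("parallel arrays must have the same length")
    return list(zip(names, versions))


stack = runtime_stack(resource_attributes)
```

The sketch also shows the main weakness of this layout: the two arrays are only tied together by position, so a consumer has to check that their lengths match.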

First, I agree with @Oberon00 about process.runtime and that engine will probably be Apache HTTP if we want to be strict. If we want to be lenient, then both HTTP server and mod_wsgi can be described as engine.

Second, as usual :), I propose to keep scope small. Can runtime_stack be a separate discussion?

I don't know if that should be a separate discussion. If we can define engine precisely and usefully, then we can discuss it separately. But I wonder if it is possible to find a good and reasonably unambiguous definition. This "engine" stuff is inherently a list of technologies that can run on top of or maybe sometimes alongside each other.

"list of technologies" is too vague IMO. I mean, in the case of a Spring MVC application using the Jackson JSON parser, running on Tomcat inside the JVM, what will you include in that list? Everything? Some part? How to decide?

In the case of Java this "engine" is very simple: it is the application server and similar things, a la servlet containers. It may be "arbitrary", but this is exactly the information that we want to collect because we think it is useful.

During the last specification meeting there was general agreement that this kind of information is relevant. Perhaps people from other language SIGs may provide rules specific to their language ecosystem? Or say that they are not interested in that.

And again, perfection is the enemy of the good :)

Maybe in this case it would be better to use the term app_server instead of engine? I feel like this PR should fix #1143. I see now that this issue was indeed created by the same author as this PR.
@ivomagi please link related issues in your PR description, otherwise we might end up discussing the same things twice 😃 In this case, you can maybe even use it in the Fixes # line you left empty so far.

It was explicitly decided that we don't want any "map"-like structures in the data model for traces.

@Oberon00 I believe back then we decided that we don't want them "for now" unless there is a clear need. We may need to revisit this decision if the data at the source clearly dictates/requires more complex data structure. We may not necessarily need to do it now, but let's keep the possibility open.

@ivomagi:

What is the reason we are aiming for an array of values? Do we believe that the concept we want to express is multi-dimensional with a variable number of dimensions and thus cannot be captured with a fixed set of attributes?
How about using the approach we use elsewhere, e.g. use attributes engine.java.name, engine.java.version. If there is a second engine dimension foo then its name should be in engine.foo.name attribute. If the reader needs to find all engines they can enumerate all engine.*.name attributes.
However, this approach is only necessary if we need to record multiple engines simultaneously and I did not see examples of that in this PR.
Finally, I assume there is never a need to record more than one name of an engine of the same dimension (i.e. you never have 2 Java engines simultaneously in one resource/span). If there ever is such a need my proposal wouldn't work.
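As a rough illustration of the enumeration step described above, the following Python sketch scans a flat attribute set for keys of the form `engine.<dimension>.name`. The function name `list_engines`, the sample attribute values, and the `service.name` entry are assumptions made for the example only.

```python
import re

# Hypothetical flat attributes following the engine.<dimension>.name pattern.
attributes = {
    "engine.java.name": "tomcat",
    "engine.java.version": "9.0.39",
    "service.name": "checkout",
}

# Matches exactly engine.<dimension>.name and captures the dimension.
ENGINE_NAME_KEY = re.compile(r"^engine\.([^.]+)\.name$")


def list_engines(attrs):
    """Return {dimension: (name, version or None)} for every engine.*.name key."""
    engines = {}
    for key, value in attrs.items():
        match = ENGINE_NAME_KEY.match(key)
        if match:
            dimension = match.group(1)
            version = attrs.get(f"engine.{dimension}.version")
            engines[dimension] = (value, version)
    return engines


engines = list_engines(attributes)
```

Because the dimension is part of the key, this layout can hold at most one engine per dimension, which is the limitation noted in the last point above.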

It may also be worth placing the engine in the process namespace. We would then have process.engine.java.name. This would work unless we think there are engines that are not associated with a process (is there such a thing?).

P.S. I am not sure "engine" is the right term to use, but I don't have a better suggestion.

@Oberon00 indeed, this workaround was what I was aiming to fall back to, but I wanted to verify with smarter guys whether or not we have the "map-like" structures.

@tigrannajaryan

The array of values is not a "must have", it was just a solution I was able to come up with. The fixed set of attributes you refer to might have some issues though. Let's look at the same example used in the original pull request, which at the time was designed with just a single engine in mind (but it seems that the community looks at things differently and multiple engines might be required): a Python application using Apache HTTP Server with mod_wsgi as the server and Django as the web framework. Now all three of these could be set as engines, and all of them seem to want to reside under the engine.python namespace.

In here I do not have a bias. Maybe more eyes with more experience will share thoughts about whether the process namespace would be better or worse location.

And about the term, indeed, naming stuff is hard. Open for suggestions but all the alternatives I was able to come up with (appserver, container, framework, library, ...) were either used for different purpose or felt overloaded or vague.

The array of values is not a "must have", it was just a solution I was able to come up with. The fixed set of attributes you refer to might have some issues though. Let's look at the same example used in the original pull request, which at the time was designed with just a single engine in mind (but it seems that the community looks at things differently and multiple engines might be required): a Python application using Apache HTTP Server with mod_wsgi as the server and Django as the web framework. Now all three of these could be set as engines, and all of them seem to want to reside under the engine.python namespace.

We could do this:

apache.version=0.1.2 apache.mod_wsgi.version=1.2.3 python.django.version=2.3.4

I know this means we cannot enumerate engines without knowing what attributes to look for. Is it a requirement to be able to enumerate?

Being able to enumerate will definitely make life at backend easier, if one would like to list all engines from a trace (which sounds like a reasonable wish).

What about having the engine name as part of a path in the tree: if an engine such as Apache Tomcat or Eclipse Jetty reports itself via its own public API, which instrumentation can use, how should that be encoded? I think that a name which is potentially returned by a 3rd party and can consist of anything is better suited to be represented as a value.

@vovencij OK, if that's a requirement then what I suggested won't work.

We either do what @Oberon00 proposed earlier with parallel arrays or with using array indices in attributes names (but neither looks nice to me) or we use an array of maps, which requires us to define how such arrays are translated to formats that are not capable of representing such data natively.

I do not have a strong opinion about what approach to use and I do not know of any better alternative.

iNikem · 2020-12-16T17:21:31Z

semantic_conventions/resource/engine.yaml

+        required: always
+        brief: >
+          The name of the engine.
+        examples: ['FildFly']


Suggested change

examples: ['FildFly']

examples: ['WildFly']

I notice this was marked as resolved but not applied. Intentional?

iNikem · 2020-12-16T17:24:44Z

specification/resource/semantic_conventions/engine.md

+
+A resource can be attributed to at most one engine. To illustrate, let's look at a Python application using Apache HTTP Server with mod_wsgi as the server and Django as the web framework. In this case:
+
+* Apache HTTP Server would be set as `process.runtime`


First, I agree with @Oberon00 about process.runtime and that engine will probably be Apache HTTP if we want to be strict. If we want to be lenient, then both HTTP server and mod_wsgi can be described as engine.

Second, as usual :), I propose to keep scope small. Can runtime_stack be a separate discussion?

anuraaga · 2020-12-18T05:01:57Z

specification/resource/semantic_conventions/engine.md

+
+| Name | `engine.name` |
+|---|---|
+| Apache Tomcat | tomcat |


Would we set tomcat if using a spring-boot application which starts up embedded tomcat? I guess not since the user isn't even really aware of the version of tomcat, and it's sort of a spring-boot implementation detail (in such a case, it's spring-boot doing classloader trickery to e.g., load embedded JARs, and tomcat doesn't do anything).

If this case doesn't apply, then I'm not sure if ServletContext.getServerInfo() will be a precise check, and we may need more description about the caveats with regard to embedded container runtimes (effectively just libraries, not app servers).
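For context, the servlet API documents the string returned by `ServletContext.getServerInfo()` as having the form `servername/versionnumber`, optionally followed by extra information in parentheses. A minimal sketch of splitting such a string is shown below, in Python for illustration only; the helper name `split_server_info` is hypothetical, and the sketch says nothing about whether the container is embedded, which is the caveat raised above.

```python
def split_server_info(server_info):
    """Split 'servername/versionnumber (optional info)' into (name, version)."""
    # Drop any optional parenthesised information after the primary string.
    primary = server_info.split(" (", 1)[0].strip()
    # The version is whatever follows the last slash, if there is one.
    name, sep, version = primary.rpartition("/")
    if not sep:
        return primary, None
    return name.strip(), version.strip()


tomcat = split_server_info("Apache Tomcat/9.0.39")
```

A string without a slash yields the whole string as the name and `None` as the version, so callers can tell "no version reported" apart from an empty version.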

I think in case of embedded Tomcat/Jetty/Undertow we still want to report it.

github-actions · 2020-12-26T03:37:54Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

github-actions · 2021-01-06T03:49:05Z

Closed as inactive. Feel free to reopen if this PR is still being worked on.

tigrannajaryan · 2021-01-12T00:45:39Z

Was autoclosed due to holidays. Reopening for discussion.

iNikem · 2021-01-19T13:03:21Z

Honestly, I think we are making this PR harder/deeper than it should be. As both #1143 and #1143 (comment) say, there is very useful debugging information that current telemetry lacks. The exact meaning of that information may be language-specific, and I don't think the specification really has to dictate it. Take the aforementioned example of a Django application running on Apache httpd via mod_wsgi. Can we let language-specific auto-instrumentation decide on the exact value? This is a very language-specific question and thus not a specification concern.

This PR, at least in my eyes, says a very simple thing: if you (application operator or auto-instrumentation) want to provide this runtime information, this is the semantic attribute for it.

Which leaves two questions:

Name of the semantic attribute
Is it single value or an array?

engine is not an ideal name, but IMO it is good enough. app_server is a loaded term in the Java world and unknown in other languages. technology is too vague IMO, but I don't object to it too much. One more option is application runtime.

The "single value or array" question, IMO, depends solely on our ability to have good structure for that array. I don't like two parallel arrays for names and versions. Map-like attribute would be the simplest option.

tigrannajaryan · 2021-01-19T15:29:19Z

The "single value or array" question, IMO, depends solely on our ability to have good structure for that array. I don't like two parallel arrays for names and versions. Map-like attribute would be the simplest option.

@iNikem I agree that structurally map-like attribute would be best. The problem is that unlike OTLP most other telemetry formats are unable to represent such data. What are exporters supposed to do when they see such data? Do we want to define how map data is flattened/converted to fit the limitations of e.g. Jaeger's attributes type system?

Oberon00 · 2021-01-19T15:39:11Z

The problem is that unlike OTLP most other telemetry formats are unable to represent such data.

Also, the feature-frozen tracing API does not support such attributes, so it would not be possible to use such a semantic convention with current OpenTelemetry.

tigrannajaryan · 2021-01-19T16:01:54Z

The problem is that unlike OTLP most other telemetry formats are unable to represent such data.

Also, the feature-frozen tracing API does not support such attributes, so it would not be possible to use such a semantic convention with current OpenTelemetry.

@Oberon00 to be fair, it is possible to extend the API in backwards compatible manner to allow this, so in theory from API's perspective it is doable (though it is more work and won't be in the GA). However, I don't know how we will solve nicely the exporting of such data to non-OTLP formats. We could double down on what the spec already suggests for homogeneous arrays and make the same recommendation for maps. To remind, here is what the spec says today:

For protocols that do not natively support array values such values SHOULD be represented as JSON strings.

JSON obviously would also work for maps. But this is a slippery slope. We would essentially be making JSON attributes widespread, and while for simple arrays it is very human-readable and probably does not require any special handling in the backend, the more complicated the data structures we allow, the more there will be a need for backends to be able to deal with such data as structures rather than as simple strings.
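The JSON fallback discussed here can be sketched in a few lines of Python. This is only an illustration of the idea, not how any particular exporter behaves; the helper name `to_exportable` and the sample values are assumptions.

```python
import json


def to_exportable(value):
    """Encode lists and dicts as JSON strings; pass scalar values through unchanged."""
    if isinstance(value, (list, dict)):
        # sort_keys keeps the string stable for identical map contents.
        return json.dumps(value, sort_keys=True)
    return value


array_attr = to_exportable(["tomcat", "spring-boot"])
map_attr = to_exportable({"name": "tomcat", "version": "9.0.39"})
scalar_attr = to_exportable("tomcat")
```

The backend receives plain strings in both structured cases, which is the concern raised above: to filter on the `version` inside `map_attr`, it would first have to parse the JSON back into a structure.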

iNikem · 2021-01-21T13:26:55Z

I propose to have this attribute as single valued. The majority of useful information proposed by @Oberon00 as "technologies" above can already be obtained from span's InstrumentationLibrary. This also solves the format problem.

I propose to further stress that exact choice of what to report in this attribute (if any) is language specific and is NOT specification's concern.

As name I like application_runtime, but think that engine is good enough, even if not ideal.

If @Oberon00 and @tigrannajaryan agree, I can work with @ivomagi on updating this PR.

iNikem · 2021-01-26T16:41:41Z

@Oberon00 , @tigrannajaryan do you agree with my proposal above?

tigrannajaryan · 2021-01-27T20:19:31Z

@Oberon00 , @tigrannajaryan do you agree with my proposal above?

@iNikem I do not understand the problem well enough to agree. I do not object to going forward with the approach that makes the best sense to you. Feel free to disregard my suggestions. I was only trying to show some alternatives that could potentially help, but apparently they are not a good fit.

iNikem · 2021-03-11T07:45:38Z

then it should rather use its own attribute like dotnet.runtime_variant or whatever.

Why? In what way is that dotnet-specific attribute better than the engine proposed here?

Oberon00 · 2021-03-11T09:22:31Z

The engine proposed here is vaguely defined, but about the only thing that is clear to me is that it describes something that is not the runtime but something that runs inside it or something that the runtime runs inside. The .NET runtime variant is just additional information about the same runtime described by process.runtime.

owais · 2021-03-11T10:38:32Z

Python could use this to differentiate between web servers like uWSGI, Gunicorn, etc but I don't think engine is a good fit at all for this. Something like appserver/webserver would be a much better fit for Python at least.

carlosalberto · 2021-03-16T00:09:07Z

Summarizing, it sounds like this is something useful to a few languages, but we need to find consensus on the name (and maybe clarify things a little bit more). @owais @Oberon00 any suggestions for alternative names?

owais · 2021-03-17T18:22:20Z

I can't think of anything that'd be a better fit for Python at least other than appserver or webserver. That said, I don't know how useful this addition would be specifically for Python or something like Ruby. These are WSGI servers and generally are instrumented as a result. So, not sure if this proposal adds any significant value for Python.

carlosalberto · 2021-03-19T15:13:36Z

@ivomagi Would using webserver, as suggested, be a good compromise? Else, what about webservice_engine or web_engine?

ivomagi · 2021-03-22T13:06:27Z

engine was chosen to avoid overloaded terms such as webserver or appserver. But honestly, I care most about having this possibility, not about the naming debate. So if you, @carlosalberto, or anyone else has a strong opinion on what the naming convention should be, I am on board with continuing with an alternative.

carlosalberto · 2021-03-22T22:47:22Z

@ivomagi I suggest we go with webengine, as a) it does not overload other common names such as webserver and b) it blends the original engine with web, which is (more or less) what was originally intended. @owais @iNikem @Oberon00 please comment on whether you like this or not.

Oberon00 · 2021-03-23T07:22:09Z

Webengine is the name of some libraries that render HTML (e.g. Qt). I would still be OK with it, but I would like webserver better.

engine was chosen to avoid overloaded terms such as webserver or appserver

But isn't engine even more overloaded? It seems to me that webserver is exactly what this attribute seems to want to describe?

iNikem · 2021-03-23T13:45:48Z

But isn't engine even more overloaded? It seems to me that webserver is exactly what this attribute seems to want to describe?

Would you call WebLogic server a "webserver"?

Oberon00 · 2021-03-23T14:09:49Z

In the sense of this attribute, yes, close enough 😃

carlosalberto · 2021-03-23T21:24:14Z

All right, let's go with webengine then, and try to get this PR finally merged :) @ivomagi please update this one.

github-actions · 2021-03-31T03:19:25Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

ivomagi · 2021-03-31T09:58:33Z

@carlosalberto @bogdandrutu - updated the PR to webengine. There seems to be one requested change from Bogdan, other than this we should be good to merge?

carlosalberto · 2021-03-31T13:53:04Z

I think we can dismiss @bogdandrutu's review as the due diligence was done (let me know otherwise).

ivomagi · 2021-04-01T11:47:40Z

I think we can dismiss @bogdandrutu's review as the due diligence was done (let me know otherwise).

Indeed, it seems we captured this feedback, but the review requesting changes is still out there and blocking the merge, so how should we proceed - @carlosalberto @bogdandrutu

Feedback has been gathered/addressed.

carlosalberto · 2021-04-01T14:20:43Z

@bogdandrutu Dismissed your review - let me know if you think something needs to be addressed still, and I will work on a follow up ;)

ivomagi added 3 commits December 16, 2020 15:45

Added engine definition

1b533b1

Engine added

aed7df8

Added reference to engine.md under Compute Unit

53a8b08

ivomagi requested review from a team as code owners December 16, 2020 13:55

github-actions bot assigned yurishkuro Dec 16, 2020

Oberon00 reviewed Dec 16, 2020

iNikem reviewed Dec 16, 2020

yurishkuro removed their assignment Dec 16, 2020

anuraaga reviewed Dec 18, 2020

Oberon00 mentioned this pull request Dec 18, 2020

Adding resource attributes post-creation (e.g. via auto-discovery) #1298

Open

github-actions bot added the Stale label Dec 26, 2020

Apply suggestions from code review

906839f

Co-authored-by: Nikita Salnikov-Tarnovski <gnikem@gmail.com>

github-actions bot closed this Jan 6, 2021

tigrannajaryan reopened this Jan 12, 2021

tigrannajaryan removed the Stale label Jan 12, 2021

Base automatically changed from master to main January 27, 2021 21:16

ivomagi added 2 commits January 29, 2021 13:44

Update engine.md

679fe46

Relaxed the semantics about allowed engines and changed the wording allowing instrumentation library authors to make this decision themselves.

Update engine.md

7e78b31

removed trailing spaces for markdownlint to pass

github-actions bot removed the Stale label Mar 10, 2021

github-actions bot added the Stale label Mar 31, 2021

ivomagi requested a review from bogdandrutu March 31, 2021 09:38

ivomagi added 6 commits March 31, 2021 12:39

Merge branch 'main' into engine-definition

6854721

Renamed engine to webengine

cba1f77

Updated reference to webengine

f02e289

Converted to webengine

af1c8f5

Rename engine.md to webengine.md

0de3728

Update webengine.md

8691efa

github-actions bot removed the Stale label Apr 1, 2021

Merge branch 'main' into engine-definition

39cd9e0

carlosalberto merged commit ef4612d into open-telemetry:main Apr 1, 2021


