Portal Dashboard: Treat MarkdownPartMetadata as untyped JSON #3123

mikhailshilkov · 2024-03-04T10:59:34Z

Problem

As explained in #1696 and #3023, the azure-native:portal:Dashboard resource is currently ususable due to lack of specific types for dashboard parts. The API supports a multitude of part types, but the open API spec only describes one of them. The result of this is that we generate a discriminated union which is so restricted that it can't be used in practice.

We opened Azure/azure-rest-api-specs#27465 upstream but received no feedback yet. The work to describe all other dashboard part types looks very involved - there are many types with many-many properties. Unfortunately, we don't anticipate reliable type definitions any time soon.

Proposal

As suggested in #3023, this PR introduces a type override to remove the only strongly typed part definition and replace it with a generic DashboardMetadataPart type. The type accepts arbitrary collections of inputs and settings, which enables passing configuration for any part type.

Note that technically this is a breaking change. However, I'm quite convinced that the previous type system prevented anyone from creating any non-trivial dashboards, so we are not breaking any practical user scenarios.

Implementation details

Historically, we implement this kind changes as a special case in our generation logic. However, this change is kind of tricky to do in the generation pass, because we need to introduce an new type, change an existing type, and then (ideally) remove unused types too.

Because of that, and to avoid further bloat of inline generation logic, I decided to introduce a new mechanism in customer resources. It allows specifying "overrides" for schema and metadata types, that would then replace the original types with the same name.

Additionally, I added a final-pass on the schema that deletes all unused object types. The good news is that no previous types were unused, so only the Dashboard Markdown types are being removed now. This also helps with the existing snapshot test: it elides the Dashboard types and therefore the snapshot is unchanged.

Testing

In addition to a few unit tests, I added two end-2-end tests that create a dashboard from TypeScript and C#. The TypeScript one was created by importing an elaborate dashboard created manually in Azure portal. Unfortunately, our C# import is broken beyond repair for untyped dictionaries, so I defined a much simpler dashboard manually.

Resolves #1696
Resolves #3023

github-actions · 2024-03-04T11:05:02Z

Does the PR have any schema changes?

Found 10 breaking changes:

Types

🟡 "azure-native:portal:DashboardParts": properties: "metadata" type changed from "#/types/azure-native:portal:MarkdownPartMetadata" to "#/types/azure-native:portal:DashboardPartMetadata"
🟡 "azure-native:portal:DashboardPartsResponse": properties: "metadata" type changed from "#/types/azure-native:portal:MarkdownPartMetadataResponse" to "#/types/azure-native:portal:DashboardPartMetadataResponse"
🔴 "azure-native:portal:MarkdownPartMetadata" missing
🔴 "azure-native:portal:MarkdownPartMetadataContent" missing
🔴 "azure-native:portal:MarkdownPartMetadataResponse" missing
🔴 "azure-native:portal:MarkdownPartMetadataResponseContent" missing
🔴 "azure-native:portal:MarkdownPartMetadataResponseSettings" missing
🔴 "azure-native:portal:MarkdownPartMetadataResponseSettingsSettings" missing
🔴 "azure-native:portal:MarkdownPartMetadataSettings" missing
🔴 "azure-native:portal:MarkdownPartMetadataSettingsSettings" missing
No new resources/functions.

codecov · 2024-03-04T11:14:15Z

Codecov Report

Attention: Patch coverage is 95.39171% with 10 lines in your changes are missing coverage. Please review.

Project coverage is 61.40%. Comparing base (1e0e6a4) to head (03d9e7a).

Files	Patch %	Lines
provider/pkg/gen/types.go	84.78%	6 Missing and 1 partial ⚠️
provider/pkg/gen/schema.go	90.32%	2 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #3123      +/-   ##
==========================================
+ Coverage   60.75%   61.40%   +0.64%     
==========================================
  Files          71       72       +1     
  Lines       11368    11583     +215     
==========================================
+ Hits         6907     7112     +205     
- Misses       3898     3906       +8     
- Partials      563      565       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

danielrbradley

The end result looks good, though I've got a couple of concerns.

The new schema transformation doesn't appear to be scoped to only affecting the custom resource's schema - it can globally change any types. That said, it's also not complete as it doesn't allow for transformation of the resource's schema itself. Also, the order of transformations could be an issue as they could easily interact with each other. It would be much easier to reason about if we could apply the transformation for the types generated for just a specific resource - so the transformation is scoped to a single gather resource context.

Secondly, the implementation of the transformation looks quite fragile and slow. We're iterating through every generated type which I expect adds quite a bit of redundant processing. We're also using a hard-coded list to try and identify the types associated with the resource. If the schema is updated this will break.

Can we make this easier to maintain in the future?

mikhailshilkov · 2024-03-06T10:17:27Z

@danielrbradley All good questions - I had similar thoughts. See my answers below.

we could apply the transformation for the types generated for just a specific resource

There is no such thing as types scoped to a single resource. Types are contained within a module, and multiple resources of that module can rely on overlapping types. If we wanted to get a list of all types that a given resource depends on, we'd need to traverse its properties recursively and build out a list. And yet, again, it would not be an exclusive list.

I actually think that we need cross-resource and cross-module transformations. We could use them for existing hard-coded generation tweaks like SubResource.ID expansion, User Assigned Identities shape, maintained sub-resource collection, etc.

it's also not complete as it doesn't allow for transformation of the resource's schema itself

That's straightforward to add later, I don't see why I need to do so in this PR without a use case.

the order of transformations could be an issue as they could easily interact with each other.

That's fair. Do you have ideas how to overcome this? I could add a test that runs transformations in a few random orders and compares the result. Since we are talking about schema types here, if the order becomes important, we will start getting spurious diffs on schema generation, which is already a test of a kind.

We're iterating through every generated type which I expect adds quite a bit of redundant processing

It's an iteration through map keys and parsing each key, which is ~O(N) for N in a few thousands. I expect it to take a few ms max. What am I missing?

We're also using a hard-coded list to try and identify the types associated with the resource. If the schema is updated this will break.

That's fair. Do you have suggestions here? How can we transform the types without relying on their names? The best I can think of is to check the type shape and fail loud and clear if it's not what we expect, forcing a maintainer to resolve it.

If you have a better approach on these in mind, I'm all ears!

danielrbradley · 2024-03-06T11:10:20Z

@mikhailshilkov I could see a few alternatives here...

For easier maintainability, we could just write the component completely manually - and skip the resource being generated at all. This would move us away from a half-way house of generating then kudging.
Another option would be skip the resource during the main generation, then when creating the custom resource call into the main generation code but from an isolated context so we get a complete list of the types related to just this one resource, then customise this. Then we could merge the resource & associated types into the main schema.
Currently our context during generation is for the whole packageGenerator, but we could create a new context per resource being generated something like resourceGenerator wherein we could load transformations for that specific resource (so resource transformations are registered before generation, keyed by resource) then these transformations would be called while generating all types associated to that resource.
A larger change would be to refactor the generation process to avoid mutating any global context - where every resource returns its own resource and associated types, then we use a process to merge each resources types after. This would allow for very simpler intercepting of the values being returned rather than intercepting while they're being generated (just before they're added to the global list).

mikhailshilkov · 2024-03-11T13:14:52Z

@danielrbradley I refactored the implementation away from all-mighty transformations. Instead, the custom resource is now specifying "overrides" for schema and metadata types, that then replace the original types with the same name.

Additionally, I added a final-pass on the schema that deletes all unused object types. The good news is that no previous types were unused, so only the Dashboard Markdown types are being removed now. This also helps with the existing snapshot test: it elides the Dashboard types and therefore the snapshot is unchanged.

Let me know if you think it's a step in the right direction.

danielrbradley

I think this is better, though it still feels a little obscure when reading the custom resource definition as to what it's actually doing.

The separation of Types* and MetaTypes* is not great as these are almost identical implementations, and could easily introduce bugs if they differed accidentally. It's probably a wider issue to bring these two together into a single model at some point which can be projected into both the schema & metadata as required.

Thinking from the point of view of what we're trying to express in the purpose of this custom resource, perhaps utilizing the idea of the visitor pattern could work nicely here so the custom resource can only interact with types that the original resource referenced, or add new ones. For example:

func portalDashboard() *CustomResource {
	return &CustomResource{
		// Provide a token of the resource to override
		Token: "azure-native:portal:Dashboard",
		// Follow the types referenced from the resource then pass them all in here to be customised
		CustomiseTypes: func (schema map[string]*schema.ComplexTypeSpec, metadata map[string]*resources.AzureAPIType) (map[string]*schema.ComplexTypeSpec, map[string]*resources.AzureAPIType) {
			// Iterate over types & add to output with modifications where required.
			// Extra types can be added here too.
			// Unreferenced types will be dropped from the schema.
		},
		// We could also add a `CustomiseResource` function eaily to mirror this for the root resource's schema too.
	}
}

danielrbradley · 2024-03-11T14:54:45Z

provider/pkg/gen/schema.go

+func normalizePackage(pkg *pschema.PackageSpec, metadata *resources.AzureAPIMetadata) {
+	// Record all type tokens referenced from resources and functions.
+	usedTypes := map[string]bool{}
+	visitor := func(t string, _ pschema.ComplexTypeSpec) {
+		usedTypes[t] = true
+	}
+	VisitPackageSpecTypes(pkg, visitor)
+
+	// Elide unused types.
+	allTypeNames := codegen.SortedKeys(pkg.Types)
+	for _, typeName := range allTypeNames {
+		if !usedTypes[typeName] {
+			t := pkg.Types[typeName]
+			if len(t.Enum) > 0 {
+				continue
+			}
+			delete(pkg.Types, typeName)
+			delete(metadata.Types, typeName)
+		}
+	}
+}


This method would be a nice utility to have for elsewhere too! Maybe another little one for the pulumi-go-provider as an "x" package?

…types

mikhailshilkov force-pushed the mikhailshilkov/dashboard-type branch 4 times, most recently from 1955a3e to 709e975 Compare March 5, 2024 19:52

mikhailshilkov requested review from thomas11, danielrbradley and mjeffryes March 5, 2024 21:34

mikhailshilkov marked this pull request as ready for review March 5, 2024 21:34

danielrbradley requested changes Mar 6, 2024

View reviewed changes

mikhailshilkov force-pushed the mikhailshilkov/dashboard-type branch from 709e975 to 5e6b940 Compare March 11, 2024 13:10

mikhailshilkov force-pushed the mikhailshilkov/dashboard-type branch 3 times, most recently from 098633b to 793eb8e Compare March 11, 2024 14:28

danielrbradley reviewed Mar 11, 2024

View reviewed changes

mikhailshilkov added 4 commits March 11, 2024 17:11

Examples and tests

c6f504b

Schema and SDKs

1c03b6b

Define schema transformations and use them to amend portal dashboard …

093122a

…types

Replace transformations with schema type overrides

03d9e7a

mikhailshilkov force-pushed the mikhailshilkov/dashboard-type branch from 793eb8e to 03d9e7a Compare March 11, 2024 16:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Portal Dashboard: Treat MarkdownPartMetadata as untyped JSON #3123

Portal Dashboard: Treat MarkdownPartMetadata as untyped JSON #3123

mikhailshilkov commented Mar 4, 2024 •

edited

github-actions bot commented Mar 4, 2024 •

edited

codecov bot commented Mar 4, 2024 •

edited

danielrbradley left a comment

mikhailshilkov commented Mar 6, 2024

danielrbradley commented Mar 6, 2024

mikhailshilkov commented Mar 11, 2024

danielrbradley left a comment

danielrbradley Mar 11, 2024

Portal Dashboard: Treat MarkdownPartMetadata as untyped JSON #3123

Are you sure you want to change the base?

Portal Dashboard: Treat MarkdownPartMetadata as untyped JSON #3123

Conversation

mikhailshilkov commented Mar 4, 2024 • edited

Problem

Proposal

Implementation details

Testing

github-actions bot commented Mar 4, 2024 • edited

Does the PR have any schema changes?

Types

codecov bot commented Mar 4, 2024 • edited

Codecov Report

danielrbradley left a comment

Choose a reason for hiding this comment

mikhailshilkov commented Mar 6, 2024

danielrbradley commented Mar 6, 2024

mikhailshilkov commented Mar 11, 2024

danielrbradley left a comment

Choose a reason for hiding this comment

danielrbradley Mar 11, 2024

Choose a reason for hiding this comment

mikhailshilkov commented Mar 4, 2024 •

edited

github-actions bot commented Mar 4, 2024 •

edited

codecov bot commented Mar 4, 2024 •

edited