WIP: Record (TOC digest → DiffID) mapping in BlobInfoCache #2321

mtrmac · 2024-02-29T15:01:04Z

A single DiffID may map to multiple TOC digest values. Record that in BlobInfoCache, and use it for layer reuse.

Also prefer reusing even TOC-matched layers by DiffID, when available.

@giuseppe I’d appreciate a preliminary review of the new logic; see individual commits.

Draft: The BlobInfoCache implementations don’t actually store/record any data yet — so this is obviously completely untested.

giuseppe · 2024-02-29T15:30:20Z

internal/blobinfocache/types.go

+	// UncompressedDigest returns an uncompressed digest corresponding to anyDigest.
+	// Returns "" if the uncompressed digest is unknown.
+	// FIXME: Does this need to record TOC/compression type?
+	UncompressedDigestForTOC(tocDigest digest.Digest) digest.Digest


The TOC digest is the checksum of the uncompressed JSON document, so I think the compression should not matter in this case

I agree we probably don’t need that right now (with GetTOCDigest refusing to work on manifests which contain multiple TOC digest annotations, and presumably with the zstd / estargz code being unable to decompress the other one).

This comment is a looking a bit more into the future, for lookups in the other direction, where we will want to look up (UncompressedDigest → (compressed digest, TOC digest, algorithm)) and match that against “the user wants the destination to contain zstd:chunked” (i.e. reject estargz matches).

for lookups in the other direction,

That will be done in a separate data structure (an extension of RecordDigestCompressorName: We need the full set of annotations for reuse of a TOC-compressed blob, so this simple mapping is not sufficient anyway. And the other structure does record the algorithm.

mtrmac

Note to self: This is code-complete but I want to test it in practice.

mtrmac · 2024-04-13T15:55:02Z

storage/storage_dest.go

+	// (and we assume the TOC digest also uniquely identifies the contents, i.e. there aren’t two
+	// different formats/ways to parse a single TOC).


All of the c/storage+c/image code has been built around this assumption, but it is false currently (containers/storage#1888 ) and I’m not sure whether we need to revisit the design. Let’s discuss that in the c/storage issue.

yes, this assumption is correct

Should not change behavior. Signed-off-by: Miloslav Trmač <mitr@redhat.com>

The new code is not called, so it should not change behavior (apart from extending the BoltDB/SQLite schema). Signed-off-by: Miloslav Trmač <mitr@redhat.com>

…storage by DiffID If we can, prefer identifying layers by DiffID, because multiple TOCs can map to the same DiffID; and because it maximizes reuse with non-TOC layers. For now, the new situation is unreachable. Signed-off-by: Miloslav Trmač <mitr@redhat.com>

We will add one more instance of this, so share the code. Should not change behavior (it does remove one unreachable code path). Signed-off-by: Miloslav Trmač <mitr@redhat.com>

… is known - Multiple TOC values might correspond to a single DiffID (e.g. if different compression levels are used); try to share them all, identified by DiffID (so that we also reuse with non-TOC pulls). - LayersByTOCDigest only uses a single TOC digest per layer; BlobInfoCache allows multiple matches, matches layers which have been since deleted, and potentially matches TOC digests which we have created by pushing but haven't pulled yet. - On reuse, we can now use DiffID-based layer identities even if the reuse was TOC~driven. Signed-off-by: Miloslav Trmač <mitr@redhat.com>

…hole layer This is similar to what putBlobToPendingFile does. Signed-off-by: Miloslav Trmač <mitr@redhat.com>

…yers Signed-off-by: Miloslav Trmač <mitr@redhat.com>

mtrmac · 2024-04-25T21:30:00Z

To test:

Before:

# podman rmi alpine level1 level9
# rm -f /var/lib/containers/cache/blob-info-cache-v1.sqlite 
# podman pull quay.io/libpod/alpine
# podman --log-level=debug push --compression-format zstd:chunked --compression-level 1 --force-compression quay.io/libpod/alpine localhost:50000/level1
## Even better would be to use two different destination registries, to be 100% certain the blobs are not reused
## (right now they are not reused, but we’ll fix that):
# podman--log-level=debug push --compression-format zstd:chunked --compression-level 9 --force-compression quay.io/libpod/alpine localhost:50000/level9
## Note the compressed digest, and TOC digest, values:
# skopeo inspect --raw docker://localhost:50000/level1 | jq .
# skopeo inspect --raw docker://localhost:50000/level9 | jq .
## No DigestTOCUncompressedPairs entries:
# sqlite3 /var/lib/containers/cache/blob-info-cache-v1.sqlite .dump 
# podman rmi alpine level1 level9
## Triggers a partial pull: "Applying differ in …":
# podman --log-level=debug pull localhost:50000/level1
## Triggers a partial pull: "Applying differ in …"
# podman --log-level=debug pull localhost:50000/level9 
## level1 and level9 have different image IDs:
# podman images 
## Contains two copies of the layer, with the same expected-layer-diffid
# jq . < /var/lib/containers/storage/overlay-layers/layers.json ```

After:

DigestTOCUncompressedPairs contains 2 records
Pull of level1 triggers a partial pull (creating a layer with known TOC digest and uncompressed digest)
Pull of level9 reuses the layer (by BIC compressed -> uncompressed mapping)
FIXME: the layer is shared, but the image not yet - the hasLayerPulledByTOC code path is wrong

Signed-off-by: Miloslav Trmač <mitr@redhat.com>

giuseppe reviewed Feb 29, 2024

View reviewed changes

mtrmac force-pushed the chunked-bic branch from 3560394 to 5f98f2b Compare March 4, 2024 14:29

mtrmac force-pushed the chunked-bic branch 3 times, most recently from be098a2 to b14f00b Compare March 14, 2024 22:14

mtrmac force-pushed the chunked-bic branch from b14f00b to 506bacc Compare March 25, 2024 17:31

mtrmac added the kind/feature A request for, or a PR adding, new functionality label Apr 5, 2024

mtrmac force-pushed the chunked-bic branch 4 times, most recently from d238714 to 6dae67d Compare April 11, 2024 22:35

mtrmac mentioned this pull request Apr 13, 2024

zstd:chunked blocker: TarSplitChecksumKey not used in a layer ID containers/storage#1888

Open

mtrmac force-pushed the chunked-bic branch from 6dae67d to e0e53b6 Compare April 13, 2024 15:51

mtrmac commented Apr 13, 2024

View reviewed changes

mtrmac force-pushed the chunked-bic branch 2 times, most recently from 2a542f7 to 9e3cace Compare April 24, 2024 18:26

mtrmac added 7 commits April 25, 2024 22:58

Explicitly document that we assume TOC digests to be unambiguous

50e7d68

Should not change behavior. Signed-off-by: Miloslav Trmač <mitr@redhat.com>

Add TOC digest <-> uncompressed digest mapping to BIC

e745ad4

The new code is not called, so it should not change behavior (apart from extending the BoltDB/SQLite schema). Signed-off-by: Miloslav Trmač <mitr@redhat.com>

Split reusedBlobFromLayerLookupLocked from tryReusingBlobAsPending

299594e

We will add one more instance of this, so share the code. Should not change behavior (it does remove one unreachable code path). Signed-off-by: Miloslav Trmač <mitr@redhat.com>

Record (compressed, uncompressed) digest mapping if we consumed the w…

5cd0066

…hole layer This is similar to what putBlobToPendingFile does. Signed-off-by: Miloslav Trmač <mitr@redhat.com>

Record the (TOC digest, uncompressed digest) data when we compress la…

ac7a7e7

…yers Signed-off-by: Miloslav Trmač <mitr@redhat.com>

FIXME

a9266ff

Signed-off-by: Miloslav Trmač <mitr@redhat.com>

mtrmac force-pushed the chunked-bic branch from 9e3cace to a9266ff Compare April 30, 2024 20:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Record (TOC digest → DiffID) mapping in BlobInfoCache #2321

WIP: Record (TOC digest → DiffID) mapping in BlobInfoCache #2321

mtrmac commented Feb 29, 2024

giuseppe Feb 29, 2024

mtrmac Feb 29, 2024

mtrmac Apr 13, 2024

mtrmac left a comment

mtrmac Apr 13, 2024

giuseppe Apr 15, 2024

mtrmac commented Apr 25, 2024

		// (and we assume the TOC digest also uniquely identifies the contents, i.e. there aren’t two
		// different formats/ways to parse a single TOC).

WIP: Record (TOC digest → DiffID) mapping in BlobInfoCache #2321

Are you sure you want to change the base?

WIP: Record (TOC digest → DiffID) mapping in BlobInfoCache #2321

Conversation

mtrmac commented Feb 29, 2024

giuseppe Feb 29, 2024

Choose a reason for hiding this comment

mtrmac Feb 29, 2024

Choose a reason for hiding this comment

mtrmac Apr 13, 2024

Choose a reason for hiding this comment

mtrmac left a comment

Choose a reason for hiding this comment

mtrmac Apr 13, 2024

Choose a reason for hiding this comment

giuseppe Apr 15, 2024

Choose a reason for hiding this comment

mtrmac commented Apr 25, 2024