storage/disk: bundles issue 4868 #4877

Conversation

@srenatus srenatus commented Jul 11, 2022

🚧 👷 PR for discussion purposes (#4868)

@@ -351,17 +347,20 @@ func (db *Store) Truncate(ctx context.Context, txn storage.Transaction, params s

// update symlink to point to the active db
symlink := filepath.Join(path.Dir(newDB.Opts().Dir), symlinkKey)
@srenatus commented:

I've included #4870 to unbreak my workflow, please disregard.

@srenatus srenatus left a comment


some comments inline

storage/disk/disk.go
return ret
}

return []file{f} // try it anyways
@srenatus commented:

That's the 🤞-it-might-still-work-if-we-try code path. We might still fail, but maybe we don't: the smallSize cutoff isn't based on any calculation, it's a guess.
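
For illustration, here's a minimal Go sketch of that cutoff-and-fallback shape -- the names (file, smallSize, breakUp) are placeholders, not the PR's actual code in bundle/file.go:

package splitsketch

// file stands in for a bundle file with its raw payload.
type file struct {
	path string
	data []byte
}

// smallSize is a guessed cutoff, not a calculated limit.
const smallSize = 1 << 20 // 1 MiB

// split returns f as-is when it's small enough, tries to break a large
// payload into pieces, and otherwise falls back to the single file: the
// write may still fit into one transaction, so we try it anyway.
func split(f file, breakUp func(file) ([]file, error)) []file {
	if len(f.data) <= smallSize {
		return []file{f}
	}
	if parts, err := breakUp(f); err == nil && len(parts) > 1 {
		return parts
	}
	return []file{f} // try it anyway
}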

bundle/file.go
Before, we failed to activate a bundle with a large data.json: we
attempted to load the whole value in a single write.

Now, we recognize when the value is larger than a threshold, and split
it into multiple blobs, which hopefully go in with a single txn.

Activating the bundle in question works, but re-activating it afterwards
still fails in a different place: eraseBundles accumulates too many writes
in a single txn when attempting to reset the data in the store.

Signed-off-by: Stephan Renatus <stephan.renatus@gmail.com>
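
To make the splitting idea in that commit message concrete, a hedged Go sketch -- the threshold, the per-key paths, and the write callback are assumptions for illustration, not the real bundle/file.go or storage/disk code:

package blobsketch

import (
	"encoding/json"
	"fmt"
)

// blobThreshold is a guessed cutoff, in the same spirit as smallSize.
const blobThreshold = 1 << 20 // 1 MiB

// writeBlobs stores raw in one write when it's small enough; otherwise it
// splits the JSON object into one blob per top-level key. write stands in
// for the store's per-path write.
func writeBlobs(raw []byte, write func(path string, blob []byte) error) error {
	if len(raw) <= blobThreshold {
		return write("/", raw)
	}
	var obj map[string]json.RawMessage
	if err := json.Unmarshal(raw, &obj); err != nil {
		return write("/", raw) // not a JSON object: try the single write anyway
	}
	for k, v := range obj {
		if err := write("/"+k, v); err != nil {
			return fmt.Errorf("write %q: %w", k, err)
		}
	}
	return nil
}

Splitting on top-level keys keeps individual blobs smaller without changing the logical layout of the data; the real code would have to pick split points that line up with the store's partitioning.
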
@srenatus srenatus force-pushed the sr/storage/disk-bundles-issue-4868 branch 2 times, most recently from 917f5c5 to d6150b4 on July 12, 2022 09:22
Before, we accumulated too-large transactions by retrieving data keys
according to their partitioning, and deleting them one by one.

Now, we do the same thing, but in an iterator passed to Truncate, which
takes care of restarting transactions.

This also means we've got an extra call to Truncate: one Truncate
operation more than before. However, we already had one Truncate per
bundle. To fix this, we could combine multiple storage.Iterators into a
chaining iterator and do only one Truncate operation, but that's a future
optimization.

Signed-off-by: Stephan Renatus <stephan.renatus@gmail.com>
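
The "chaining iterator" mentioned above as a future optimization could look roughly like this -- the Update/Iterator shapes below are assumptions modelled on OPA's storage.Iterator (Next signalling exhaustion with io.EOF), not copied verbatim from the codebase:

package chainsketch

import "io"

// Update mirrors the shape of a single write/delete handed to Truncate.
type Update struct {
	Path   string
	Remove bool
	Value  []byte
}

// Iterator yields updates until it returns io.EOF.
type Iterator interface {
	Next() (*Update, error)
}

// chained drains each underlying iterator in turn, so one Truncate call
// could consume the deletions for every bundle in a single pass.
type chained struct {
	its []Iterator
}

func Chain(its ...Iterator) Iterator { return &chained{its: its} }

func (c *chained) Next() (*Update, error) {
	for len(c.its) > 0 {
		u, err := c.its[0].Next()
		if err == io.EOF {
			c.its = c.its[1:] // current iterator exhausted, move to the next
			continue
		}
		return u, err
	}
	return nil, io.EOF
}
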
@srenatus srenatus force-pushed the sr/storage/disk-bundles-issue-4868 branch from d6150b4 to 4577fad on July 12, 2022 09:38
@srenatus

💡 tests fail because the logic is wrong: calling Truncate where it's now called will lose the transaction previously written to -- i.e., the delta bundles' patches are applied, but never written to newDB. They sit in an open txn for oldDB, but that doesn't matter once truncate switches DBs.

I'll redo those bits. I think I'm at the point where I'm this close to no longer understanding how the current setup works 😅

@srenatus

TBC

@srenatus srenatus closed this Jul 18, 2022