feat(blob): BlobModule v1 #3390

vgonkivs · 2024-05-10T12:51:58Z

Introducing BlobModulev1 that will support new features:

Automatic retries;
Blob subscriptions;
Non-blocking submission;
Will provide tx hashes for the submitted blobs;
Ability to specify signing account;

With the upcoming features like async blob submission, changes need to be made to the Blob Module API. Currently, the Blob Module relies on height to perform read operations, which is not feasible in the case of async submissions. Therefore, blob.Get should utilize txHash instead of height to identify the blob. Additionally, blob.Get will allow users to subscribe to a particular blob by waiting until the transaction is included in the block.

type BlobModule interface {
	// Submit sends PFB transaction and reports the height it landed on.
	// Allows sending multiple Blobs atomically asynchronously.
	Submit(context.Context, *TxOptions, Blob...) (hash string, error)
	
	// Get wait until tx will be included and retrieves the blob by commitment. 
	Get(context.Context, string, namespace.ID, Commitment) (Blob, error)
	
	// GetAll retrieves all the blobs for given namespaces at the given height. 
	GetAll(context.Context, height uint64, namespace.ID...) ([]Blob, error)

       // GetProof gets the Proof for a blob if it exists at the given height under the namespace. 
       GetProof(context.Context, height uint64, namespace.ID, Commitment) (Proof, error)

	// Included checks whether a blob's given commitment(Merkle subtree root) is included at the given height under a particular namespace.
	// Useful for light clients wishing to check the inclusion of the data.
	Included(context.Context, height uint64, namespace.ID, Commitment, Proof) (bool, error)

      // Subscribe allows subscribing to the whole namespace. Blob Module will check blobs from the requested
     // namespace in every new height that will be sampled.
     Subscribe(context.Context, namespace.ID) (*Subscription, error)
}

type TxOption struct {
    Signer string
    Fee int64
    GasLimit uint64
}

type Subscription struct {
     Blobs chan []*Blob
}

Some open questions are:

do we want to be able to submit blobs in both modes(sync/async);

distractedm1nd · 2024-05-10T13:09:25Z

does it make sense to have GetAll/GetProof accepting txHash instead of height

GetAll should not have TxHash, as its per namespace getting, not per transaction. I can see an argument for Get taking it, but what worries me there is that we are going up a layer of abstraction away from blobs and into transactions. Not going to block on it though

vgonkivs · 2024-05-10T13:09:56Z

GetAll should not have TxHash

Yeah, already removed it

Wondertan · 2024-05-10T13:20:22Z

Lets also add Subscription methods to the whole namespacw

vgonkivs · 2024-05-14T10:47:02Z

Another point to consider, which has been brought up several times by @renaynay and @walldiss and I agree with, is to rework GetAll to use one namespace instead of multiple ones. Managing a single namespace per query becomes much easier and error handling becomes clearer. Currently, it is almost impossible to handle errors that occur as a result of multiple request failures.

// GetAll retrieves all the blobs for given namespaces at the given height. 
	GetAll(context.Context, height uint64, namespace.ID) ([]Blob, error)

walldiss · 2024-05-14T10:52:36Z

I would like to clarify that propagating retrieval results, indicating which namespaces were retrieved successfully and which resulted in errors, is very messy with the existing multi-namespace API. Handling such errors is even worse on the client side, so we should prioritize reworking the API to work with only a single namespace.

liamsi · 2024-05-14T11:17:24Z

Introducing BlobModulev1 that will support new features:

I am not sure where the v1 versioning will be applied but note that I'd like to reserve v1 for semantically versioning the next iteration of the blob API that will also overhaul the current implementation to use grpc instead of json rpc.

Automatic retries;

IMO, it could also be important to allow users some control over retries. If we are not sure what they want, we should also allow users to completely control the retry logic client side then.

Wondertan · 2024-05-14T12:05:01Z

I am not sure where the v1 versioning will be applied but note that I'd like to reserve v1 for semantically versioning the next iteration of the blob API that will also overhaul the current implementation to use grpc instead of json rpc.

Iirc, we discussee grpc coming on top of internal API as separate layer. This internal API versioning, while Canonical can have its own independent

Wondertan · 2024-05-14T12:11:39Z

IMO, it could also be important to allow users some control over retries.

If Submit guarantees tx inclusion with internal retrying, no one would want/need to handle it themselves. Btw, they can always cancel the Submit API call and start a new retry, even with presence of internal retries, but thats suboptimal compared to internal ones

walldiss · 2024-05-17T15:25:05Z

IMO, it could also be important to allow users some control over retries. If we are not sure what they want, we should also allow users to completely control the retry logic client side then

Implicit retries does not communicate the state of retries to user. He might want to do something else after N retries failed. Like try fallback while keep retrying, or introduce backoff. I think implicit retries should be optional and well defined. They also need to be documented, describing retry-able conditions. We can start without restart just to have less and add more if needed.

Wondertan · 2024-05-17T15:49:32Z

He might want to do something else after N retries failed.

I don't see an example of such something else and would be great if someone can provide that.

The point is to make a reliable API method that handles mempool.congestion and account seqeunce issues internally without leaking those details to the client. It's hard for me to imagine a world where every node user has to design their own way of dealing with our mempool impl details.

We can start without restart just to have less and add more if needed.

We already doing that. The current released version of the node has no retries and that's causing issues and introducing the need to add them. With the Signer being integrated we get those retries for free

vgonkivs added area:api Related to celestia-node API area:blob labels May 10, 2024

vgonkivs changed the title ~~BlobModule v1~~ feat(blob): BlobModule v1 May 10, 2024

walldiss mentioned this issue May 14, 2024

misc(blob/service)!: return empty values in case blobs were not found #3223

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(blob): BlobModule v1 #3390

feat(blob): BlobModule v1 #3390

vgonkivs commented May 10, 2024 •

edited

distractedm1nd commented May 10, 2024

vgonkivs commented May 10, 2024

Wondertan commented May 10, 2024

vgonkivs commented May 14, 2024

walldiss commented May 14, 2024

liamsi commented May 14, 2024 •

edited

Wondertan commented May 14, 2024

Wondertan commented May 14, 2024 •

edited

walldiss commented May 17, 2024

Wondertan commented May 17, 2024 •

edited

feat(blob): BlobModule v1 #3390

feat(blob): BlobModule v1 #3390

Comments

vgonkivs commented May 10, 2024 • edited

distractedm1nd commented May 10, 2024

vgonkivs commented May 10, 2024

Wondertan commented May 10, 2024

vgonkivs commented May 14, 2024

walldiss commented May 14, 2024

liamsi commented May 14, 2024 • edited

Wondertan commented May 14, 2024

Wondertan commented May 14, 2024 • edited

walldiss commented May 17, 2024

Wondertan commented May 17, 2024 • edited

vgonkivs commented May 10, 2024 •

edited

liamsi commented May 14, 2024 •

edited

Wondertan commented May 14, 2024 •

edited

Wondertan commented May 17, 2024 •

edited