uuid v7 #681

pmccarren · 2023-01-22T00:49:50Z

UUID V7

This PR implements UUID V7, ~~closing~~ ref #580

V7 is implemented, tested and documented and ready for review! No changes to the other versions were made.

Relevant RFC documents:

Benchmarks

V7 appears as fast as V4 when not using native crypto.randomUUID generation.

Results:

Starting. Tests take ~1 minute to run ...
uuid.stringify() x 1,450,855 ops/sec ±2.12% (88 runs sampled)
uuid.parse() x 2,350,677 ops/sec ±1.19% (88 runs sampled)
---

uuid.v1() x 3,585,680 ops/sec ±0.86% (92 runs sampled)
uuid.v1() fill existing array x 9,229,031 ops/sec ±0.50% (93 runs sampled)
uuid.v4() x 13,834,376 ops/sec ±3.34% (84 runs sampled)
uuid.v4() fill existing array x 3,833,619 ops/sec ±1.51% (84 runs sampled)
uuid.v4() without native generation x 2,395,572 ops/sec ±1.27% (91 runs sampled)
uuid.v3() x 268,797 ops/sec ±1.32% (86 runs sampled)
uuid.v5() x 287,899 ops/sec ±1.12% (92 runs sampled)
uuid.v7() x 2,764,344 ops/sec ±0.95% (93 runs sampled)
uuid.v7() fill existing array x 2,801,443 ops/sec ±5.48% (84 runs sampled)
uuid.v7() with defined time x 3,223,107 ops/sec ±1.14% (89 runs sampled)
Fastest is uuid.v4()

TODO:

preserve monotonicity
Add V7 type definitions to DefinitelyTyped

note, husky pre-commit hook was bypassed for this commit

broofa

Thanks for the contribution @pmccarren ccarren. This looks promising!

In addition to the inline comments, can you add a unit test alongside https://github.com/uuidjs/uuid/blob/main/test/unit/v1.test.js, with similar assertions. Specifically, I'd like to see similar tests for sort order, uniqueness, and options handling. (And anything else you think may be relevant.)

The sorting will be of particular interest, since producing UUIDS that sort the same whether sorted by timestamp or lexicographically was pretty much the whole point of version 7. 😄

Note: This should not close #580, as the new RFC adds v6 and v8 formats, too, as well as the MAX_UUID constant. v7 is definitely the majority of what's needed, but we'll want to keep 580 open until we fill in the remaining items.

Lastly, the new RFC is still making it's way through the review process. I don't expect substantive changes at this point, so this is an appropriate time to be adding this. But as I noted in 580, I think it's prudent to go with an experimental version release tag of some sort.

@ctavan: Thoughts on how best to accept this? Should we create a v9-experimental branch that we merge this into rather than master? Then publish experimental releases from that?

README.md

src/regex.js

src/uuid-bin.js

src/v7.js

pmccarren · 2023-01-22T23:02:26Z

@broofa - glad to lend a hand! I appreciate your time reviewing the implementation.

This PR has been updated with your suggested changes, most notably:

reworked buffer management so rnds is not mutated
added v7 unit test coverage
fixed README generation (via npm run docs)

In regards to sorting, I've been considering a few different approaches and while not presently implemented in this PR, I absolutely agree it should be :) I'll work on implementing this as well the associated unit tests.

src/regex.js

src/v7.js

LinusU

Code looks good 👍

Lastly, the new RFC is still making it's way through the review process.

Thoughts on how best to accept this? Should we create a v9-experimental branch that we merge this into rather than master? Then publish experimental releases from that?

Personally think that we should keep this PR open until the spec has been ratified. If we merge into a v9 branch that would stop us from making a v9 release until that has happened, and that might potentially be a long time?

I think that it would be nice to backport the "version 2 is valid" bugfix right away to the current release though?

ctavan · 2023-01-25T07:44:06Z

Thanks an lot for the contribution!

I initially thought this new feature could land in the v9 release, but given we’re broadening the validation regex it’s a breaking change and we’ll have to bump the major version.

I think it would be nice to somehow release it as a prerelease so that people have a chance to try it and provide feedback, but I think our current release and branching process is not yet ready for multiple active major releases. If anyone’s willing to take a look at this I’m happy to support (we were considering better automation for this for a while… #636).

I agree however, that we should not release this in a public major release until the draft has been accepted.

src/regex.js

src/v7.js

broofa · 2023-01-25T23:52:09Z

Note: I've just created the rfc4122bis branch off main as a place to land this (and set it as the target for this PR). As @ctavan says, we haven't worked through the deploy & publish pipeline for non-main branches yet, so we'll have to figure that out.

Wish I could give you an ETA for that, but it's not been a priority for either of us. As he says... "Help Wanted".

Co-authored-by: Linus Unnebäck <linus@folkdatorn.se>

broofa

@ctavan @LinusU Any other comments? If not, I'd like to get this merged (pending @pmccarren commiting his suggested change for the regex to allow 1-8).

Regarding the plan for the rfc4122bis branch, I've created a project to capture the remaining work, with brief notes on possible implementation details.

Honestly, there's not all that much left... if anyone is feeling inspired. ;-)

Lastly, thoughts on inviting Patrick to join the UUIDJS org? He's done some solid work here that's much appreciated, and seappears to have good street cred. If people could weigh in on that idea (including you, Patrick), we can move forward or not as appropriate. [cc'ing @TrySound here]

pmccarren · 2023-02-04T18:55:59Z

@broofa just merged the 1-8 regex update. I'm close to done with implementing monotonic generation - I'd prefer to merge once that's wrapped. Will be buttoned up in a day or two.

As for UUIDJS Org, I'll let my work speak for itself but will add that if you'll have me I'll be an active participant :)

pmccarren · 2023-02-06T05:41:34Z

@broofa I just finished adding monotonic support, and it's ready for review! Took a bit of finesse but I'm rather happy with how it turned out. In addition to the monotonicity related unit tests, I manually tested sorting preservation during generation of 100M uuids.

couple of sequence counter implementation notes:

when additional generations occur inside the same millisecond, we use a dedicated 31 bits as an incrementing sequence counter. Referred to as Fixed-Length Dedicated Counter Bits (Method 1) in the draft RFC.
seq is 31 bits and (re)initialized from the random data pool whenever the clock advances/changes. as the seq is initialized randomly, for nominal usage there is a significant amount (74 bits) of randomness in a given v7 uuid.
when the sequence counter rolls over we increment the internal date by 1 millisecond and continue. this accepts the tradeoff of minor clock drift for lexicographical sorting in substantial batch id generation workloads.
if the internal date ever exceeds 10 seconds beyond system time, both the date and seq are hard reset.
lexicographical sorting is preserved up to (2^31)*10000 generations for a provided millisecond.
the seq is stored as 31 bits. the 12 high bits and 19 low bits are stored separately in the uuid to preserve sorting while maintaining a large enough counter size

Pending review, I believe this PR is ready to go now!

pmccarren · 2023-02-18T17:11:17Z

CI failures tracked in #688 (actions/setup-node npm 9)

robinpokorny · 2023-04-26T09:40:03Z

Hey, @pmccarren, I had a quick look and have some questions. If you have time, could you look at them?

Should there be more information that the code includes monotonic guarantee beyond the timestamp? While I think it's a good default, it seems fair to highlight it.
Should there be an option to opt-out of that? That is, make the seq random for each call even in the same millisecond. (In this case, I was thinking about updating a v4 UUID with timestamp and version, while potentially reusing native v4 generation.)
Does the fact that we always increase the seq by one, even for the first call within a millisecond, have an impact on the randomness?
Why did we choose 10 second? That seems like a lot.
What about mixed use of user providing msecs or not (as this library is used a lot, it can happen quite easily)? It seems it would break the monotonic guarantee. Is that a concern?

wiperawa · 2023-05-18T13:57:32Z

Just wondering when approx. this PR will be merged?

broofa · 2023-05-18T16:55:25Z

@wiperawa No specific ETA. I just need to find the time to review and work through deploy process for an experimental branch.

FWIW, we (CodePen) actually have a need for this as well, so I do have a bit of incentive to make this happen. I just need to find the time to sit down and do a proper review.

(And continued apologies to @pmccarren for dragging our feet on this. This is still important work that we intend to merge.)

rdrpenguin04 · 2023-06-11T18:40:24Z

One of my teams is also waiting on UUID v7 for their project; we'd also greatly appreciate if this were reviewed, especially since it already has one approval.

ghost · 2023-07-31T23:45:10Z

Is this PR ever going to get merged? it's disappointing to see that the author has done their great part since Feb, but this PR is still hanging.

claytongulick · 2023-08-09T17:18:41Z

Eagerly awaiting this to be released as well.

Thanks for the great library and work on it!

DevBrent · 2023-10-02T15:59:37Z

I'll play ChatGPT here and summarize the current status:

UUID org has concerns of the sequential "+ 1" nature impacting randomness of UUIDs generated within the same millisecond. My take on this is that this is a legitimate concern because you can theoretically spam an endpoint to get "seed" UUIDs then offset by +1, +2 to dig up other generations on the same millisecond. Despite it being a legitimate concern, many other libraries implement similar logic including https://github.com/mongodb/js-bson in my experience. uuid v7 #681 (comment) suggests a possible opt-out to enable truly random UUIDs within a given millisecond. I would personally make use of that option despite the lack of guarantee about sequential sort within the same millisecond.
@pmccarren has implemented logic to push a UUID into the next millisecond up to an arbitrary 10 seconds before resetting the date (and I imagine the random seed) in order to guarantee the ability to generate (2^31)*10000 of sequential UUIDs in a given millisecond. This was one of the more significant things holding back this merge I gather because it seems there should be a better way to handle this.
@robinpokorny mentioned a potential failure case when date inputs are provided from databases or for instance date pickers without milliseconds included which would be fairly common for such a widely used library like uuid. I'm not sure what a good solution for this would be.

My take would be UUIDs become less useful if you're going to alter the date input and don't strictly stick to modifying the sequence when sequencing. The primary driving factor of this is that a randomly initiated sequence could initialize 1 count below the max sequence and leaving padding atop the sequence reduces the entropy of the sequence.

This Hacker News thread has more discussion on the ULID spec which UUIDv7 was largely based on: https://news.ycombinator.com/item?id=36447837

The spec as written on that page is confusing on that point, but the incrementing-counter-within-the-same-millisecond-behavior only happens if you explicitly specify a "monotonicFactory", https://github.com/ulid/javascript#monotonic-ulids. The default behavior (just using the ulid() function) doesn't do that, it generates a completely random value regardless of the millisecond value.

If it's random value, then generated UUIDs won't always be ascending which defeats purpose as well.

No, that's not really it either. ULIDs have both a time component (in this case accurate to the millisecond) and a random component. Thus the order is always accurate up to the ms, and for the vast majority of applications anything created in the same millisecond can be considered created "at the same time", so it's usually OK if order is undetermined within the same MS.
Note that the UUID v7 spec is largely modeled after the ULID spec. ULIDs came first, and they traded the "standard" UUID format of 8x-4x-4x-4x-12x hex string for the more compact Crockford base32 format, and there are some other minor differences in number of timestamp bits vs. number of random bits, but they are otherwise functionally equivalent.

This discussion seems to imply other implementations ALWAYS re-seed the random bits and increment the counter.

I'm not really sure if I would prioritize sequential over randomness and perhaps this does need to be a preference. Generally, those seeking UUIDv7 for randomness security through obscurity/incalculability should be using another UUID.

Edit 1: Multi-node behavior should be considered as well as single-node sequence generator node behavior because I believe UUIDv7 in this library could be used in both forms. Each has slightly different preferences. For multi-node, all of this discussion goes out the window and it's best to not modify the msecs at all. For single-node, I think someone who prioritizes sort order over all else may want this unusual behavior.

Edit 2: Within each node of a multi-node cluster, I may wish to prefer sequential sort and it might be acceptable to eat into the MS up to 1-3 milliseconds for my usecases but I still feel uncomfortable. I'd rather OPT IN to sacrifice 10% of my sequence entropy than have this behavior though.

Edit 3: Percona notes UUID_SHORT from MySQL allows the sequence to rollover.

ihmpavel · 2023-10-27T11:51:20Z

README.md

-**Upgrading from `uuid@3`?** Your code is probably okay, but check out [Upgrading From `uuid@3`](#upgrading-from-uuid3) for details.
+> **Note** Upgrading from `uuid@3`? Your code is probably okay, but check out [Upgrading From `uuid@3`](#upgrading-from-uuid3) for details.
+
+> **Note** Only interested in creating a version 4 UUID? You might be able to use [`cypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.


Looks like a small typo here

Suggested change

> **Note** Only interested in creating a version 4 UUID? You might be able to use [`cypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.

> **Note** Only interested in creating a version 4 UUID? You might be able to use [`crypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.

ihmpavel · 2023-10-27T11:52:07Z

README_js.md

-**Upgrading from `uuid@3`?** Your code is probably okay, but check out [Upgrading From `uuid@3`](#upgrading-from-uuid3) for details.
+> **Note** Upgrading from `uuid@3`? Your code is probably okay, but check out [Upgrading From `uuid@3`](#upgrading-from-uuid3) for details.
+
+> **Note** Only interested in creating a version 4 UUID? You might be able to use [`cypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.


Another small typo here

Suggested change

> **Note** Only interested in creating a version 4 UUID? You might be able to use [`cypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.

> **Note** Only interested in creating a version 4 UUID? You might be able to use [`crypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.

mbrimmer83 · 2024-05-10T09:02:08Z

Rumor has it uuid v7 is now part of the proposed standard What would it take to get this over the finish line?

pmccarren added 4 commits January 21, 2023 19:36

feat: implement uuid7 (uuidjs#580)

0b8f680

fix: add v7.js to .local (uuidjs#580)

a112083

note, husky pre-commit hook was bypassed for this commit

fix: add v7 to uuid-bin

5a89a92

chore: fix readme anchor

92bc63a

broofa reviewed Jan 22, 2023

View reviewed changes

README.md Outdated Show resolved Hide resolved

README.md Show resolved Hide resolved

src/regex.js Outdated Show resolved Hide resolved

src/uuid-bin.js Outdated Show resolved Hide resolved

src/v7.js Outdated Show resolved Hide resolved

pmccarren added 5 commits January 22, 2023 12:48

chore: use generated readme, remove timestamp arg from uuid-bin v7

ba6d9cc

fix: typo in uuid regex, add negative test cases

3a201c3

fix: do not mutate provided rnds, add v7 unit tests

b710c41

fix: validation test should not pass version 0

1a61942

chore: update package.json description

bf11212

LinusU reviewed Jan 25, 2023

View reviewed changes

src/regex.js Outdated Show resolved Hide resolved

LinusU reviewed Jan 25, 2023

View reviewed changes

src/v7.js Outdated Show resolved Hide resolved

LinusU reviewed Jan 25, 2023

View reviewed changes

broofa requested changes Jan 25, 2023

View reviewed changes

src/regex.js Outdated Show resolved Hide resolved

src/v7.js Outdated Show resolved Hide resolved

broofa changed the base branch from main to rfc4122bis January 25, 2023 23:46

Update src/v7.js

a17479a

Co-authored-by: Linus Unnebäck <linus@folkdatorn.se>

broofa mentioned this pull request Feb 4, 2023

Validate new versions (6-8) #683

Open

broofa added the bis Issues related to RFC4122bis specification label Feb 4, 2023

broofa approved these changes Feb 4, 2023

View reviewed changes

include uuid v6 and v8 in validation regex

ced5fdc

pmccarren added 4 commits February 4, 2023 13:25

chore: add test:matching script to package.json

7264c2c

fix: v7 monotonicity and lexicographical sorting

2263096

refactor: v7 seq reinitialization

4022ec2

chore: update v7 README

1d5c88a

pmccarren requested a review from broofa February 9, 2023 23:19

jonkoops and others added 4 commits February 12, 2023 07:39

docs: add note about cypto.randomUUID() (uuidjs#686)

aef75d7

Merge branch 'uuidjs:main' into uuid7

2626fe2

chore: fix README_js.md prettier

3cb9acd

chore: render README.md

7f1ceaf

broofa mentioned this pull request Sep 30, 2023

Sponsored issue: Incremental UUID #737

Closed

DevBrent mentioned this pull request Oct 2, 2023

Support new IETF UUID formats #580

Open

ihmpavel reviewed Oct 27, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

uuid v7 #681

uuid v7 #681

pmccarren commented Jan 22, 2023 •

edited

broofa left a comment •

edited

pmccarren commented Jan 22, 2023

LinusU left a comment

ctavan commented Jan 25, 2023

broofa commented Jan 25, 2023 •

edited

broofa left a comment •

edited

pmccarren commented Feb 4, 2023

pmccarren commented Feb 6, 2023

pmccarren commented Feb 18, 2023

robinpokorny commented Apr 26, 2023

wiperawa commented May 18, 2023

broofa commented May 18, 2023

rdrpenguin04 commented Jun 11, 2023

ghost commented Jul 31, 2023

claytongulick commented Aug 9, 2023

DevBrent commented Oct 2, 2023 •

edited

ihmpavel Oct 27, 2023

ihmpavel Oct 27, 2023

mbrimmer83 commented May 10, 2024

	> Note Only interested in creating a version 4 UUID? You might be able to use [`cypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.
	> Note Only interested in creating a version 4 UUID? You might be able to use [`crypto.randomUUID()`](https://developer.mozilla.org/en-US/docs/Web/API/Crypto/randomUUID), eliminating the need to install this library.

uuid v7 #681

Are you sure you want to change the base?

uuid v7 #681

Conversation

pmccarren commented Jan 22, 2023 • edited

UUID V7

Benchmarks

broofa left a comment • edited

Choose a reason for hiding this comment

pmccarren commented Jan 22, 2023

LinusU left a comment

Choose a reason for hiding this comment

ctavan commented Jan 25, 2023

broofa commented Jan 25, 2023 • edited

broofa left a comment • edited

Choose a reason for hiding this comment

pmccarren commented Feb 4, 2023

pmccarren commented Feb 6, 2023

pmccarren commented Feb 18, 2023

robinpokorny commented Apr 26, 2023

wiperawa commented May 18, 2023

broofa commented May 18, 2023

rdrpenguin04 commented Jun 11, 2023

ghost commented Jul 31, 2023

claytongulick commented Aug 9, 2023

DevBrent commented Oct 2, 2023 • edited

ihmpavel Oct 27, 2023

Choose a reason for hiding this comment

ihmpavel Oct 27, 2023

Choose a reason for hiding this comment

mbrimmer83 commented May 10, 2024

pmccarren commented Jan 22, 2023 •

edited

broofa left a comment •

edited

broofa commented Jan 25, 2023 •

edited

broofa left a comment •

edited

DevBrent commented Oct 2, 2023 •

edited