Skip to content
This repository has been archived by the owner on Sep 2, 2023. It is now read-only.

Transition Path Problems For Tooling #388

Closed
bmeck opened this issue Sep 26, 2019 · 45 comments
Closed

Transition Path Problems For Tooling #388

bmeck opened this issue Sep 26, 2019 · 45 comments

Comments

@bmeck
Copy link
Member

bmeck commented Sep 26, 2019

Just tracking tooling encountering issues with what we have been rolling out, please add more as they are seen for gathering overall impact:

issue resolved summary
eslint/eslint#12319 Always used require() and .js now errors for type: "module"
@guybedford
Copy link
Contributor

.js-based script config files is a really important class of bug - it could be worth having a dedicated issue for this one.

@guybedford
Copy link
Contributor

(this will affect most tools!)

@devsnek
Copy link
Member

devsnek commented Sep 26, 2019

we could also just get rid of type: module, assuming we want people to depend less on package.json magic, not more

@evanplaice
Copy link

BTW, this works flawlessly in 12.0. The newer (ie better) package boundary detection is throwing the error.

Semantics feel off. The consumer (ie CJS) is consuming a module (ie ESM) that exists outside of it's package boundary. Shouldn't the package type constraint be based on the consumer not the consumed?

Anyway, w/ a 1-line patch to eslint giving the config a .cjs extension works flawlessly.


we could also just get rid of type: module, assuming we want people to depend less on package.json magic, not more

This breaks browser interop for pure ESM packages.

Using .mjs for ESM breaks the most tools build to write ES6+. Outside the Node ecosystem .js and ESM are one-and-the-same.

@ljharb
Copy link
Member

ljharb commented Sep 27, 2019

That’s not entirely accurate; for example, in the babel ecosystem, everything is either a script or a module based on config, not file extension. Since nothing but node yet parses JS based on file extension, what .js means everywhere else is “i have no idea which parse goal to use yet”.

@devsnek
Copy link
Member

devsnek commented Sep 27, 2019

I definitely didn't intend to start any arguments about extensions, I was just hoping to get in some ideas about various directions to go with ecosystem feedback. best to think of breaking changes now before we unflag.

@GeoffreyBooth
Copy link
Member

This isn’t really a breaking change for the tool, because the user has opted into the new mode by setting type: module. The tool just doesn’t support that mode yet. Until the tool is updated, the user can simply take out type: module (or stop using that tool).

And while the user waits for the tool to be updated, they can keep using CommonJS or ESM as they do now, via esm or Babel or whatever. I’m not convinced that nodejs/node#29732 is really necessary.

There are going to be lots of tools that won’t fully support ESM mode in Node for quite awhile, beyond just require of .js inside type: module. Until we get loaders completed, code instrumentation or mocks won’t work, for example. There’s going to be a lengthy period where users will have to choose between ESM (whether via type: module or ESM more broadly) and various tools that they may want to use.

@weswigham
Copy link
Contributor

I’m not convinced that nodejs/node#29732 is really necessary.

I agree - the resolver under a type: module modifier is explicitly experimental right now, so this becoming an error isn't a "new break", it's just how it should be in the experimental feature - and any pain points using type: module with tools brings up are likely other shortcomings in our current interop story.

@weswigham
Copy link
Contributor

Is nodejs/node#29732 getting fast tracked because of a resolution at the meeting that I missed? Or do some of the core devs just feel that strongly about it?

@guybedford
Copy link
Contributor

guybedford commented Sep 27, 2019

@weswigham all discussions and decisions are as made publicly there, there have been no side agreements. If you disagree with fast tracking or merging it behind the flag we can certainly reconsider, my primary goal was just to avoid shipping frictions to the ecosystem before we are 100% certain about it, to avoid unnecessary concern / issues for users. I see it as more of a courtesy than anything else.

@guybedford
Copy link
Contributor

(this issue was not discussed at the last meeting, because we were not yet aware of the compatibility case)

@weswigham
Copy link
Contributor

weswigham commented Sep 27, 2019

I just wanted to know if it'd been discussed in the WG at all (and wasn't sure since I'd missed Wednesday's meeting), since it's a slight walk-back of what'd already been merged (and i thought part of the reason for merging it was to see what kinds of issues cropped up).

I definitely asked if it should have been behind a flag when it was first added, and I still do agree enough with it having been unflagged, provided one believes that using type: module in your package.json right now implicitly opts you in to experimental behavior (regardless of node flags). If that's not perceived to be the case, I imagine we're going to have some degree of trouble shipping type: module at all, I'd wager.

@guybedford
Copy link
Contributor

@weswigham as I say it is as a courtesy - we want to encourage experimentation of type module, and we don't want to cause unnecessary frictions in doing that, unless we are 100% it is those frictions we want to ship. So it's not so much about whether it is breaking as whether this is the story we want users to be dealing with. Personally I think it is still debatable whether this should be an error or a warning or similar.

@weswigham
Copy link
Contributor

weswigham commented Sep 28, 2019

I was under the impression that this was expected - when you have type: module set, every .js file in scope is a module, and that includes config files. I, at least, was under no other illusions. Tools don't currently load module-based config files, so, y'know, those tools aren't going to work yet. There are no APIs for them to work yet, really. There are certainly some things we could do to make the story better (like require of esm just working, rather than erroring), or failing that TLA and dynamic import alongside package.json probing should allow those tools to support module-based configuration (with async APIs T.T).

@guybedford
Copy link
Contributor

The major constraint for tools is that they often have synchronous config loading internals (I know at least Babel does /cc @loganfsmyth), so that using import() and falling back to require is not such an easy option or non-breaking change for the tool itself.

But yes if we can accept these breaks for tools and we are all 100% sure this is what we want to dictate that tools should have to deal with then that is fine too.

As I say though I don't see why we couldn't still support these cases under a deprecation path as the major part of the hazard that concerned me previously was that the CJS loader was injecting these into the ESM loader such that depending on if you loaded CJS or ESM first you would get CJS or ESM interpretations respectively through the ESM loader itself. If we can at least avoid that hazard, what hazard is left?

@weswigham
Copy link
Contributor

weswigham commented Sep 28, 2019

If we can at least avoid that hazard, what hazard is left?

The hazard is that require("path/to/esm-module") shouldn't ever be parsed as cjs. And require("module-with-type-module/eslint.config.js") is, by all rights, a module.

@weswigham
Copy link
Contributor

weswigham commented Sep 28, 2019

The usability issue is the lack of an API (eg, require itself, for simplicity), to synchronously load a module.

I really hope I can hammer that home to enough people that that's, like, one of the biggest non-bikeshed concerns of node's (current) esm implementation.

@guybedford
Copy link
Contributor

And require("module-with-type-module/eslint.config.js") is, by all rights, a module.

what I'm asking is where is the actual hazard in this case? How does it bite?

@weswigham
Copy link
Contributor

weswigham commented Sep 28, 2019

Literally, this - the same file could be interpreted twice, in two different ways (one of which may or may not error due to syntax), which is exactly what we're trying to avoid, as if the file does anything stateful (eg, set a global, muck with process), that state change will occur twice.

@guybedford
Copy link
Contributor

guybedford commented Sep 28, 2019

So to try flesh out the example - say you have a "type": "module" package on npm (P) and install it such that it has two dependents - one CommonJS and one ES module package that both import from it. The CommonJS package loads a file with no module syntax from P that does a global mutation, and the ES module package loads that same file from P resulting in the global mutation happening twice?

Note this "hazard" is naturally guarded by the following:

  • Any use of require or ES module exports will interrupt this hazard from occurring as one of the dependents will get an error
  • The mutations can only be global mutations since side effects that rely on imports would also throw
  • The bugs are only caused if there are side-effect calls eg to process or console or mutations of globals. But without these there would be no conflict too.

@devsnek
Copy link
Member

devsnek commented Sep 28, 2019

we shouldn't ever have a case where a file will run more than once.

@Jamesernator
Copy link

My feeling is the best solution is still just to allow synchronous require of ESM that strictly does not include a top-level await. The TLA proposal specifically chose to make sure that module graphs that only contain synchronous modules evaluate synchronously.

This means it would be possible to have something like sourceTextModule.evaluateSync() that only works if all modules to be evaluated don't contain [[Async]]: true and hence a sync-require could work fine for modules that don't depend on TLA (or WASM modules) avoiding significant breakage.

@devsnek
Copy link
Member

devsnek commented Sep 28, 2019

I would not like to bifurcate TLA like that, if we support it we support it.

@Jamesernator
Copy link

I would not like to bifurcate TLA like that, if we support it we support it.

The web is already intending to reject TLA in service workers, given that require of ESM would primarily be provided as an escape hatch for existing code depending on require I don't think it's unreasonable to say we provide require of certain ES graphs (e.g. those without TLA) so that existing tools can be easily updated.

@devsnek
Copy link
Member

devsnek commented Sep 29, 2019

I disagree with the web disabling TLA in service workers. if TLA randomly doesn't work in certain contexts why would anyone feel motivated to write code with TLA in it.

@weswigham
Copy link
Contributor

I disagree with the web disabling TLA in service workers. if TLA randomly doesn't work in certain contexts why would anyone feel motivated to write code with TLA in it.

This is super snarky, but bear with me:

I disagree with node disabling esm in require. if eem randomly doesn't work in certain contexts why would anyone feel motivated to write code with esm in it.

Because the convenience of use outweighs the inconvenience of incompatibility, ofc. That's the hope, at least. We've been making similar choices the whole time. I'm not saying I wouldn't prefer uniform TLA support (and Jordan's right, having both TLA and non-TLA modules is pretty much a tooling nightmare - different supported syntax subsets in the same runtime is not going to be fun for explaining to people in an error), but I do get the trade-off. Sort-of. It sounds like the argument is just "TLA in service workers is a footgun since they're often performance sensitive and it's easy to deadlock or wait on unneeded async resources using TLA" - which is all true in other contexts, too. It's again just a matter of weighing that possibly against the convenience of its use, and deciding differently in the SW context. If anything, that w3c feels free to make ecosystem bifrucating choices like this, should be freeing to us, as it certainly means runtimes should be considered as free to determine their own destiny when it comes to module support and use, since there are even compat concerns within the browser.

@devsnek
Copy link
Member

devsnek commented Sep 29, 2019

I disagree with node disabling esm in require.

require could never load esm. nothing was disabled. nothing was lost. esm is still working. I think i've said a lot in the past i am a fan of require(esm), and i still am, but randomly disabling language features is in the "unacceptable" category for me. Why not just disable the instantiation checks that verify that imports are wired correctly and map cjs modules at runtime? Why not just make our own module system that mirrors babel esm instead of native esm? If we randomly change the language for our own purposes we end up bifurcating the ecosystem. I am hopeful TC39 will choose to disallow this when its brought up in a few days.

@weswigham
Copy link
Contributor

require could never load esm. nothing was disabled. nothing was lost

And service workers have never been able to be modules yet, so nothing was lost with no TLA in them.

randomly disabling language features is in the "unacceptable" category for me

I think the line between a language feature and a runtime feature is slim - especially for a module system feature like TLA. In any case, the distinction is lost on most less in-the-weeds users and I do think you're right; inconsistent is inconsistent, but I do think I can also call a spade a spade and say we've made similar choices thus far, to weight technical concerns and imagined future problems over real ecosystem and compatibility issues, and I understand theirs in that context, thus I don't feel quite right criticizing it.

I am hopeful TC39 will choose to disallow this when its brought up in a few days.

I don't think they can. It's not possible. An implementor could just say "we support es2020 modules with TLA and other features in these contexts, and es2016 modules that don't have TLA in these others". If you simply rephrase the bifrucating as essentially two runtimes based on differing spec versions (and SW really are essentially their own runtime), the language spec doesn't get a word in edgewise about it. I don't think tc39 will be able to do a lick about it, other than some committee members saying "I don't like it". The committee could always say "but if you don't support it in all contexts then you're not a real es2020 runtime" (philosophically), but safari is the only es2016, runtime by that metric, as they're the only runtime that implemented tail calls, but you don't typically see people up in arms about that, do you?

@devsnek
Copy link
Member

devsnek commented Sep 29, 2019

And service workers have never been able to be modules yet, so nothing was lost with no TLA in them.

replacing words in a sentence doesn't inherently preserve the logical meaning of the sentence. your argument leans on the fallacy that esm doesn't work without require being able to load it, which is quite obviously false.

An implementor could just say "we support es2020 modules with TLA and other features in these contexts, and es2016 modules that don't have TLA in these others".

implementations do this with TCO, and therefore the ecosystem as a whole doesn't use TCO. It would really be a shame for the same to happen to TLA.
Alternatively, people just bundle all their TLA, bypassing the restriction but keeping the behaviour, and there was no point to disallowing it in the first place.

If node follows web workers, we just make the problem worse because of how large a platform node is. Given that the ecosystem already gets along without TLA, why would anyone choose to use it if part of the web and node don't support it? Or, if they do use it, they can just bundle it and bypass the restriction, at the cost of a worse API surface. There is no winning if we disable it.

@devsnek
Copy link
Member

devsnek commented Sep 29, 2019

This also has the classic issue we have worked hard to avoid where the importer shouldn't impose constraints on the imported.

@weswigham
Copy link
Contributor

This also has the classic issue we have worked hard to avoid where the importer shouldn't impose constraints on the imported.

I suspect we and browsers have different views on this - we want the imported to be opaque, ideally, so backing implementations can be swapped out as easily as possible. Meanwhile, the browser security module is essentially "trust nothing", so they're going down the route of the importer determining how something is meant to be used and applying constraints on what can be done by the imported.

@devsnek
Copy link
Member

devsnek commented Sep 29, 2019

TLA isn't a security issue though, they want to disable it because of performance concerns. If that is the case, perhaps we just shouldn't support TLA at all, since node is used for performance sensitive code.

@ljharb
Copy link
Member

ljharb commented Sep 29, 2019

Which is yet another reason that it just doesn’t make sense to too closely tie browsers to node.

@jkrems
Copy link
Contributor

jkrems commented Sep 29, 2019

importer determining how something is meant to be used and applying constraints on what can be done by the imported.

I don't think that characterization of the browser approach is complete. Generally the browser has chosen to go with the importee determining how it's going to be interpreted (image mime types for <img>, module vs. JSON vs. HTML for import). It's just that for import the browser so far only supported one type of content. I assume that if (!) JSON modules get revised and/or non-Javascript modules get marked in the import statements, node would adopt the same system.

@Jamesernator
Copy link

If we randomly change the language for our own purposes we end up bifurcating the ecosystem.

require is not part of the language Node can do what it likes, this is already a bifurcation that is not inherently compatible with other environments due to node-specific loading requirements.

Also allowing multiple jobs to occur synchronously within another single job seems like a much larger bifurcation given that other hosts simply can't implement that so saying "if we support it we support it." along side the above statement then I see no reconciliation that would allow for require of ESM as by definition any solution will necessarily bifurcate the ecosystem.

@ljharb
Copy link
Member

ljharb commented Sep 30, 2019

@Jamesernator it’s quite compatible, that’s how bundlers work. ESM is syntax, however, and can’t be forged, or otherwise intercepted the way require can.

@Jamesernator
Copy link

@Jamesernator it’s quite compatible, that’s how bundlers work.

Bundlers work with best effort, I have never seen a bundler that works in 100% of cases, dynamic require can easily lead to code that cannot be bundled. Sync XHR can allow dynamic require but it's deprecated for good reason.

However allowing sync require of TLA has no such solution (like sync XHR) as allowing other jobs to run while blocking on require is a direct violation of how jobs are specified.

Ultimately one of these trade-offs has to be made if TLA is supported (and if browsers ship TLA then one of the these trade-offs is guaranteed):

  • Create more incompatibility with web and make bundlers more complex (e.g. sync require of TLA)
  • Break existing tooling (e.g. don't allow sync require of ESM)
  • Or provide limited compatibility (e.g. sync require of just modules with TLA)

My preference is strongly the last one as it allows tools to work, provides a clean upgrade path to ESM when mixing formats and doesn't disallow TLA entirely and its only downside is that sometimes import() is necessary from CJS files.

@devsnek
Copy link
Member

devsnek commented Sep 30, 2019

would you publish code containing TLA knowing that require(esm) can't load it?

@Jamesernator
Copy link

Jamesernator commented Sep 30, 2019

would you publish code containing TLA knowing that require(esm) can't load it?

Yes, for old consumers I won't (e.g. eslint config files) but in general yes.

This is true for any async code, if something takes a sync function I can't just pass it one that returns a Promise and expect their code to magically just work. They would have to update their code to be async to await/.then the promise.

There's a lot of npm code that passes a value and expects a response synchronously. I can't just pass an async function to all of that and expect it to just work. Either I need to make my code synchronous or them make their code asynchronous, they don't just get to magically await.

This is a large point of contention for async/await in general, basically transforming to async generally implies some friction on one side. However I don't think this has significantly hurt the adoption of Promises despite many libraries (and builtins on the web especially) not even offering synchronous variants, hence forcing their consumers to upgrade to Promises.

@devsnek
Copy link
Member

devsnek commented Sep 30, 2019

So you've just walked into this point I keep making:

implementations do this with TCO, and therefore the ecosystem as a whole doesn't use TCO. It would really be a shame for the same to happen to TLA. Alternatively, people just bundle all their TLA, bypassing the restriction but keeping the behaviour, and there was no point to disallowing it in the first place.

This seems clearly harmful to people trying to adopt the feature, so why would we do this?

@littledan
Copy link

I don't think the analogy with TCO and TLA is appropriate. We don't really have consensus in TC39 about what to do with TCO. However, we recently achieved consensus on Stage 3 for TLA. In my personal opinion, TLA should be permitted wherever possible, and it would be unfortunate to grow the list of environments which have these kinds of restrictions. ServiceWorker is already an environment where people find its restrictions annoying compatibility problems (e.g., the lack of localStorage).

@devsnek
Copy link
Member

devsnek commented Sep 30, 2019

@littledan the comparison is a feature not available in all implementations, not about the consensus process. I see using the [[Async]] field to block TLA in the same vein as not implementing IsInTailPosition correctly.

@littledan
Copy link

@devsnek We heard strong implementation concerns about implicit tail calls, whereas implementers agreed for TLA to reach Stage 3. Checking [[Async]] is guided by a separate standards proposal, rather than a disagreement about what the standard is. They are simply different situations.

@devsnek
Copy link
Member

devsnek commented Sep 30, 2019

@littledan I'm trying to say in both cases, feature X is not available in certain places because of a conscious decision for it to not be there.

@MylesBorins
Copy link
Member

I'd like to suggest that we take a step back and get this thread on track. If folks want to discuss require('esm') perhaps you could move convo to nodejs/node#49450 or open a new issues. While this specific feature could solve this issue we do not have consensus on how it could be approached and I have still not seen a way for it to be implemented in a non semver-major way which precludes it from landing in 12.x if we are able to unflag in that version.

I do think it is a worthwhile goal and something that we should be exploring for future versions of Node.js as an iterative enhancement

I think the discussion regarding W3C decisions around Service Workers and Top-Level await is out of scope for this discussion. Perhaps it will play a role in the discussion of require('esm') if so it should take place in discussions specific to that proposal.

It seems to me like perhaps we should close this thread and start a new one... or at least hide comments that are out of scope because it is honestly hard to figure out what types of solutions we are hoping to reach based on the discussion here.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
No open projects
Custom Loading
Outstanding Needs
Development

No branches or pull requests