Remove `NtExpr`. #96724

nnethercote · 2022-05-05T06:07:10Z

By using invisible delimiters.

nnethercote · 2022-05-05T06:18:17Z

I was playing around with this just to see what happened. #96627 showed that avoiding Interpolated for NtIdent compiled ok but caused lots of error message to degrade. Here I've done much the same for NtTy and the results are intriguing. There were eight ui test failures to start with.

    [ui] src/test/ui/did_you_mean/bad-assoc-expr.rs
    [ui] src/test/ui/did_you_mean/bad-assoc-pat.rs
    [ui] src/test/ui/did_you_mean/bad-assoc-ty.rs
    [ui] src/test/ui/parser/macro/issue-37113.rs

For these four, after the change there is one less error than before. And I think the new behaviour is better. I can't see why the relevant lines were errors. Perhaps the parser has one or more places where an Interpolated should have been handled.

    [ui] src/test/ui/macros/stringify.rs
    [ui] src/test/ui/proc-macro/capture-unglued-token.rs

For these ones a lot of the stringified outputs have changed. A lot are worse, with worse spacing. But a few are arguably better. There's even a FIXME that this change seemingly fixes.

    [ui] src/test/ui/parser/macro/trait-object-macro-matcher.rs
    [ui] src/test/ui/tuple/tuple-struct-fields/test2.rs

These are ones where the error messages have changed a little for the worse.

After the ui tests are made to pass, other tests work fine up to the debuginfo tests. After that the rand-0.8.5 crate is compiled, and there's a new compiler error. Here's a reduced test case showing the problem:

// Current rustc and new version both reject this like so:
//
// error: `<` is interpreted as a start of generic arguments for `u32`, not a shift
//   --> f.rs:15:14
//    |
// 15 |     1 as u32 << 24
//    |              ^^ -- interpreted as generic arguments
//    |              |
//    |              not interpreted as shift
//    |
// help: try shifting the cast value
//    |
// 15 |     (1 as u32) << 24
//    |     +        +

fn sample2() -> u32 {
    1 as u32 << 24
}

// Current rustc accepts this, new version doesn't.
macro_rules! f {
    ($u_scalar:ty) => {
        fn sample1() -> u32 {
            1 as $u_scalar << 24
        }
    }
}

f! { u32 }

fn main() {
    let _ = sample1() + sample2();
}

Again, I think the new behaviour is reasonable.

This doesn't seem possible to land, but it does suggest that the current use of Interpolated might be resulting in some bugs.

cc @petrochenkov

src/test/ui/did_you_mean/bad-assoc-expr.rs

src/test/ui/did_you_mean/bad-assoc-pat.rs

src/test/ui/did_you_mean/bad-assoc-ty.rs

src/test/ui/macros/stringify.rs

src/test/ui/parser/macro/issue-37113.rs

src/test/ui/parser/macro/trait-object-macro-matcher.rs

src/test/ui/proc-macro/capture-unglued-token.stdout

src/test/ui/tuple/tuple-struct-fields/test2.stderr

petrochenkov · 2022-05-05T12:00:59Z

The 1 as $u_scalar << precision breakage may also be fixed by not dropping Invisible delimiters.

If we assume that $u_scalar is e.g. u64 (I didn't check), then u64 would be an identifier, so we'd continue parsing < as a start of generic arguments after it (*), but «u64» wouldn't be an identifier, so the parsing shouldn't proceed to < in the same way.

(*) The grammar is IDENT < GENERIC_ARGS >.

petrochenkov · 2022-05-05T12:02:59Z

Conclusion: most of the changes in tests will disappear if Invisible delimiters are not dropped by the parser.

petrochenkov · 2022-05-05T12:05:14Z

For quick testing you can replace the invisibles with regular parentheses () in this case because (TYPE) is also a type and the effect should be very close.

nnethercote · 2022-05-06T06:22:21Z

I have updated. As well as not producing Interpolated tokens for NtTy, the parser now also does not drop invisible delimiters, and there is more handling of invisible delimiters. These parser changes seem to mostly work, though I'm not confident I have all the details exactly right.

On one hand, it's encouraging that I've managed to get it down to only four ui test failures. On the other hand, there are many non-terminal kinds remaining. And this doesn't seem like something that can be done incrementally... all the extra handling of invisible delimiters needs to be in place once we stop dropping them. It seems hard to make such a big change all at once without causing lots of subtle breakage .

src/test/ui/attributes/key-value-expansion.stderr

petrochenkov · 2022-05-06T15:12:57Z

@nnethercote
What prevents using the strategy of preserving Invisibles conditionally, like

fn bump(&self, preserve_invisible: bool) { ... }

?

I'm not ready to take a responsibility for reviewing a change that does all of this in one go.
I think we should convert all macro_rules matchers in separate steps, starting from ty or expr or pat where invisibles are equivalent to parentheses, then wait for regressions or run crater, then continue with other matchers.

Every such step may potentially require a language team involvement.
Every such step may also cause breakage requiring additional flags in Delimiter::Invisible to mitigate (due to things like self.look_ahead(1, |t| matches!(t.kind, NtTy(..)) or matches!(self.prev_token.kind, NtTy(..))).
It's going to be a process taking some time.

nnethercote · 2022-05-09T06:01:02Z

@nnethercote What prevents using the strategy of preserving Invisibles conditionally, like
fn bump(&self, preserve_invisible: bool) { ... }
?

I understand the motivation here. But we always get one token in advance. When we're at a place where we want to allow an invisible delimiter, e.g. in parse_bottom_expr, we've already ignored it in the previous bump call. At that point we really want to say "reget the last token, allowing invisible delimiters this time".

petrochenkov · 2022-05-09T10:46:57Z

I need to think how to better stage the implementation here.
Maybe the invisible delimiters can be conditionally removed at some earlier stage, for example in fn parse_ast_fragment immediately after leaving the proc macro.

petrochenkov · 2022-05-10T00:11:30Z

So, to split the implementation into multiple part we need bump to preserve Delimiter::Invisibles only if they come from macro_rules ty (NtTy), and skip all other invisible delimiters.

To do that we should be able to simply keep an extra flag in Delimiter::Invisible indicating that the delimiter must not be skipped.
proc_macro::Delimiter::None doesn't need such a flag, because this flag will be preserved inside rustc, but will be "laundered" if passed through a proc macro, exactly how NtTys disappear when passed through proc macros.

Once all nonterminal tokens are migrated we'll be able to switch this flag to "never skip" and process invisible delimiters from proc macros correctly too.

nnethercote · 2022-05-10T07:25:39Z

Good suggestion. I have added the flag to Delimiter::Invisible, and switched from removing NtTy to removing NtExpr. I have uploaded the code in this PR. There are currently 23 test failures, in three main categories:

Ones where the error message is slightly worse because of minor span differences.
Proc macro ones where I haven't yet updated the PRINT-BANG output, which will be easily fixed.
More serious cases where things don't compile that should.

nnethercote · 2022-05-10T07:27:15Z

The current test failures:

    [ui] src/test/ui/borrowck/issue-25793.rs
    [ui] src/test/ui/conditional-compilation/cfg-attr-syntax-validation.rs
    [ui] src/test/ui/inference/deref-suggestion.rs
    [ui] src/test/ui/issues/issue-26094.rs
    [ui] src/test/ui/issues/issue-26237.rs
    [ui] src/test/ui/macros/issue-29084.rs
    [ui] src/test/ui/mismatched_types/issue-26480.rs
    [ui] src/test/ui/parser/extern-abi-from-mac-literal-frag.rs
    [ui] src/test/ui/parser/float-field-interpolated.rs
    [ui] src/test/ui/parser/macro/trait-non-item-macros.rs
    [ui] src/test/ui/proc-macro/capture-macro-rules-invoke.rs
    [ui] src/test/ui/proc-macro/expr-stmt-nonterminal-tokens.rs
    [ui] src/test/ui/proc-macro/issue-75734-pp-paren.rs
    [ui] src/test/ui/proc-macro/macro-rules-derive-cfg.rs
    [ui] src/test/ui/proc-macro/nested-nonterminal-tokens.rs
    [ui] src/test/ui/proc-macro/nodelim-groups.rs
    [ui] src/test/ui/proc-macro/nonterminal-expansion.rs
    [ui] src/test/ui/rfc-2294-if-let-guard/feature-gate.rs
    [ui] src/test/ui/rfc-2361-dbg-macro/dbg-macro-expected-behavior.rs
    [ui] src/test/ui/type/ascription/issue-47666.rs
    [ui] src/test/ui/unsafe/ranged_ints_macro.rs#mir
    [ui] src/test/ui/unsafe/ranged_ints_macro.rs#thir

nnethercote · 2022-05-19T06:26:04Z

Down to 2 ui failures now. src/test/ui/proc-macro/macro-rules-derive-cfg.rs is a hard one. Here's a cut-down version:

macro_rules! produce_it {
    ($expr:expr) => {
        #[derive(Print)]
        struct Foo { 
            val: [bool; $expr]
        }
    }
}

produce_it!({ #![cfg_attr(not(FALSE), allow(unused))] 30 });

where Print is a derive proc macro that prints the input token stream and returns an empty token stream.

Currently I get an ICE:

error: internal compiler error: no errors encountered even though `delay_span_bug` issued

error: internal compiler error: Missing token range for attribute
  --> src/main.rs:43:15
   |
43 | produce_it!({ #![cfg_attr(not(FALSE), allow(unused))] 30 });
   |               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
   |
   = note: delayed at compiler/rustc_parse/src/parser/attr_wrapper.rs:274:22

The output printed by the derive proc macro looks like this:

PRINT-DERIVE INPUT (DISPLAY): struct Foo
{ val : [bool ; /*«*/ #! [allow(unused)] { #! [allow(unused)] 30 } /*»*/] }

as compared to the output from an unmodified rustc:

PRINT-DERIVE INPUT (DISPLAY): struct Foo
 { val : [bool ; { #! [allow(unused)] 30 }] }

I don't understand why the allow(unused) shows up twice, nor what that Missing token range for attribute error means.

petrochenkov · 2022-05-19T12:28:29Z

src/test/ui/proc-macro/macro-rules-derive-cfg.rs is a hard one

Looks like some pre-existing issue exposed, #86055 or something similar.

nnethercote · 2022-05-20T06:31:45Z

Interesting: the change of pretty-printing from "foo::bar" to "foo :: bar" broke some links in doctests. Fortunately it's pretty easy to address, just by adding ModSep to tt_prepend_space.

nnethercote · 2022-05-20T06:47:35Z

I have updated the code, which has a bunch of fixes, including a couple from @petrochenkov's branches. I have disabled two tests and commented out some failing-to-parse assertions in one unit test. Except for those, all tests now pass, so that's some kind of progress. Still plenty of wrinkles to iron out, though, and who knows what a crater run might uncover.

bors · 2022-05-21T09:35:17Z

☔ The latest upstream changes (presumably #97239) made this pull request unmergeable. Please resolve the merge conflicts.

doctest failures: ``` proc_macro/struct.Diagnostic.html:14: broken intra-doc link - [<code>Level :: Error</code>] proc_macro/struct.Diagnostic.html:15: broken intra-doc link - [<code>Level :: Error</code>] proc_macro/struct.Diagnostic.html:16: broken intra-doc link - [<code>Level :: Warning</code>] proc_macro/struct.Diagnostic.html:17: broken intra-doc link - [<code>Level :: Warning</code>] proc_macro/struct.Diagnostic.html:18: broken intra-doc link - [<code>Level :: Note</code>] proc_macro/struct.Diagnostic.html:19: broken intra-doc link - [<code>Level :: Note</code>] proc_macro/struct.Diagnostic.html:20: broken intra-doc link - [<code>Level :: Help</code>] proc_macro/struct.Diagnostic.html:21: broken intra-doc link - [<code>Level :: Help</code>] checked links in: 3.9s number of HTML files scanned: 32308 number of HTML redirects found: 10094 number of links checked: 2259114 number of links ignored due to external: 112863 number of links ignored due to exceptions: 19 number of intra doc links ignored: 25 errors found: 8 found some broken links ``` XXX: caused by changes in pretty-printing: `Level::Error` -> `Level :: Error`.

[*] two ui tests temporarily removed, and one unit test has a couple of failing-to-parse assertions removed. XXX: - Some tests have sub-optimal error messages, I want to improve them - Added a `src` field to `Delimiter::Invisible` which indicates where it came from, partly replicates the Nonterminal kind within `Interpolated`. - parse_meta_item_inner required significant changes, it was assuming that parse_unsuffixed_lit didn't consume any tokens on failure

bors · 2022-06-02T16:38:16Z

☔ The latest upstream changes (presumably #97293) made this pull request unmergeable. Please resolve the merge conflicts.

Dylan-DPC · 2022-12-10T08:50:18Z

@nnethercote what's the update on this?

nnethercote · 2022-12-10T10:27:12Z

I'm not planning to work on it further, because it seems too hard and likely to have major backwards incompatibility problems. I am ok with closing it, if you want.

Dylan-DPC · 2022-12-10T10:29:54Z

Thanks for the update. Closing it then :)

rustbot added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label May 5, 2022

petrochenkov self-assigned this May 5, 2022

petrochenkov added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label May 5, 2022

nnethercote force-pushed the dont-interpolate-NtTy branch from 773bc03 to a778dde Compare May 5, 2022 08:52

This comment has been minimized.

Sign in to view

petrochenkov reviewed May 5, 2022

View reviewed changes

petrochenkov added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 5, 2022

This comment was marked as resolved.

Sign in to view

nnethercote force-pushed the dont-interpolate-NtTy branch 2 times, most recently from 4096efb to 0fa765f Compare May 6, 2022 06:18

This comment has been minimized.

Sign in to view

petrochenkov added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels May 6, 2022

bjorn3 reviewed May 6, 2022

View reviewed changes

src/test/ui/attributes/key-value-expansion.stderr Outdated Show resolved Hide resolved

petrochenkov added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 6, 2022

nnethercote force-pushed the dont-interpolate-NtTy branch from 0fa765f to 6b6f97f Compare May 10, 2022 07:26

This comment has been minimized.

Sign in to view

petrochenkov added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels May 19, 2022

This comment was marked as resolved.

Sign in to view

petrochenkov added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels May 19, 2022

nnethercote force-pushed the dont-interpolate-NtTy branch from f3c1674 to 5f31dad Compare May 20, 2022 06:45

nnethercote changed the title ~~Don't use TokenKind::Interpolate for NtTy.~~ Remove NtExpr. May 20, 2022

nnethercote force-pushed the dont-interpolate-NtTy branch from 5f31dad to 6a9399b Compare May 20, 2022 07:02

This comment has been minimized.

Sign in to view

nnethercote mentioned this pull request May 24, 2022

Improve token stream pretty printing. #97340

Closed

petrochenkov and others added 3 commits May 27, 2022 09:57

builtin_macros: Strip top-level invisible delimiters in stringify!.

cda26a7

nnethercote force-pushed the dont-interpolate-NtTy branch from 6a9399b to 17a1e04 Compare May 26, 2022 23:57

petrochenkov mentioned this pull request Jun 4, 2022

rustc macro: unexpected compilation error when ty designator is used within where clause #97719

Closed

Dylan-DPC closed this Dec 10, 2022

nnethercote mentioned this pull request Aug 9, 2023

Start removing Nonterminal #114647

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove `NtExpr`. #96724

Remove `NtExpr`. #96724

nnethercote commented May 5, 2022 •

edited

nnethercote commented May 5, 2022 •

edited

This comment has been minimized.

petrochenkov commented May 5, 2022

petrochenkov commented May 5, 2022

petrochenkov commented May 5, 2022

This comment was marked as resolved.

nnethercote commented May 6, 2022

This comment has been minimized.

petrochenkov commented May 6, 2022

nnethercote commented May 9, 2022

petrochenkov commented May 9, 2022

petrochenkov commented May 10, 2022

nnethercote commented May 10, 2022

nnethercote commented May 10, 2022

This comment has been minimized.

nnethercote commented May 19, 2022

This comment was marked as resolved.

petrochenkov commented May 19, 2022

nnethercote commented May 20, 2022

nnethercote commented May 20, 2022

This comment has been minimized.

bors commented May 21, 2022

bors commented Jun 2, 2022

Dylan-DPC commented Dec 10, 2022

nnethercote commented Dec 10, 2022

Dylan-DPC commented Dec 10, 2022

Remove NtExpr. #96724

Remove NtExpr. #96724

Conversation

nnethercote commented May 5, 2022 • edited

nnethercote commented May 5, 2022 • edited

This comment has been minimized.

petrochenkov commented May 5, 2022

petrochenkov commented May 5, 2022

petrochenkov commented May 5, 2022

This comment was marked as resolved.

nnethercote commented May 6, 2022

This comment has been minimized.

petrochenkov commented May 6, 2022

nnethercote commented May 9, 2022

petrochenkov commented May 9, 2022

petrochenkov commented May 10, 2022

nnethercote commented May 10, 2022

nnethercote commented May 10, 2022

This comment has been minimized.

nnethercote commented May 19, 2022

This comment was marked as resolved.

petrochenkov commented May 19, 2022

nnethercote commented May 20, 2022

nnethercote commented May 20, 2022

This comment has been minimized.

bors commented May 21, 2022

bors commented Jun 2, 2022

Dylan-DPC commented Dec 10, 2022

nnethercote commented Dec 10, 2022

Dylan-DPC commented Dec 10, 2022

Remove `NtExpr`. #96724

Remove `NtExpr`. #96724

nnethercote commented May 5, 2022 •

edited

nnethercote commented May 5, 2022 •

edited