Add string-options for ignoring newline differences #2496

vbreuss · 2023-12-04T10:06:51Z

Background and motivation

Remaining issue from #2364
(see this comment for more details)

This should also fix #1247

API Proposal

Add the following options to the EquivalencyAssertionOptions<T> for strings

public EquivalencyAssertionOptions<T> IgnoringAllNewlines()
{
    // This will remove all newlines from actual and expected before comparison
    return this;
}
public EquivalencyAssertionOptions<T> IgnoringNewlineStyle()
{
    // This will replace "\r\n" with "\n" in actual and expected before comparison
    return this;
}

API Usage

subject.Should().BeEquivalentTo(expected, o => o.IgnoringNewlineStyle());

Alternative Designs

No response

Risks

No response

Are you willing to help with a proof-of-concept (as PR in that or a separate repo) first and as pull-request later on?

Yes, please assign this issue to me.

The text was updated successfully, but these errors were encountered:

dennisdoomen · 2024-01-03T07:58:04Z

I'm fine with this proposal. If @jnyrup is too, we can mark it as approved.

jnyrup · 2024-02-01T14:53:45Z

Wondering whether enabling IgnoringAllNewlines means that "cashew nuts" and cash\r\new nuts are considered equivalent.

In implementation details, whether "\n" and "\r\n" are replaced with:

"" or
" "

My first thought is that IgnoringAllNewlines would consider "cashew\r\nnuts" and "cashew nuts" to be equivalent.

(This can probably be considered my litmus test for this feature)

vbreuss · 2024-02-01T15:04:58Z

Yes, when IgnoringAllNewlines is set, "cash\r\new nuts" and "cashew nuts" would be equivalent, but "cashew\r\nnuts" and "cashew nuts" not, as the first would not have any blank between the words.

bart-vmware · 2024-02-01T15:35:05Z

Wondering whether enabling IgnoringAllNewlines means that "cashew nuts" and cash\r\new nuts are considered equivalent.

I agree. If there's such a need, probably introduce IgnoreLeadingAndTrailingWhitespace for that.

dennisdoomen · 2024-02-01T17:08:59Z

Yes, when IgnoringAllNewlines is set, "cash\r\new nuts" and "cashew nuts" would be equivalent, but "cashew\r\nnuts" and "cashew nuts" not, as the first would not have any blank between the words.

That's also what I would expect

jnyrup · 2024-03-23T17:20:28Z

I don't think we have come to a conclusion yet. (I apologize for the long delay from my side)

I've moved the proposal IgnoringNewlineStyle to #2612 to separate the discussion and eventual approval of the two proposed APIs.

Back to IgnoringAllNewlines.

If the intended implementation of IgnoringAllNewlines is "remove all newlines and then do simple equality", then e.g. "every\r\nday" and everyday would be considered equivalent even though "every day" and "everyday" are different language constructs and not interchangeable.

To my knowledge in regular prose line breaking occurs instead of a space, i.e. "some value" is broken into "some\r\nvalue" to fit within the margins.
To obtain a justified right margin words may be split up using a hyphen+newline, i.e "some longer sentence" is broken into "some long-\r\ner sentence".

I can't off the top of my head come up with a realistic case, where newlines don't replace something.
I'm not saying the case doesn't exists, just that I haven't found it.

We shouldn't try to guess whether a hyphen was inserted manually or by the line-breaking algorithm.
I.e. we shouldn't consider "e\r\nmail" and "e-\r\nmail" as equivalent.

I hope this more clearly expresses why I don't think "remove all newlines and then do simple equality" is the way to go.

dennisdoomen · 2024-03-24T13:37:16Z

Good point. Now I'm wondering what this was supposed to fix in the first place.

vbreuss · 2024-03-24T15:40:47Z

My original idea for this feature came from this comment 😄

I agree, that completely ignoring newlines might have some unintended consequences. For the use cases, that I see, the IgnoringNewlineStyle would be sufficient. I will adapt #2565 to only implement the IgnoringNewlineStyle option and close this issue.

vbreuss added the api-suggestion Early API idea and discussion, it is NOT ready for implementation label Dec 4, 2023

vbreuss mentioned this issue Dec 4, 2023

New string-specific options for string equivalency assertions #2364

Closed

This comment was marked as resolved.

Sign in to view

This was referenced Jan 16, 2024

Add option to ignore newline style when comparing strings for equivalency #2565

Merged

Incorrect line breaks handling with custom string comparer #2566

Closed

jnyrup mentioned this issue Mar 23, 2024

Add string-option for ignoring newline style #2612

Closed

vbreuss closed this as completed Mar 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add string-options for ignoring newline differences #2496

Add string-options for ignoring newline differences #2496

vbreuss commented Dec 4, 2023 •

edited by jnyrup

dennisdoomen commented Jan 3, 2024

This comment was marked as resolved.

jnyrup commented Feb 1, 2024 •

edited

vbreuss commented Feb 1, 2024

bart-vmware commented Feb 1, 2024

dennisdoomen commented Feb 1, 2024

jnyrup commented Mar 23, 2024

dennisdoomen commented Mar 24, 2024

vbreuss commented Mar 24, 2024

Add string-options for ignoring newline differences #2496

Add string-options for ignoring newline differences #2496

Comments

vbreuss commented Dec 4, 2023 • edited by jnyrup

Background and motivation

API Proposal

API Usage

Alternative Designs

Risks

Are you willing to help with a proof-of-concept (as PR in that or a separate repo) first and as pull-request later on?

dennisdoomen commented Jan 3, 2024

This comment was marked as resolved.

jnyrup commented Feb 1, 2024 • edited

vbreuss commented Feb 1, 2024

bart-vmware commented Feb 1, 2024

dennisdoomen commented Feb 1, 2024

jnyrup commented Mar 23, 2024

dennisdoomen commented Mar 24, 2024

vbreuss commented Mar 24, 2024

vbreuss commented Dec 4, 2023 •

edited by jnyrup

jnyrup commented Feb 1, 2024 •

edited