Tracking issue for error source iterators #58520

sfackler · 2019-02-16T20:14:21Z

This covers the <dyn Error>::iter_chain and <dyn Error>::iter_sources methods added in #58289.

The text was updated successfully, but these errors were encountered:

withoutboats · 2019-02-16T20:19:22Z

My preference before stabilizing would be to make changes to make it more like the original failure design, which I still prefer. There should be one iterator, it should begin from this error type, and its name should be sources. If you want all the sources of error less this layer, you can write .skip(1).

TimDiekmann · 2019-02-16T21:00:00Z

I like the idea of a convenient interface to iterate sources. However, the interface feels unergonomic as mentioned in #58289 (comment). Additionally I think this should not be stabilized before #53487

haraldh · 2019-02-18T15:27:09Z

My preference before stabilizing would be to make changes to make it more like the original failure design, which I still prefer. There should be one iterator, it should begin from this error type, and its name should be sources. If you want all the sources of error less this layer, you can write .skip(1).

Yeah, my first iteration (pun) had only .iter(), but then @sfackler requested the other iter_*()

faern · 2019-03-29T15:22:48Z

These iterators are really good. I have long relied on the display_chain method in error-chain. As I move away from that lib I find it cumbersome to transform my entire chain of errors into a good string representation.

With these iterators I can get the equivalent of display_chain this way:

let display_chain: String = error
    .iter_sources()
    .fold(format!("Error: {}", error), |mut s, e| {
        s.push_str(&format!("\nCaused by: {}", e));
        s
    });

These iterators does not solve the entire problem. But they make it a bit easier to implement.

haraldh · 2019-05-31T12:21:47Z

So, any changes needed here? I really want this to be stabilized

withoutboats · 2019-07-11T20:52:45Z

I think a few changes are needed before this can be stabilized:

First, we need to actually reach a consensus about whether we have both of these APIs. I've been consistent that I think the changes made to failure regarding this issue since I stopped maintaining it are a mistake: we should have one iterator which begins with self and calls cause until it returns None. If users want to skip the self value, its just as simple as adding .skip(1) to that iterator. No one else has really expressed an opinion about that question.
Even if we do have both, I think the names right now are not well-chosen. It would be better to change them to something like .errors() and .sources() (or keep one and just call it one of these two names). We tend to avoid the name iter except for iterating through elements of a collection, modified by its ownership type; for iterators which are semantic values pulled out of a type, we prefer just using the name of that kind of value (as in lines, keys, chars, etc).
We need to decide what to do about the trait object vs provided method problem, discussed on the PR. This involves the struggle around the fact that Box<Error> doesn't implement Error. I think the best solution available to us now is to just have the method repeated in both places.

derekdreery · 2019-07-14T12:53:42Z

my 2¢: I think the simplest (for users) implementation is a single iter method, with a note in the documentation that

The iterator length is always >= 1, that is, you can call iter.next().unwrap() once without causing a panic.
If you just want the sources, use e.iter().skip(1).

An example like

fn print_error(e: impl Error) { // or Box<Error> or whatever other actual type this is implemented for
  let mut chain = e.iter();
  eprintln!("error: {}", chain.next().unwrap());
  for source in chain {
    eprintln!("caused by {}", source);
  }

This leaves the option open of an iter_sources method later if it's decided that it's really needed (that can just be implemented as iter().skip(1)).

BurntSushi · 2019-07-14T13:00:42Z

I agree with @withoutboats and @derekdreery. I had to carefully read the docs for each method before I figured out how they differed. Moreover, I suspect it will be quite difficult to pick names for both methods that are easy to remember and distinguish. Having one method and requiring the caller to use skip(1) if they don't want the current error seems much nicer IMO.

withoutboats · 2019-07-14T19:50:52Z

It does seem that many users found the terminology causes confusing for an iterator that returns self (myself, it seemed intuitive, if the documentation says so, that this error can be considered a member of the chain of causality). So a better name than sources might be preferred here. However, both of the other options I can think of are also imperfect:

I think iter violates our naming convention, in that I think iter must return the same iterator returned by <&T as IntoIterator>::into_iter, and I don't expect that impl to exist for error types.
Error::errors does not seem great either, just because it can be confusing to say that this error contains many errors.

As I said previously, I don't think names like iter_chain are a good choice either, they're pretty far outside of our general style for iterator names.

faern · 2019-07-14T20:52:32Z

I agree causes/sources would be confusing if the iterator contains the error it's called on. In err.causes() I want the errors that caused err. err did not cause itself, nor is it part of the sources for this error. It would not be consistent with the existing err.source() which returns the first error under err not err itself.

I do fully agree that err is part of the chain of causality however, which is why I think something like chain would work.

derekdreery · 2019-07-15T12:13:43Z

@withoutboats

I think iter violates our naming convention, in that I think iter must return the same iterator returned by <&T as IntoIterator>::into_iter, and I don't expect that impl to exist for error types.

Is this set in stone. I think iter is the best name, because it feels familiar, in that the iter() method should return an iterator that makes most sense for the object. I wouldn't expect FromIterator to be implemented for &Error, but if it were to be implemented then it should look like this (i.e. include the base error).

Could the naming convention be changed to "iter must return the same iterator returned by <&T as IntoIterator>::into_iter, or if this is not implemented, it must return the canonical iterator for the object (implying that such a canonical iterator must exist)".

derekdreery · 2019-07-15T12:18:33Z

To give some more context on why I would like it to be called iter:

When I'm reading the documentation for some object with methods, when I see an iter method I think to myself "what iterator makes most sense for this object". So for a vector I would assume it returns refs to the elements, or for a linked list (which is what our error chain is essentially), I would expect it to walk the nodes and return a node per iteration. I don't think "how is FromIterator implemented for the reference type for this object". Maybe I should be thinking that :P

haraldh · 2019-07-30T13:06:15Z

I think iter violates our naming convention, in that I think iter must return the same iterator returned by <&T as IntoIterator>::into_iter, and I don't expect that impl to exist for error types.

Hmm, this works:

impl<'a> IntoIterator for &'a (dyn Error + 'static) {
    type Item = &'a (dyn Error + 'static);
    type IntoIter = ErrorIter<'a>;

    fn into_iter(self) -> Self::IntoIter {
        ErrorIter {
            current: Some(self),
        }
    }
}

let mut iter = <&(dyn Error + 'static) as IntoIterator>::into_iter(&err);

or

for e in err.borrow() as &(dyn Error) {
    eprintln!("{}", e);
}

and iter_chain() (or iter() how I would call it) is implemented for dyn Error

withoutboats · 2019-08-01T17:13:13Z

@haraldh We can add that impl, but I don't think we should (and it seems completely circular to add it just to justify naming the method iter)

haraldh · 2019-08-01T17:36:47Z

@haraldh We can add that impl, but I don't think we should ...

Agreed

teiesti · 2019-09-04T01:42:38Z

I'd like to point to the issues discussing the addition (#48420) and stabilization (#50894) of std::path::Path::ancestors. At first glance this API might seem unrelated but the situation back then was quite similar:

The objective was to add an iterator that recursively calls an existing, well-known method.
There was a discussion if self should be included.
There was a lengthy discussion about the name.

In the end, the rationale was:

Such iterators are helpful. They should better be stabilized sooner than later.
self should be included. It is easy to .skip(1) it. Not including self is harmful because it is harder to add self to the iterator than to remove it. (I think, it is not a good idea to add two APIs just to cover both semantics because that will clutter both, the namespace and the human mind.)
The chosen name should be telling and reflect the fact that self is included. Adding an "s" to the original method name is a good to start unless that leads to unwanted associations which, I think, is often the case. (I think, .iter() is not a good choice, because it is natural to iterate over vector items but it is not natural to iterate over error sources. In theory, there may be other properties of an Error that are a more or equally natural subject to iteration, e.g., descriptions provided in different languages. In particular, a type implementing Error might also want to implement a method called .iter() for whatever reason.)

I'd like to throw two naming ideas into the ring that might be considered for further debate:

.ancestors(), because the iterator iterates over the ancestors in a ancestry of errors which are cause to their respective children.
.chained() in honor of error-chain and because the iterator iterates over the chain of errors that is somehow included in self.

I'd also like to propose to rename the ErrorIter struct. I think, it is good practice throughout the standard library to name the struct after the method creating it (example).

ehuss · 2019-10-07T18:40:09Z

Sorry if this was brought up earlier, I tried to read the RFC/issues, and didn't see it brought up.

What is the intent of how to handle older code that overrides cause but not source? For example, IntoStringError only implements cause. If you iterate over sources (iter_sources), then the cause is lost. In this case it is the Utf8Error which provides useful detail like invalid utf-8 sequence of 1 bytes from index 0 .

Should these errors be changed to implement source instead of cause? Or maybe implement both?

faern · 2019-10-07T19:13:24Z

IMO all new errors should only implement source. All old code that want to stay relevant and usable should be updated to implement source as well. I do not think we should focus the development of new features in libstd to cover deprecated items.

derekdreery · 2019-10-07T19:29:09Z

IMO all new errors should only implement source. All old code that want to stay relevant and usable should be updated to implement source as well. I do not think we should focus the development of new features in libstd to cover deprecated items.

If we were to do this, could we make an effort to submit PRs to popular crates using cause.

More generally (and slightly OT), is there a mechanism for the community to help crate maintainers with stuff like this?

haraldh · 2019-10-18T13:47:03Z

I'd like to point to the issues discussing the addition (#48420) and stabilization (#50894) of std::path::Path::ancestors. At first glance this API might seem unrelated but the situation back then was quite similar:

1. The objective was to add an iterator that recursively calls an existing, well-known method.

2. There was a discussion if `self` should be included.

3. There was a lengthy discussion about the name.

In the end, the rationale was:

1. Such iterators are helpful. They should better be stabilized sooner than later.

2. `self` should be included. It is easy to `.skip(1)` it. Not including `self` is harmful because it is harder to add `self` to the iterator than to remove it. (I think, it is not a good idea to add two APIs just to cover both semantics because that will clutter both, the namespace and the human mind.)

3. The chosen name should be telling and reflect the fact that `self` is included. Adding an "s" to the original method name is a good to start unless that leads to unwanted associations which, I think, is often the case. (I think, `.iter()` is not a good choice, because it is natural to iterate over vector items but it is not natural to iterate over error sources. In theory, there may be other properties of an `Error` that are a more or equally natural subject to iteration, e.g., descriptions provided in different languages. In particular, a type implementing `Error` might also want to implement a method called `.iter()` for whatever reason.)

I'd like to throw two naming ideas into the ring that might be considered for further debate:

* `.ancestors()`, because the iterator iterates over the ancestors in a ancestry of errors which are cause to their respective children.

* `.chained()` in honor of `error-chain` and because the iterator iterates over the chain of errors that is _somehow_ included in `self`.

I'd also like to propose to rename the ErrorIter struct. I think, it is good practice throughout the standard library to name the struct after the method creating it (example).

Opened a PR with this suggestion: #65557

haraldh · 2019-10-21T08:41:37Z

So, it seems, that even Path::ancestors() includes itself. So, to avoid confusion and simplify it more, I reduced PR #65557 to only have chained and Chain.

Rationale:

Such iterators are helpful. They should better be stabilized sooner than later.
self should be included. It is easy to .skip(1) it. Not including self is harmful because it is harder to add self to the iterator than to remove it.
The chosen name should be telling and reflect the fact that self is included. .chained() was chosen in honor of error-chain and because the iterator iterates over the chain of errors that is somehow included in self.
The resulting iterator is named Chain because the error::Chain is what we want to have.

derekdreery · 2019-10-21T15:01:29Z

I like the name chained because it indicates that it includes the current error, but connects all the others.

The docs should mention the skip(1) way of avoiding the original error. (This is currently the case in the PR 323f6a4)

bbqsrc · 2020-09-02T15:08:47Z

Anything blocking this from stabilization?

haraldh · 2020-09-02T20:05:04Z

see #58520 (comment) and following

especially #69161 is annoying

haraldh · 2020-09-14T11:05:14Z

Stabilization Report

The current implementation of error source iterators is only implemented on dyn Error as chain(), which returns an iterator beginning with self instead of self.source() (which was discussed here).

This causes an un-ergonomic usage on non &dyn Error types:

    let mut iter = (&b as &(dyn Error)).chain();
    // or
    let mut iter = <dyn Error>::chain(&b);
    // or
    let mut iter = Error::chain(&b);

Solutions to the un-ergonomic usage

Implement `chain()` on the Error trait

An additional implementation on trait Error could be done via:

pub trait Error: Debug + Display {
    // […]
    fn chain(&self) -> Chain<'_> where Self: Sized + 'static {
        Chain {
            current: Some(self),
        }
    }
}

but that leads to issue #69161

Add an `ErrorChain` trait

See #58520 (comment) and PR #69163

Remove `chain()` and add `sources()` instead

Returning an iterator not starting with self but self.source() can be done in the Error trait directly and so it is implemented for all dyn Error also.

pub trait Error: Debug + Display {
    // […]
    fn sources(&self) -> Chain<'_> {
        Chain {
            current: self.source(),
        }
    }
}

haraldh · 2020-09-14T11:06:59Z

@withoutboats #58520 (comment)

haraldh · 2021-02-02T16:01:11Z

So, if nobody objects, I will open a PR with the "Remove chain() and add sources() instead" solution, as nothing else provides an ergonomic solution.

haraldh · 2021-02-03T12:21:48Z

Here we go: #81705

To produce an error iterator `std::error::Chain` one had to call `<dyn Error>::chain()`, which was not very ergonomic, because you have to manually cast to a trait object, if you didn't already have one with the type erased. ``` let mut iter = (&my_error as &(dyn Error)).chain(); // or let mut iter = <dyn Error>::chain(&my_error); // or let mut iter = Error::chain(&my_error); ``` The `chain()` method can't be implemented on the Error trait, because of rust-lang#69161 `Chain::new()` replaces `<dyn Error>::chain()` as a good alternative without confusing users, why they can't use `my_error.chain()` directly. The `Error::sources()` method doesn't have this problem, so implement it more efficiently, than one could achieve this with `Chain::new().skip(1)`. Related: rust-lang#58520

KodrAus · 2021-02-04T10:36:30Z

I think @haraldh touched on a reason to consider both a method that iterates over self and its sources and one that just includes its sources: you can call the latter on any error, but can only call the former on an error you can cast into dyn Error + 'static:

// Valid for any `&E: Error + ?Sized`
fn sources(&self) -> Chain<'_> {}

// Valid only for `&E: Error + Sized + 'static`
fn chain(&self) -> Chain<'_> where Self: Sized + 'static {}

So you could write a maximally compatible method to display an error using sources:

fn render(err: impl Error) {
    fn render_inner(err: impl Error) { .. }

    render_inner(&err);

    for source in err.sources() {
        render_inner(source);
    }
}

but you couldn't write that same implementation using chain unless you also constrain the input error to 'static.

You could argue that's a bit of a forced difference though, because in practice I think non-'static errors aren't very useful anyway since they can't participate in the source chain, but currently unless you wrap a Box<dyn Error + 'static> up into a newtype that implements Error the only impl Error you can get from it is non-'static.

notgull · 2021-07-02T23:18:54Z

Shouldn't the iterator proper implement FusedIterator?

dtolnay · 2021-07-03T02:30:08Z

Shouldn't the iterator proper implement FusedIterator?

That's not obvious. There is no documented requirement on https://doc.rust-lang.org/1.53.0/std/error/trait.Error.html#method.source that it behave in a fused way.

notgull · 2021-07-03T02:47:25Z

Shouldn't the iterator proper implement FusedIterator?

That's not obvious. There is no documented requirement on https://doc.rust-lang.org/1.53.0/std/error/trait.Error.html#method.source that it behave in a fused way.

Apologies if I'm misinterpreting something, but once the Error object returns a None, that's the end of the error chain, isn't it? You can't get another &(dyn Error + 'static) out of that. I guess you could create an infinite loop, but ending is not required for fused iterators.

mathstuf · 2021-07-03T03:09:50Z

Nothing says the error can't change its mind about what its source is:

impl Error for SillyError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        if rand::int() == 1 { Some(&self.source) } else { None }
    }
}

notgull · 2021-07-03T03:13:43Z

Nothing says the error can't change its mind about what its source is:

impl Error for SillyError {
    fn source(&self) -> Option<&(dyn Error + 'static)> {
        if rand::int() == 1 { Some(&self.source) } else { None }
    }
}

Is it a requirement for fused iterators to not involve randomness? All I'm saying is that std::error::Chain could implement FusedIterator, since, unless there are plans to change the way that it is implemented, it could get a marginal win in the albeit rare case where one passes it into a function that fuses it.

mathstuf · 2021-07-03T03:19:28Z

Is it a requirement for fused iterators to not involve randomness?

I guess my example might still not be enough as std::error::Chain will assume it's over. The rule for FusedIterator is that once None is returned, None will always be returned on subsequent .next() calls. I suppose whether the last Error is delegated to or if there is some implicit FusedIterator assumption in std::error::Chain would answer this question.

Some papercuts on error::Error Renames the chain method, since I chain could mean anything and doesn't refer to a chain of sources (cc rust-lang#58520) (and adds a comment explaining why sources is not a provided method on Error). Renames arguments to the request method from `req` to `demand` since the type is `Demand` rather than Request or Requisition. r? `@yaahc`

Some papercuts on error::Error Renames the chain method, since I chain could mean anything and doesn't refer to a chain of sources (cc rust-lang#58520) (and adds a comment explaining why sources is not a provided method on Error). Renames arguments to the request method from `req` to `demand` since the type is `Demand` rather than Request or Requisition. r? ``@yaahc``

nrc · 2022-08-30T09:40:18Z

In #100955, I renamed chain to sources, it still includes self in the iterator. I added a comment to try and record the issue with why source must be defined in the impl and not the trait. In terms of solutions, this should be fixed by the dyn* work which will permit more methods on trait objects. The other possible solution is to remove self from the iterator (this is a solution because the problem is in converting self to a trait object, which might not work if self is itself a wide pointer which is not a trait object, all the other sources are already trait objects so skipping self means no conversion has to happen). Adding an Unsize bound is not a solution because its not backwards compatible.

jonhoo · 2022-09-23T22:36:47Z

I recently discovered that io::Error doesn't return custom inner errors through Error::source (#101817). I wonder whether we should work around that for iter_chain and iter_sources so that they know how to walk through (and continue after) io::Error::Custom as well?

zopsicle · 2024-04-14T12:22:20Z

Solutions to the un-ergonomic usage

Another solution is to make sources a function in the module core::error, rather than a method on dyn Error. One would write use std::error; and then error::sources(&error).

sfackler added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. B-unstable Blocker: Implemented in the nightly compiler and unstable. C-tracking-issue Category: A tracking issue for an RFC or an unstable feature. labels Feb 16, 2019

withoutboats added the A-error-handling Area: Error handling label Jul 12, 2019

haraldh mentioned this issue Oct 18, 2019

rename Error::iter_chain() and remove Error::iter_sources() #65557

Merged

Dylan-DPC-zz added the PG-error-handling Project group: Error handling (https://github.com/rust-lang/project-error-handling) label Sep 19, 2020

KodrAus mentioned this issue Sep 29, 2020

Error Handling Project Group rust-lang/libs-team#3

Open

ramosbugs mentioned this issue Nov 24, 2020

Token endpoint response parsing failure cause is not exposed to the caller ramosbugs/openidconnect-rs#34

Closed

Nemo157 mentioned this issue Dec 2, 2020

Tracking issue for RFC 2504, "Fix the Error trait" #53487

Open

9 tasks

haraldh mentioned this issue Feb 3, 2021

std::error::Error: change the error iterator producer #81705

Closed

nrc mentioned this issue Aug 24, 2022

Some papercuts on error::Error #100955

Merged

tustvold mentioned this issue Sep 13, 2022

Enhance TaskContext and add task failure root cause apache/datafusion#3410

Open

jonhoo mentioned this issue Sep 23, 2022

Return the custom error instead of its cause in io::Error::{cause,source} #101818

Closed

Tracking issue for error source iterators #58520

Tracking issue for error source iterators #58520

Comments

sfackler commented Feb 16, 2019

withoutboats commented Feb 16, 2019

TimDiekmann commented Feb 16, 2019

haraldh commented Feb 18, 2019

faern commented Mar 29, 2019

haraldh commented May 31, 2019

withoutboats commented Jul 11, 2019 • edited

derekdreery commented Jul 14, 2019 • edited

BurntSushi commented Jul 14, 2019

withoutboats commented Jul 14, 2019

faern commented Jul 14, 2019

derekdreery commented Jul 15, 2019 • edited

derekdreery commented Jul 15, 2019

haraldh commented Jul 30, 2019 • edited

withoutboats commented Aug 1, 2019

haraldh commented Aug 1, 2019 • edited

teiesti commented Sep 4, 2019 • edited

ehuss commented Oct 7, 2019

faern commented Oct 7, 2019

derekdreery commented Oct 7, 2019

haraldh commented Oct 18, 2019

haraldh commented Oct 21, 2019

derekdreery commented Oct 21, 2019 • edited

bbqsrc commented Sep 2, 2020

haraldh commented Sep 2, 2020

haraldh commented Sep 14, 2020

Stabilization Report

Solutions to the un-ergonomic usage

Implement chain() on the Error trait

Add an ErrorChain trait

Remove chain() and add sources() instead

haraldh commented Sep 14, 2020

haraldh commented Feb 2, 2021

haraldh commented Feb 3, 2021

KodrAus commented Feb 4, 2021 • edited

notgull commented Jul 2, 2021

dtolnay commented Jul 3, 2021

notgull commented Jul 3, 2021

mathstuf commented Jul 3, 2021

notgull commented Jul 3, 2021

mathstuf commented Jul 3, 2021

nrc commented Aug 30, 2022

jonhoo commented Sep 23, 2022

zopsicle commented Apr 14, 2024

withoutboats commented Jul 11, 2019 •

edited

derekdreery commented Jul 14, 2019 •

edited

derekdreery commented Jul 15, 2019 •

edited

haraldh commented Jul 30, 2019 •

edited

haraldh commented Aug 1, 2019 •

edited

teiesti commented Sep 4, 2019 •

edited

derekdreery commented Oct 21, 2019 •

edited

Implement `chain()` on the Error trait

Add an `ErrorChain` trait

Remove `chain()` and add `sources()` instead

KodrAus commented Feb 4, 2021 •

edited