-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RFC: Generic member access for dyn Error trait objects #2895
base: master
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(didn't make it very far before my day filled up again, will return shortly)
One
You cannot make the default |
@burdges this is a general restriction on Any and downcast on Error types, I don't think it's currently possible to downcast types that aren't : 'static The second design with the a Provider miiight work with this, not sure. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I focused my feedback more on "technical writing" as requested, but in general I'm quite excited to see this proposal move forward! There are a number of cases at work where it would be nice to specialize error presentation in this way. At a high level I think this proposal would benefit from a "why not more general, i.e. all trait objects?" section. I also recommend putting more examples closer to the top of the "error reports" that you'd like to enhance -- they're quite motivating IMO.
This looks fantastic! ❤️ What is the likely timeline for this to reach stable? Given that it hasn't hit nightly yet, I'm guessing this will be gated behind a Rust 2024 edition? |
There's no timeline but it won't be gated behind any edition. This RFC doesn't include any breaking changes. The current next step is to break out the type tagged downcasting logic into it's own RFC which is being done by @Plecra. |
Reminder for myself: Once this lands I would like to update the example in the docs added in rust-lang/rust#95356 to instead use generic member access to grab the |
Add provider API to error trait Implements rust-lang/rfcs#2895
Add provider API to error trait Implements rust-lang/rfcs#2895
`Display` trait, which acts as the interface to the main member, the error | ||
message itself. It provides the `source` function for accessing `dyn Error` | ||
members, which typically represent the current error's cause. Via | ||
`#![feature(backtrace)]` it provides the `backtrace` function, for accessing a |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems outdated. In master branch I don't see any backtrace
function on the Error
trait with or without that feature. Maybe the backtrace
feature was already merged but left out the backtrace
method on Error
. This RFC makes it unnecessary anyway so I'll look into updating this section here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There used to be an unstable backtrace
function on Error
, but it was removed in favor for the generic member access proposed in this RFC.
core/any: remove Provider trait, rename Demand to Request This touches on two WIP features: * `error_generic_member_access` * tracking issue: rust-lang#99301 * RFC (WIP): rust-lang/rfcs#2895 * `provide_any` * tracking issue: rust-lang#96024 * RFC: rust-lang/rfcs#3192 The changes in this PR are intended to address libs meeting feedback summarized by `@Amanieu` in rust-lang#96024 (comment) The specific items this PR addresses so far are: > We feel that the names "demand" and "request" are somewhat synonymous and would like only one of those to be used for better consistency. I went with `Request` here since it sounds nicer, but I'm mildly concerned that at first glance it could be confused with the use of the word in networking context. > The Provider trait should be deleted and its functionality should be merged into Error. We are happy to only provide an API that is only usable with Error. If there is demand for other uses then this can be provided through an external crate. The net impact this PR has is that examples which previously looked like ``` core::any::request_ref::<String>(&err).unwramp() ``` now look like ``` (&err as &dyn core::error::Error).request_value::<String>().unwrap() ``` These are methods that based on the type hint when called return an `Option<T>` of that type. I'll admit I don't fully understand how that's done, but it involves `core::any::tags::Type` and `core::any::TaggedOption`, neither of which are exposed in the public API, to construct a `Request` which is then passed to the `Error.provide` method. Something that I'm curious about is whether or not they are essential to the use of `Request` types (prior to this PR referred to as `Demand`) and if so does the fact that they are kept private imply that `Request`s are only meant to be constructed privately within the standard library? That's what it looks like to me. These methods ultimately call into code that looks like: ``` /// Request a specific value by tag from the `Error`. fn request_by_type_tag<'a, I>(err: &'a (impl Error + ?Sized)) -> Option<I::Reified> where I: tags::Type<'a>, { let mut tagged = core::any::TaggedOption::<'a, I>(None); err.provide(tagged.as_request()); tagged.0 } ``` As far as the `Request` API is concerned, one suggestion I would like to make is that the previous example should look more like this: ``` /// Request a specific value by tag from the `Error`. fn request_by_type_tag<'a, I>(err: &'a (impl Error + ?Sized)) -> Option<I::Reified> where I: tags::Type<'a>, { let tagged_request = core::any::Request<I>::new_tagged(); err.provide(tagged_request); tagged.0 } ``` This makes it possible for anyone to construct a `Request` for use in their own projects without exposing an implementation detail like `TaggedOption` in the API surface. Otherwise noteworthy is that I had to add `pub(crate)` on both `core::any::TaggedOption` and `core::any::tags` since `Request`s now need to be constructed in the `core::error` module. I considered moving `TaggedOption` into the `core::error` module but again I figured it's an implementation detail of `Request` and belongs closer to that. At the time I am opening this PR, I have not yet looked into the following bit of feedback: > We took a look at the generated code and found that LLVM is unable to optimize multiple .provide_* calls into a switch table because each call fetches the type id from Erased::type_id separately each time and the compiler doesn't know that these calls all return the same value. This should be fixed. This is what I'll focus on next while waiting for feedback on the progress so far. I suspect that learning more about the type IDs will help me understand the need for `TaggedOption` a little better.
core/any: remove Provider trait, rename Demand to Request This touches on two WIP features: * `error_generic_member_access` * tracking issue: rust-lang/rust#99301 * RFC (WIP): rust-lang/rfcs#2895 * `provide_any` * tracking issue: rust-lang/rust#96024 * RFC: rust-lang/rfcs#3192 The changes in this PR are intended to address libs meeting feedback summarized by `@Amanieu` in rust-lang/rust#96024 (comment) The specific items this PR addresses so far are: > We feel that the names "demand" and "request" are somewhat synonymous and would like only one of those to be used for better consistency. I went with `Request` here since it sounds nicer, but I'm mildly concerned that at first glance it could be confused with the use of the word in networking context. > The Provider trait should be deleted and its functionality should be merged into Error. We are happy to only provide an API that is only usable with Error. If there is demand for other uses then this can be provided through an external crate. The net impact this PR has is that examples which previously looked like ``` core::any::request_ref::<String>(&err).unwramp() ``` now look like ``` (&err as &dyn core::error::Error).request_value::<String>().unwrap() ``` These are methods that based on the type hint when called return an `Option<T>` of that type. I'll admit I don't fully understand how that's done, but it involves `core::any::tags::Type` and `core::any::TaggedOption`, neither of which are exposed in the public API, to construct a `Request` which is then passed to the `Error.provide` method. Something that I'm curious about is whether or not they are essential to the use of `Request` types (prior to this PR referred to as `Demand`) and if so does the fact that they are kept private imply that `Request`s are only meant to be constructed privately within the standard library? That's what it looks like to me. These methods ultimately call into code that looks like: ``` /// Request a specific value by tag from the `Error`. fn request_by_type_tag<'a, I>(err: &'a (impl Error + ?Sized)) -> Option<I::Reified> where I: tags::Type<'a>, { let mut tagged = core::any::TaggedOption::<'a, I>(None); err.provide(tagged.as_request()); tagged.0 } ``` As far as the `Request` API is concerned, one suggestion I would like to make is that the previous example should look more like this: ``` /// Request a specific value by tag from the `Error`. fn request_by_type_tag<'a, I>(err: &'a (impl Error + ?Sized)) -> Option<I::Reified> where I: tags::Type<'a>, { let tagged_request = core::any::Request<I>::new_tagged(); err.provide(tagged_request); tagged.0 } ``` This makes it possible for anyone to construct a `Request` for use in their own projects without exposing an implementation detail like `TaggedOption` in the API surface. Otherwise noteworthy is that I had to add `pub(crate)` on both `core::any::TaggedOption` and `core::any::tags` since `Request`s now need to be constructed in the `core::error` module. I considered moving `TaggedOption` into the `core::error` module but again I figured it's an implementation detail of `Request` and belongs closer to that. At the time I am opening this PR, I have not yet looked into the following bit of feedback: > We took a look at the generated code and found that LLVM is unable to optimize multiple .provide_* calls into a switch table because each call fetches the type id from Erased::type_id separately each time and the compiler doesn't know that these calls all return the same value. This should be fixed. This is what I'll focus on next while waiting for feedback on the progress so far. I suspect that learning more about the type IDs will help me understand the need for `TaggedOption` a little better.
maybe this proposal need updated to reflect that fact? |
} | ||
``` | ||
|
||
# Drawbacks |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A potential additional drawback is that requesting types dynamically like this can lead to surprising changes in runtime behavior when changing versions of (transitive) dependencies.
Given:
- A crate
foo
with versions 1.0.0 and 2.0.0, each of which provides a typeFoo
. - A crate
bar
which depends on version 1.0.0 offoo
and whose errors provideFoo
. - A crate
baz
which dependsbar
and on version 2.0.0 offoo
and tries to requestFoo
frombar
's errors.
Then a situation is created in which baz
will fail to request Foo
from bar
's error because the requested Foo
and provided Foo
are actually distinct types because the transitive dependency foo
's versions don't match up.
This could even happen in a semver-compatible release of bar
and baz
because technically the API is not broken: the type can still be requested, but it will simply begin returning None
instead of Some(_)
. This would be even worse if foo
, bar,
and baz
are all library crates depended on by an application crate and the semver-incompatible versions of foo
are introduced by a semver-compatible change in bar
or baz
which would get pulled in by running cargo update
on the application crate. The application crate has no recourse in this situation aside from waiting for bar
and baz
to agree on a semver-compatible version of foo
again.
(I've been playing with an alternate design based on an associated type rather than generic member access which causes this situation to be a compile time error instead of a runtime behavior change, but that design requires traits with lifetime GATs to be dyn safe to be actually good, and associated type defaults for it to maybe be possible to retrofit that design onto the standard library's existing error trait.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree this is a footgun I'd prefer to avoid but as far as I can tell this is not a problem being uniquely introduced by generic member access. This is a problem with type erasure in general and the same situation could be encountered by having foo return Foo
as a Box<dyn Any>
in any of it's public API's. If baz
tries to downcast using it's own path to foo::Foo
it will run into the same type ID mismatch.
I avoid this in my crates currently by re-exporting types from crates that are part of my public API (https://docs.rs/color-eyre/latest/color_eyre/#reexports). Applying this to your example you could avoid this footgun if you downcast/request Foo
via bar::foo::Foo
which would correctly resolve to whatever version it is using. This of course gets a little silly when you want to be able to handle all the versions of Foo
, because then presumably you have to try to request Foo
once for each path through which you could possibly reach it, which is definitely a subpar user experience, so not necessarily a great solution.
That said, I was never able to figure out how to implement generic member access w/o type erasure and runtime checks. I'm curious to understand more about your potential alternative design, because I would definitely love to see a better solution but I don't see how it is possible given the constraints of the error trait and the goals of generic member access.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree this is a footgun I'd prefer to avoid but as far as I can tell this is not a problem being uniquely introduced by generic member access.
True.
This is a problem with type erasure in general and the same situation could be encountered by having foo return
Foo
as aBox<dyn Any>
in any of it's public API's.
However, it seems to me that this kind of thing is relatively rare currently, both in the standard library and ecosystem crates. The only standard library exception I can think of is std::io::Error
, for which I found (part of?) the rationale for in rust-lang/rust#24793 (comment), and I haven't personally encountered any ecosystem crates that make use of this and tell users in documentation that they can/should downcast the inner error to some concrete type.
So, I think making type erasure a central part of the error handling API of the standard library would probably be the main place this kind of problem would come up. I'm not sure the existing occasional cases of downcasting in public APIs can be used to justify a standard library API built entirely on downcasting.
Applying this to your example you could avoid this footgun if you downcast/request
Foo
viabar::foo::Foo
which would correctly resolve to whatever version it is using.
While true, I imagine another place this might become a problem is in a library whose purpose is to use generic member access to generate pretty error reports. This hypothetical library might hypothetically pull in the backtrace
crate for handling backtraces. If any other crate in the dependency tree has errors that provide a backtrace from a different version of the backtrace
crate, the hypothetical library will not be able to extract them. I don't think this can be solved with re-exports, because that would force the library to have a re-export of every crate that has errors that provide a backtrace
crate backtrace.
Your next point about requesting the type through each path (including crate version) does provide a solution there, but also as you point out, would not be the most fun to maintain.
I'm curious to understand more about your potential alternative design, because I would definitely love to see a better solution but I don't see how it is possible given the constraints of the error trait and the goals of generic member access.
I can't promise that this particular idea is any better, but for what it's worth, it goes like this:
trait Error {
/// Type that contains details about this error.
///
/// For example, this could be a struct containing a String with help text and a Backtrace.
type Details<'a> = ()
where Self: 'a;
/// Get the details about this error.
fn details(&self) -> Self::Details<'_>;
}
As written, this requires some language features that aren't implemented yet:
- Associated type defaults. Necessary to make it possible to backport this onto
core::error::Error
, but luckily not required for defining a new error trait in a new crate. dyn
-safety for traits with lifetime GATs. In my current implementation, I've simply dropped the lifetime and am just returning an owned value, which has some overhead but at least allows experimenting with the rest of the design.
Another part of this design is a map
-like function to change the type of Details
when composing errors that have different concrete types.
To approximate generic member access to make it possible to implement a hypothetical library for pretty-printing errors like described above, I imagine a crate could define some traits like trait GetBacktrace { fn get_backtrace(&self) -> Option<&Backtrace>; }
(or one trait with many methods, I suppose) and then adding these traits as bounds on Error::Details
on the pretty-printing functions. The author of a binary crate could then implement these traits for their Error::Details
type and use the library's pretty-printing code.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wouldn't that still have a similar problem? To adapt your example from before:
- A crate foo with versions 1.0.0 and 2.0.0, each of which provides a trait Foo.
- A crate bar which depends on version 1.0.0 of foo and whose errors have a
Details
type that implements Foo - A crate baz which depends bar and on version 2.0.0 of foo and has uses a bound that requires
Error::Details
to implement Foo (from version 2.0.0 of foo)
baz still can't get the the Foo
details from errors from bar.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's correct, but the difference is how it can't get the details. With the generic member access design, this would fail at run time: the generic member access functions would return None
. With the associated type design, this would fail at compile time: a compilation error would be emitted about the trait bound being unsatisfied.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can currently have a Box<dyn Error + 'static>
(or reference, or some other lifetime). Associated types and dyn-safety for traits with lifetime GATs isn't enough. If you had a Details
type that isn't equal to the default, your error won't be compatible with anything that types a dynamic error, without some other kind of mechanism. And there may be some benefit to a compile time check. But that also means you lose use cases where you might want to extract data dynamically without knowing if it exists at runtime.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can currently have a
Box<dyn Error + 'static>
(or reference, or some other lifetime). Associated types and dyn-safety for traits with lifetime GATs isn't enough. If you had aDetails
type that isn't equal to the default, your error won't be compatible with anything that types a dynamic error, without some other kind of mechanism.
I guess a really important question is how many notable occurrences of Box<dyn Error>
(and similar) are there in public APIs? The only one I can think of off the top of my head is std::io::Error::{other, into_inner}
. In such a case, I believe the map
function I alluded to before should be sufficient. To extract the original Details
, you'd have to downcast to the concrete type first, which I think is not really a regression because that's how std::io::Error
is currently intended to work. For internal-only APIs, the type could simply be changed to specify the desired Details
.
But that also means you lose use cases where you might want to extract data dynamically without knowing if it exists at runtime.
I suspect these cases could be covered for Error
-processing functions by adding bounds on Details
of traits that contain accessor functions that return Option<T>
so that Error
implementors have the, uh, option of returning None
. Are there any situations where that would be insufficient?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess a really important question is how many notable occurrences of Box (and similar) are there in public APIs?
Well, I think returning a Result<(), Box<dyn Error + 'static>
from main is a somewhat common pattern. And while you could maybe replace that with Result<(), Box<dyn Error<Details=SomeType> + 'static>
, you would still need the same Details type for all errors, which gets messy if your errors come from different sources.
And you have libraries like anyhow that make heavy use of dyn core::error:Error
, such as in root_cause()
and chain()
and from_boxed()
in anyhow.
To extract the original Details, you'd have to downcast to the concrete type first, which I think is not really a regression because that's how std::io::Error is currently intended to work
But that means you have to know what the concrete type is. And Details
would need to implement a trait that has support for downcasting. And how do you handle combining multiple detail types into the a single error. Do you try casting to a struct that has the exact combination of details you expect? Do you use a collection of Details objects, and then try downcasting each one of those to what you expect?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well, I think returning a
Result<(), Box<dyn Error + 'static>
from main is a somewhat common pattern.
The return type of main
is not a public API, and the requirements of the Termination
trait allow you to choose other types for the Err
variant, so this would be easy to change to specify a details type. (Specifically, the requirement on the Err
variant is just Debug
.)
And you have libraries like anyhow that make heavy use of
dyn core::error:Error
, such as inroot_cause()
andchain()
andfrom_boxed()
in anyhow.
I don't think this really counts towards what I was asking either. For anyhow
(and similar crates) to be relevant here we'd need to know how often other library crates have anyhow::Error
in their public interface.
But that means you have to know what the concrete type is.
Worth noting that this is not a regression from the status quo.
And how do you handle combining multiple detail types into the a single error. Do you try casting to a struct that has the exact combination of details you expect? Do you use a collection of Details objects, and then try downcasting each one of those to what you expect?
You would use the map
function to compose detail types. You could implement your details type to do any of those things if you wanted, but I think a simpler approach would be to impl From<OtherDetails> for MyDetails
or so.
But adding a Details type to existing errors would break existing code, which means this would either not be very useful, or would have massive churn. And you would need all your errors to have the same details type, which likely means you would have to map over the details for all of them, which isn't very ergonomic. With the original proposal, the only places that have to worry about the context/details of an error are places that consume those details, and places that produce them. But with the associated type, everything else also has to worry about it to make sure the types are consistent, and the details are propagated up.
My point is that in order for anyhow to make use of this feature, it would have to make a breaking change, whoch would cause a lot of churn. And what's more, any errors that define a Details type wouldn't be compatible with anyhow, without first mapping to get rid of the details. And I suspect other crates would have similar issues, anyhow is just a prominent example.
Even if this was true, it makes it less useful than the original RFC. But it is a regression, in a sense. Because you can't have something like I'm not even really sure how downcasting would work here? Do you use a Details type of
But map to what? Let's say you are implementing a library, where you have a fallible function that combines multiple different types of errors, say an error from another library that includes some details and a user-defined error you know nothing about. You don't know anything about the Details type on the user defined error (that is generated by user code), but the caller of your function may also want to make use of the details, but you don't know what traits it wants those details to implement. So you can't possibly implement the MySpecialContext details trait that your user wants, even though you might have that data, if the error came from user code. It is also going to add a lot of complexity to intermediate code that is just handling combining different types of errors. Now you not only need an enum for the different types of errors, you also probably need an enum for different types of Details. And probably need enums to delegate various traits. And it needs to do that regardless of if it actually produces or consumes any detail information. |
One should also keep in mind that the purpose of the |
The previous two comments were probably intended for this review comment thread. Not sure if it's possible to move them now. |
Rendered
Also, I've mocked up the proposal in this repository - https://github.com/yaahc/nostd-error-poc/