Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Warn about trait bounds on struct and enum type parameters #1689

Open
dtolnay opened this issue Apr 22, 2017 · 26 comments
Open

Warn about trait bounds on struct and enum type parameters #1689

dtolnay opened this issue Apr 22, 2017 · 26 comments
Labels
A-lint Area: New lints E-medium Call for participation: Medium difficulty level problem and requires some initial experience. L-unnecessary Lint: Warn about unnecessary code T-middle Type: Probably requires verifiying types

Comments

@dtolnay
Copy link
Member

dtolnay commented Apr 22, 2017

Traits are for behavior.

impl<T> Data<T> where T: Behavior { /* behavior, okay */ }

fn behave<T>() where T: Behavior { /* also behavior, also okay */ }

struct Data<T> where T: Behavior { /* DON'T DO THIS */ }

The only exception is things like Cow that use associated types to define data.

My current understanding is that any data structure not using associated types should not have trait bounds (?Sized doesn't count). Redundant bounds like this suck because they transitively infect anything that names the type in any way. Consider this enum.

#[derive(Clone, Debug, Eq, PartialEq, Serialize, Deserialize)]
enum Compact<T: CompactPart, H: Serialize + Deserialize + Clone + 'static> {
    Decrypted {
        header: Header<H>,
        payload: T,
    },
    Encrypted(Encrypted),
}

// kill me
fn print_the_thing<T, H>(c: Compact<T, H>)
    where T: CompactPart + Debug,
          H: Serialize + Deserialize + Clone + 'static + Debug
{
    println!("{:?}", c);
}

Without trait bounds on the data structure:

fn print_the_thing<T: Debug, H: Debug>(c: Compact<T, H>) {
    println!("{:?}", c);
}
@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

@oli-obk I think this would help a lot with people trying to write Deserialize<'de> bounds on their types. Almost all the struggles I've seen have been on structs, not so much on functions, and the answer is always just don't put a trait bound there.

@Manishearth
Copy link
Member

Hm, this is an interesting lint. Unsure if it should be warn by default; probably should.

@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

I believe it should warn by default as long as we can reliably detect the use of an associated type.

@clarfonthey
Copy link
Contributor

Fun fact: a few libstd types could benefit from these changes.

@clarfonthey
Copy link
Contributor

For the longest time BinaryHeap was on that list too, but it appears to have been fixed! It's been a while since I checked but it appears that all of the iterator adapters have been fixed for this too.

@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

For the longest time before 2013 rust-lang/rust@6ce7446#diff-76432ae7ccf0520cf36aac7c006809a3 😉

@oli-obk
Copy link
Contributor

oli-obk commented Apr 22, 2017

Won't the error messages quality decrease if those bounds are as late as possible? Especially for iterator adaptors? Don't know about any of the conversations on this topic, so I might be mixing some. A specialized version for a whitelist of bounds would probably be a good start. Things like debug, partialeq, deserialize...

@llogiq
Copy link
Contributor

llogiq commented Apr 22, 2017

I'd also like to see how such a lint would fare against typenum.

@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

I'd also like to see how such a lint would fare against typenum.

Perfectly fine. I tried removing all trait bounds from structs in typenum and it still compiles and passes all tests.

@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

Won't the error messages quality decrease if those bounds are as late as possible? Especially for iterator adaptors?

@oli-obk could you put together an example with before and after code? I don't think this could happen. Errors are triggered when you rely on behavior, and we would be keeping trait bounds on behavior.

@llogiq
Copy link
Contributor

llogiq commented Apr 22, 2017

cc @paholg

@paholg
Copy link

paholg commented Apr 22, 2017

It's beyond the scope of clippy, but I would like to see the opposite; that putting a constraint on a struct implicitly adds that constraint to all impls of that struct. This would mean that removing constraints would become a potentially breaking change, though.

As is, removing constraints from structs shouldn't break things, but it moves information on how the struct should be used from the type signature into documentation.

@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

Constraints on a struct implying the same constraints on all impls is an orthogonal discussion. Even with that feature, I would strongly disagree with putting trait bounds on data and I think this lint would still be valuable.

Consider Vec<T>. You can clone a vec if you can clone T. You can serialize a vec if you can serialize T. You can deserialize a vec if you can deserialize T.

Look at the code I linked to in the top comment. I see this all the time. It is the equivalent of writing:

#[derive(Clone, Serialize, Deserialize)]
struct Vec<T> where T: Clone + Serialize + Deserialize { /* ... */ }

Yes your crate may compile either way. Yes "information on how the struct should be used [is in] the type signature [not] documentation." But no it's not valuable in any way.

@paholg
Copy link

paholg commented Apr 22, 2017

It's not valuable for a general purpose struct like Vec, but for something like typenum, it makes sense. The structs defined in typenum are not designed to have any parameters except those defined inside the crate.

I think this lint could be valuable, too, and I should have expressed that in the previous comment. It's easy enough to disable in the rare cases when you don't want it.

@dtolnay
Copy link
Member Author

dtolnay commented Apr 22, 2017

Makes sense. On a spectrum between Vec<T> and typenum::PInt<U>, where do you think the Compact<T, H> type stands? And as a fraction of types how common do you think the typenum situation of opting out of this lint would be?

@llogiq
Copy link
Contributor

llogiq commented Apr 22, 2017

I think the typenum case isn't so common – indeed I hope it goes away sooner than later, once Rust's type system can express integers directly.

Another crate I know that uses generic constraints a lot is diesel. Perhaps @killercup can tell us about their uses?

@killercup
Copy link
Member

I haven't read the whole issue thread, but let me chime in regardless :)

I'm pretty sure that since most public data structures contain some private fields (and they typically should for forwards compat reasons), they also have at least one associated constructor method (usually new). Adding constraints to this impl might not be as descriptive as adding them to the struct, but still results in the same thing.

As far as I can see grep, diesel's main crate only has a handful of type constraints used in structs, and only RawBytesBindCollector and Nullable are public. (And Nullable uses it to restrict recursion.)

As for private structs, it's possible/easier to just use the type constructors, so you need to keep track of the intended constraints yourself, or add them to your struct.

@oli-obk oli-obk added L-unnecessary Lint: Warn about unnecessary code A-lint Area: New lints T-middle Type: Probably requires verifiying types E-medium Call for participation: Medium difficulty level problem and requires some initial experience. labels Apr 26, 2017
@kennytm
Copy link
Member

kennytm commented May 4, 2017

This lint would be pointless if (big if) implied bounds is solved.

@dtolnay
Copy link
Member Author

dtolnay commented May 4, 2017

As discussed in https://github.com/Manishearth/rust-clippy/issues/1689#issuecomment-296396580, this lint would be valuable whether or not implied bounds are solved. The typenum-like cases where you would want to opt out are rare.

@kennytm
Copy link
Member

kennytm commented May 4, 2017

@dtolnay It think it makes sense for structures like HashMap<K, V>, which is useless unless K: Eq + Hash. By the time implied-bounds is really landed the lint could be changed to allow-by-default.

@dtolnay
Copy link
Member Author

dtolnay commented May 4, 2017

I would disagree with allow-by-default even after implied bounds have landed.

Let's look at the motivating example again. Keep in mind that this is real code written by a real live Rust user of the sort that would rely on Clippy for guidance in navigating the complicated world of Rust types.

#[derive(Clone, Debug, Eq, PartialEq, Serialize, Deserialize)]
enum Compact<T: CompactPart, H: Serialize + Deserialize + Clone + 'static> {
    Decrypted {
        header: Header<H>,
        payload: T,
    },
    Encrypted(Encrypted),
}

Oh but we'd like to add a PartialOrd impl because it is needed for some new code. Easy enough:

- #[derive(Clone, Debug, Eq, PartialEq, Serialize, Deserialize)]
- enum Compact<T: CompactPart, H: Serialize + Deserialize + Clone + 'static> {
+ #[derive(Clone, Debug, Eq, PartialEq, Serialize, Deserialize, PartialOrd)]
+ enum Compact<T: CompactPart, H: Serialize + Deserialize + Clone + PartialOrd + 'static> {

Boom, breaking change. Whether or not there are implied bounds. In general this pattern conflicts with the ways we like to think about adding functionality to data structures.

The HashMap case is what we mean by the rare exceptions. Everyday code is much more like the code in this comment.

@devyn
Copy link

devyn commented Sep 3, 2019

Following this has generally made my code nicer, but I have found one little hiccup that I'm not quite sure what to do about.

Including trait bounds in the data definition allows access to associated items inside traits. This is often done with traits such as Iterator, Future, and Stream.

In my case I'm writing a stream adaptor that contains a buffer of items, but I think Peekable is a good enough analogy:

pub struct Peekable<S: Stream> {
    stream: Fuse<S>,
    peeked: Option<S::Item>,
}

This is taken from peek.rs in futures.

If we want to write this without the S: Stream bound, our only option is:

pub struct Peekable<S, T> {
    stream: Fuse<S>,
    peeked: Option<T>,
}

and then to include a where S: Stream<Item=T> bound on the implementations. However this makes the types when fully written out a lot longer and potentially include duplication of T at many different layers. If you're in a situation where automatic typing is not an issue, this is probably okay although leads to less than ideal errors. But if ever anyone has to write out multiple layers of this, it's a real problem.

However, including the S: Stream bound on the struct is also problematic for the same sort of reason others have mentioned: if there are excessive trait bounds on the implementation of Stream for a given type you want to use with it inside another structure, you have to make sure that structure also has all of those required as trait bounds in some way. So you may run into the same problem eventually.

dkellner added a commit to dkellner/chronofold that referenced this issue Sep 18, 2020
This way they don't have to be repeated everywhere the type is used.
See also rust-lang/rust-clippy#1689 .
caizixian added a commit to caizixian/notegraf that referenced this issue Jun 10, 2021
tgeoghegan added a commit to divviup/libprio-rs that referenced this issue Oct 1, 2021
We already provide `serde::{Serialize, Deserialize}` implementations for
every implementation of `prio::field::FieldElement`, but we didn't
`express that trait bound on `FieldElement` itself. This makes it tricky
to serialize or deserialize `FieldElement` values in generic functions.

To make this possible, we also change the declaration of various
structs so that the `F: FieldElement` trait bound is on the `impl` and
not the `struct`. See
rust-lang/rust-clippy#1689 for explanation.
tgeoghegan added a commit to divviup/libprio-rs that referenced this issue Oct 4, 2021
We already provide `serde::{Serialize, Deserialize}` implementations for
every implementation of `prio::field::FieldElement`, but we didn't
`express that trait bound on `FieldElement` itself. This makes it tricky
to serialize or deserialize `FieldElement` values in generic functions.

To make this possible, we also change the declaration of various
structs so that the `F: FieldElement` trait bound is on the `impl` and
not the `struct`. See [the Rust API guidelines][rust-api] or [this
clippy issue](clippy) for justification.

[rust-api]: https://rust-lang.github.io/api-guidelines/future-proofing.html#data-structures-do-not-duplicate-derived-trait-bounds-c-struct-bounds
[clippy]: rust-lang/rust-clippy#1689
bgw added a commit to vercel/next.js that referenced this issue Nov 15, 2024
… types (#72823)

`Vc<T>` had a type bound that `T: Send + ?Sized`. The general recommendation is to remove all type bounds from the struct that aren't necessary:

- https://rust-lang.github.io/api-guidelines/future-proofing.html#c-struct-bounds
- rust-lang/api-guidelines#6
- rust-lang/rust-clippy#1689

The reasoning is that type bounds on structs are "infectious". Any generic function, trait, or struct referring to `Vc<T>` was required to add `T: Send` bounds.

*Sidenote: The [`implied_bounds` feature](https://rust-lang.github.io/rfcs/2089-implied-bounds.html) might mitigate some of this if ever stabilized.*

Removing the `T: Send` type bound from `Vc<T>` means that there's less places where we need to specify it.

This pattern can be seen in many parts of the stdlib. For example, `Arc<T>` doesn't require that `T: Send + Sync`, though in reality to do anything useful with it, you need `T: Send + Sync`.

## Is this safe?

Yes, bounds are checked during cell construction, and the lower-level APIs used for creating cells require `Send + Sync`, so this would be hard to mess up without explicitly unsafe code.

Also, `Vc<T>` also technically requires that `T: Sync`, but that wasn't enforced in the struct definition (only during cell construction). We did just fine without that bound on the struct.

## Why are you leaving `?Sized`?

There's an implicit `Sized` bound on type parameters in struct definitions unless explicitly specified otherwise with `?Sized`.

Right now this bound isn't used. We use a box, e.g. `Vc<Box<dyn Foo>>`, but in the future we might be able to drop that intermediate box from the type signature. I'm leaving the `?Sized` bound in place (and adding it in a few places where it was missing) in hopes of that.
wyattjoh pushed a commit to vercel/next.js that referenced this issue Nov 28, 2024
… types (#72823)

`Vc<T>` had a type bound that `T: Send + ?Sized`. The general recommendation is to remove all type bounds from the struct that aren't necessary:

- https://rust-lang.github.io/api-guidelines/future-proofing.html#c-struct-bounds
- rust-lang/api-guidelines#6
- rust-lang/rust-clippy#1689

The reasoning is that type bounds on structs are "infectious". Any generic function, trait, or struct referring to `Vc<T>` was required to add `T: Send` bounds.

*Sidenote: The [`implied_bounds` feature](https://rust-lang.github.io/rfcs/2089-implied-bounds.html) might mitigate some of this if ever stabilized.*

Removing the `T: Send` type bound from `Vc<T>` means that there's less places where we need to specify it.

This pattern can be seen in many parts of the stdlib. For example, `Arc<T>` doesn't require that `T: Send + Sync`, though in reality to do anything useful with it, you need `T: Send + Sync`.

## Is this safe?

Yes, bounds are checked during cell construction, and the lower-level APIs used for creating cells require `Send + Sync`, so this would be hard to mess up without explicitly unsafe code.

Also, `Vc<T>` also technically requires that `T: Sync`, but that wasn't enforced in the struct definition (only during cell construction). We did just fine without that bound on the struct.

## Why are you leaving `?Sized`?

There's an implicit `Sized` bound on type parameters in struct definitions unless explicitly specified otherwise with `?Sized`.

Right now this bound isn't used. We use a box, e.g. `Vc<Box<dyn Foo>>`, but in the future we might be able to drop that intermediate box from the type signature. I'm leaving the `?Sized` bound in place (and adding it in a few places where it was missing) in hopes of that.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-lint Area: New lints E-medium Call for participation: Medium difficulty level problem and requires some initial experience. L-unnecessary Lint: Warn about unnecessary code T-middle Type: Probably requires verifiying types
Projects
None yet
Development

No branches or pull requests

9 participants