Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Should stubs contain default values? #8988

Closed
hauntsaninja opened this issue Oct 26, 2022 · 28 comments
Closed

Should stubs contain default values? #8988

hauntsaninja opened this issue Oct 26, 2022 · 28 comments
Labels
project: policy Organization of the typeshed project

Comments

@hauntsaninja
Copy link
Collaborator

hauntsaninja commented Oct 26, 2022

This has come up recently, for similar reasons as #4881 and discussed a little there

From @gramster

an advantage of including default values is that tools like pyright or pylance will use stubs (if available) for the signatures they show to users when hovering over a function call, and users want to see the default values. I'd like to understand your point of view though as to why you would prefer them removed?

I'm overall fine with this, though I'd like to see the following:

  • we restrict it to simple literal values (e.g. anything that can go into a Literal)
  • we can still use ... at author's discretion (useful in overloads or cases where value doesn't actually matter or to avoid awful x: Literal["x"] = "x" things)
  • we update e.g. stubtest to verify stuff

The case for:

  • lets IDEs provide more information to users

The case against:

  • most things are longer than ... and terseness may be more valuable than the additional information
  • it's a thing that needs doing and maintaining
  • if we allow complicated default values there are more costs

A similar thing could apply to constants.

@JelleZijlstra
Copy link
Member

I agree, a couple of small comments:

  • we restrict it to simple literal values (e.g. anything that can go into a Literal)

Other values that we could accept would include floats and types (e.g., for params like dict_factory: Callable[..., ...] = dict).

  • we can still use ... at author's discretion (useful in overloads or cases where value doesn't actually matter or to avoid awful x: Literal["x"] = "x" things)
  • we update e.g. stubtest to verify stuff

It should be easy for stubtest to check defaults for correctness on Python functions (it's in the signature). For C functions, it's not necessarily possible to infer the default though, and indeed there may not be a default in a useful sense.

What other tools would need updating? I think we'd have to change flake8-pyi and hopefully nothing else.

@srittau srittau added the project: policy Organization of the typeshed project label Oct 26, 2022
@srittau
Copy link
Collaborator

srittau commented Oct 26, 2022

It would be useful - but not necessary - to update stubgen as well.

@AlexWaygood
Copy link
Member

AlexWaygood commented Oct 27, 2022

I like the idea of adding simple default values to stubs. I agree that it's very useful in IDEs, and I think it'll be easier to maintain than adding docstrings to stubs.

The = ... convention that we currently use is nice because it keeps stubs as concise as possible. But... it looks really weird to people who aren't used to it. I think it's nice for stubs to use the same syntax and conventions as normal Python wherever possible, and I remember finding the = ...s everywhere really strange and alien when I first started reading through typeshed's stubs as a new contributor.

What other tools would need updating? I think we'd have to change flake8-pyi and hopefully nothing else.

Pyright also has a lint that enforces only using ... as a default in stub files (ironically, since it's one of the IDEs that would benefit most from this change). We might be able to easily switch that off in our pyrightconfig file -- I'm not sure if it's bundled together with other stub-related lints or if it's a dedicated error code.

@sobolevn
Copy link
Member

I have some concerns about it.

First of all, this will require more branching just for the sake of default values if it is different for different python versions.

Or we can show incorrect default values for this specific python version.

Secondly, default values are not a part of the signature. So, they can be changed in bugfix releases. It will make it much harder to maintain.

Next, stubs are even harder to maintain with default values.

Moreover, there are quite ugly defaults values like _sentinel = object() or others. Not sure we want this.

I think that IDEs can just use inspect to get the function and its defaults. It is much easier and it will always show correct results.

@AlexWaygood
Copy link
Member

AlexWaygood commented Oct 27, 2022

Moreover, there are quite ugly defaults values like _sentinel = object() or others. Not sure we want this.

I like @hauntsaninja's idea of only including "simple" defaults, e.g. "anything that's valid to go in a Literal slice". We can continue to use ... for ugly or complex defaults.

@srittau
Copy link
Collaborator

srittau commented Oct 27, 2022

I'll honestly expect 95% of all default values to be None anyway.

@JelleZijlstra
Copy link
Member

First of all, this will require more branching just for the sake of default values if it is different for different python versions.

That will happen sometimes, but my sense is that it's not common.

Secondly, default values are not a part of the signature. So, they can be changed in bugfix releases. It will make it much harder to maintain.

In theory, sure, but I'd be surprised if that happens with any frequency.

If maintainability concerns like these do come up frequently, we can revisit our decision, but I'd be surprised if it is a major issue. The stdlib doesn't change its defaults that much, and for third-party libraries we only support one version at a time.

@gramster
Copy link

gramster commented Oct 27, 2022 via email

@jpe
Copy link

jpe commented Oct 27, 2022

IDEs will also probably want docstrings.

@JelleZijlstra
Copy link
Member

Docstrings provide significantly more maintainability challenges. It's worth discussing whether it's worth changing our stance on docstrings too, but let's do that in a separate issue.

@AlexWaygood
Copy link
Member

IDEs will also probably want docstrings.

Please don't discuss docstrings here. We have the following issue for that discussion:

@jpe

This comment was marked as off-topic.

@Avasam
Copy link
Collaborator

Avasam commented Nov 2, 2022

As a user of both libraries typed through Typeshed and inline-typed ones, I've noticed that's information lost in these third-party stubs, which is unfortunate. I really like when my IDE shows me default values of function parameters.

A similar thing could apply to constants.

I share the same feeling for constants. (Relates to #7258 ?)

Similarly to my comment here (#4881 (comment)) and as gramster mentionned: Is this something editors could inspect when they come across =... ? (ie: fallback to the source python code, if available).

What's a good compromise between maintainability and not hiding information that would otherwise be available?

On the idea of only doing it for simple types: stubtest could help validate and maintain accuracy. By knowing when a default is both known and considered "simple".

@JelleZijlstra
Copy link
Member

It seems like there is agreement among the maintainers here, so I think we can move forward with changing our policy and allowing defaults. I wrote a little tool to help infer defaults from the runtime: https://github.com/JelleZijlstra/stubdefaulter. I can also work on adding support to stubtest for checking that defaults are right.

@Avasam
Copy link
Collaborator

Avasam commented Dec 17, 2022

https://github.com/microsoft/pyright/releases/tag/1.1.284

Behavior Change: Removed diagnostic check that detects a non-ellipsis default value in a stub file. This check was based on a now-outdated best practice. We now recommend that stubs include default values for better usability in language servers.

@AlexWaygood
Copy link
Member

AlexWaygood commented Jan 17, 2023

We've had great progress on this recently:

  • Pyright and flake8-pyi have both removed their lints forbidding stubs to have defaults
  • @JelleZijlstra's written a codemod to automatically insert defaults into stubs
  • @JelleZijlstra recently contributed a fix to pytype to prevent a crash if a stub has a string-literal as a default value
  • @JelleZijlstra contributed an enhancement to stubtest so that it verifies defaults in stubs are correct.
  • stdlib: add defaults #9501 by @JelleZijlstra adds (some) defaults to the stdlib.

TODO, if anybody feels like taking these on:

@Avasam
Copy link
Collaborator

Avasam commented Jan 17, 2023

Do you wanna use this issue to track running the codemod on existing third-party stubs?

@JelleZijlstra
Copy link
Member

My tool has a couple of problems, opened issues in https://github.com/JelleZijlstra/stubdefaulter. Hopefully I'll have time to work on those soon.

@AlexWaygood
Copy link
Member

AlexWaygood commented Jan 20, 2023

#9501 added a couple of defaults that are set to sys.maxsize at runtime, e.g.

def count(self, sub: str | UserString, start: int = 0, end: int = 9223372036854775807) -> int: ...

The runtime source for this method is here: https://github.com/python/cpython/blob/3847a6c64b96bb2cb93be394a590d4df2c35e876/Lib/collections/__init__.py#L1439

The precise value of sys.maxsize will vary depending on what platform you're running Python on, so I think we should probably revert the changes where the default is set to sys.maxsize at runtime.

We could also consider adding a lint to flake8-pyi that forbids numeric defaults >6 digits (we can bikeshed the precise number of digits we want to permit) — thoughts?

@AlexWaygood
Copy link
Member

Alternatively we could add some special-casing to flake8-pyi around sys.maxsize specifically, so that this is allowed in a stub:

def count(self, sub: str | UserString, start: int = 0, end: int = sys.maxsize) -> int: ...

@JelleZijlstra
Copy link
Member

Sorry for missing that! For now let's just remove those defaults.

sys.maxsize defaults are fairly common and intuitive, so it would be nice to be able to use them in stubs as in your above message. It would be good though to get user feedback (@gramster?) on what kind of more complex defaults would be usable and useful for IDEs.

@AlexWaygood
Copy link
Member

In general I'm wary of allowing defaults that can't be easily verified by stubtest. I'd want most things that can't be easily verified by stubtest to be disallowed by flake8-pyi. But I'd be happy to special-case a small number of ast.Attribute nodes in flake8-pyi, and sys.maxsize seems like an obvious one to allow.

@AlexWaygood
Copy link
Member

The default value here is set to mmap.PAGESIZE at runtime, which is 4096 on my machine, but evidently a much larger value on Jelle's machine:

def __init__(self, size: int = 16384) -> None: ...

@AlexWaygood
Copy link
Member

AlexWaygood commented Jan 20, 2023

Here's a few funny ones I found using stubtest with mypy master 😄:

executable: str = "/Users/jelle/py/venvs/py311/bin/python", bits: str = "", linkage: str = ""

def getdocloc(self, object: object, basedir: str = "/Users/jelle/.pyenv/versions/3.11.1/lib/python3.11") -> str | None: ...

The first one is sys.executable, which seems like it would also make sense to be special-cased.

The second is dynamically determined using a function call at runtime: https://github.com/python/cpython/blob/3847a6c64b96bb2cb93be394a590d4df2c35e876/Lib/pydoc.py#L495

@Avasam
Copy link
Collaborator

Avasam commented Jan 23, 2023

I just found a nice side-effect of default parameter values:

def foo(bar=None):...
# Is now equivalent to
def foo(bar: Incomplete | None = None): ...

But the former keeps Unknown | None in pyright and might be preferable (when the default value is infact None obviously, I don't think it'd replace all cases).

AlexWaygood added a commit that referenced this issue Jan 29, 2023
Continuing work towards #8988.

The first five commits were created using stubdefaulter on various Python versions; the following commits were all created manually by me to fix various problems. The main things this adds that weren't present in #9501 are:

- Defaults in Windows-only modules and Windows-only branches (because I'm running a Windows machine)
- Defaults in non-py311 branches
- Defaults for float parameters
- Defaults for overloads
Avasam pushed a commit to Avasam/typeshed that referenced this issue Feb 1, 2023
Continuing work towards python#8988.

The first five commits were created using stubdefaulter on various Python versions; the following commits were all created manually by me to fix various problems. The main things this adds that weren't present in python#9501 are:

- Defaults in Windows-only modules and Windows-only branches (because I'm running a Windows machine)
- Defaults in non-py311 branches
- Defaults for float parameters
- Defaults for overloads
@Avasam
Copy link
Collaborator

Avasam commented May 2, 2023

Is there anything left to do or discuss here? Seems like we're been adding defaults to parameters for a while.

@JelleZijlstra
Copy link
Member

Alex's TODO list above says stubgen needs updating, but that's not something we can fix in this repo.

@Avasam
Copy link
Collaborator

Avasam commented May 2, 2023

Alex's TODO list above says stubgen needs updating, but that's not something we can fix in this repo.

True, but as far as typeshed's concerned, I think #10127 does the same until then 😃

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
project: policy Organization of the typeshed project
Projects
None yet
Development

No branches or pull requests

8 participants