TST/TYP: start testing public api types #40202

simonjayhawkins · 2021-03-03T16:34:30Z

There has been some discussion in the recent developer meetings regarding the testing of types.

This PR uses the a method of testing used (maybe developed?) by numpy.

IMO this is most useful for testing overloads, but could be extended to the complete public api.

jbrockmendel · 2021-03-03T19:27:28Z

ballpark, how big should we expect this form of testing to get? e.g. 3 files for every .py file in core/?

pep8speaks · 2021-03-04T08:49:12Z

Hello @simonjayhawkins! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file pandas/tests/typing/data/fail/frame.py:

Line 8:89: E501 line too long (96 > 88 characters)

Comment last updated at 2021-03-04 12:28:38 UTC

simonjayhawkins · 2021-03-04T09:03:52Z

ballpark, how big should we expect this form of testing to get? e.g. 3 files for every .py file in core/?

from #40200

reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0))
reveal_type(pd.DataFrame([1,2,3]).reset_index(inplace=False))
reveal_type(pd.DataFrame([1,2,3]).reset_index(drop=True, inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, drop=False, inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, inplace=True, col_fill='a'))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, col_fill='a'))
reveal_type(pd.DataFrame([1,2,3]).reset_index(inplace=True))

so I would expect that these would be added to the typing tests.

so the file will grow at the rate of PRs adding typing ( that we want to test) to public apis

I added a frame.py for DataFrame methods (whether defined in frame.py or generic.py), I would maybe expect a separate file (x3) for each public class and then maybe a file (x3) for top level functions (but may also make sense to split the top-level functions).

I don't think we need to mirror the source code structure for this as we are unlikely to test typing for internals. (although we could have an internals section if there are some methods that require complex overloading that would benefit from testing)

The structure (and scope) either needs more discussion if it is a concern, or be allowed to evolve over time.

MarcoGorelli · 2021-03-04T11:45:38Z

I hadn't appreciated this straight away, but there also needs to be a test for the bool case.

e.g.

# t.py

import pandas as pd

inplace: bool

reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0))
reveal_type(pd.DataFrame([1,2,3]).reset_index(inplace=False))
reveal_type(pd.DataFrame([1,2,3]).reset_index(drop=True, inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, drop=False, inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, inplace=True, col_fill='a'))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, col_fill='a'))
reveal_type(pd.DataFrame([1,2,3]).reset_index(inplace=True))
reveal_type(pd.DataFrame([1,2,3]).reset_index(inplace=inplace))
reveal_type(pd.DataFrame([1,2,3]).reset_index(drop=True, inplace=inplace))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, inplace=inplace))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, drop=False, inplace=inplace))
reveal_type(pd.DataFrame([1,2,3]).reset_index(level=0, inplace=inplace, col_fill='a'))
reveal_type(pd.DataFrame([1,2,3]).reset_index(inplace=inplace))

on master this gives:

$ mypy t.py 
t.py:5: note: Revealed type is 'pandas.core.frame.DataFrame'
t.py:6: note: Revealed type is 'pandas.core.frame.DataFrame'
t.py:7: note: Revealed type is 'None'
t.py:8: note: Revealed type is 'None'
t.py:9: note: Revealed type is 'None'
t.py:10: note: Revealed type is 'None'
t.py:11: note: Revealed type is 'pandas.core.frame.DataFrame'
t.py:12: note: Revealed type is 'None'
t.py:13: error: No overload variant of "reset_index" of "DataFrame" matches argument type "bool"  [call-overload]
t.py:13: note: Possible overload variants:
t.py:13: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[False] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> DataFrame
t.py:13: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[True] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> None
t.py:13: note: Revealed type is 'Any'
t.py:14: error: No overload variant of "reset_index" of "DataFrame" matches argument types "bool", "bool"  [call-overload]
t.py:14: note: Possible overload variants:
t.py:14: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[False] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> DataFrame
t.py:14: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[True] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> None
t.py:14: note: Revealed type is 'Any'
t.py:15: error: No overload variant of "reset_index" of "DataFrame" matches argument types "int", "bool"  [call-overload]
t.py:15: note: Possible overload variants:
t.py:15: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[False] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> DataFrame
t.py:15: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[True] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> None
t.py:15: note: Revealed type is 'Any'
t.py:16: error: No overload variant of "reset_index" of "DataFrame" matches argument types "int", "bool", "bool"  [call-overload]
t.py:16: note: Possible overload variants:
t.py:16: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[False] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> DataFrame
t.py:16: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[True] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> None
t.py:16: note: Revealed type is 'Any'
t.py:17: error: No overload variant of "reset_index" of "DataFrame" matches argument types "int", "bool", "str"  [call-overload]
t.py:17: note: Possible overload variants:
t.py:17: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[False] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> DataFrame
t.py:17: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[True] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> None
t.py:17: note: Revealed type is 'Any'
t.py:18: error: No overload variant of "reset_index" of "DataFrame" matches argument type "bool"  [call-overload]
t.py:18: note: Possible overload variants:
t.py:18: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[False] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> DataFrame
t.py:18: note:     def reset_index(self, level: Union[Hashable, Sequence[Hashable], None] = ..., drop: bool = ..., inplace: Literal[True] = ..., col_level: Hashable = ..., col_fill: Hashable = ...) -> None
t.py:18: note: Revealed type is 'Any'
Found 6 errors in 1 file (checked 1 source file)

while on the latest commit from #40200 it gives:

$ mypy t.py 
t.py:5: note: Revealed type is 'pandas.core.frame.DataFrame'
t.py:6: note: Revealed type is 'pandas.core.frame.DataFrame'
t.py:7: note: Revealed type is 'None'
t.py:8: note: Revealed type is 'None'
t.py:9: note: Revealed type is 'None'
t.py:10: note: Revealed type is 'None'
t.py:11: note: Revealed type is 'pandas.core.frame.DataFrame'
t.py:12: note: Revealed type is 'None'
t.py:13: note: Revealed type is 'Union[pandas.core.frame.DataFrame, None]'
t.py:14: note: Revealed type is 'Union[pandas.core.frame.DataFrame, None]'
t.py:15: note: Revealed type is 'Union[pandas.core.frame.DataFrame, None]'
t.py:16: note: Revealed type is 'Union[pandas.core.frame.DataFrame, None]'
t.py:17: note: Revealed type is 'Union[pandas.core.frame.DataFrame, None]'
t.py:18: note: Revealed type is 'Union[pandas.core.frame.DataFrame, None]'

MarcoGorelli · 2021-03-04T12:00:23Z

Anyway, thanks @simonjayhawkins for doing this - having spent several hours in the last few days making sense of these overloads, I've seen that it's really easy to fool oneself into thinking that the type hints are correct because mypy pandas won't throw errors if not all of these combinations are tested for

twoertwein · 2021-03-10T13:01:01Z

Sorry, this is probably not a place for a discussion. Wouldn't it be easier to automatically decorate all functions with a runtime type checker during testing, for example with typeguard? Assuming there is a runtime type checker that supports all typing features (I think typeguard does not support overloads). An alternative to typeguard could be to use mypyc to compile compatible python files. Mypyc inserts type checks into the compiled code. But I fear that mypyc does not yet support enough python language features to be useful for pandas.

simonjayhawkins · 2021-03-10T13:19:03Z

Sorry, this is probably not a place for a discussion.

sure is for now. this PR is a possible alternative to #39813 (comment)

Most likely, the best way to test if we have all the overloads correct is by fully typing our tests code, and adding # ignore comments when we are specifically testing for incorrect types.

other options are definitely worth discussing and maybe we should open a dedicated issue.

my preference for the numpy approach (if we do test types) is that not only are we closely aligned as projects with numpy, but also thier method is "tried and tested" unlike the approach suggested in #39813 where we have no idea where the pain points maybe.

we must also remember that there is also the "do nothing" option and not explicitly test the types. IIRC this was suggested in the January dev meeting (Mostly relying on user feedback to report issues.)

simonjayhawkins · 2021-03-13T16:29:01Z

@jreback @WillAyd @jbrockmendel @Dr-Irv @jorisvandenbossche Is this something we might want to do?

Dr-Irv · 2021-03-13T17:50:35Z

Is this something we might want to do?

I very much support pandas having some kind of tests that test our typing declarations.

BUT I think it will easier for us to do what I said before in #39813 (comment) (quoting again here):

Most likely, the best way to test if we have all the overloads correct is by fully typing our tests code, and adding # ignore comments when we are specifically testing for incorrect types.

My reason for going that direction is because users will encounter typing issues in their own code (and I have first hand experience with this using the type stubs that Microsoft's pylance/pyright have put out there), and I think it will be easier for the community to fix our source code that has typed methods for the public API (and add an appropriate test in pandas/tests) as opposed to using the reveal_type method proposed in this PR, which would require us to enumerate all the possibilities in two places - once in our regular testing code and once in the "testing types" code.

In addition, it's not clear to me how the current proposal helps us test the variety of arguments that can be passed to methods. We know that our existing testing code already tries different argument types, so if we typed our testing code (and ran mypy on it), then we're getting the benefit of one set of tests that tests functionality as well as typing signatures.

jreback · 2021-03-14T11:30:34Z

@simonjayhawkins is this ready? i am not sure anyone has looked yet - certainly can

simonjayhawkins · 2021-03-14T11:35:32Z

I opened this PR with the intention of supporting review of #40200 and #40197. #40200 is merged and I don't want to block #40197 pending discussion here.

so closing this.

to be clear, I am -1 on going down the route of type checking the tests at the moment.
It is not clear to me where the pitfalls maybe. Would be good to see this working successfully on other projects. for instance, does it distract from the activity of typing the core codebase? how much of slowdown would we see running mypy locally? How many type:ignores will be needed? How do we distiguish between ignores added where we expect a TypeError vs an ignore added to silence mypy for other reasons.

@simonjayhawkins is this ready? i am not sure anyone has looked yet - certainly can

I had already written the above. so will keep that narrative here. but leave the PR open for now.

simonjayhawkins · 2021-03-14T11:39:25Z

is this ready?

will give it another look shortly. so converted to draft for now.

simonjayhawkins · 2021-03-21T15:06:00Z

closing to clear queue.

simonjayhawkins · 2022-01-08T13:37:19Z

Sorry, this is probably not a place for a discussion.

sure is for now. this PR is a possible alternative to #39813 (comment)

Most likely, the best way to test if we have all the overloads correct is by fully typing our tests code, and adding # ignore comments when we are specifically testing for incorrect types.

other options are definitely worth discussing and maybe we should open a dedicated issue.

#45252

my preference for the numpy approach (if we do test types) is that not only are we closely aligned as projects with numpy, but also thier method is "tried and tested" unlike the approach suggested in #39813 where we have no idea where the pain points maybe.

This is no longer the case as the authors of https://github.com/VirtusLab/pandas-stubs/tree/master/tests/snippets can help assess this as an alternative.

we must also remember that there is also the "do nothing" option and not explicitly test the types. IIRC this was suggested in the January dev meeting (Mostly relying on user feedback to report issues.)

I'll link to this from #45252, so this option is also considered.

TST/TYP: start testing public api types

e946c61

simonjayhawkins added Testing pandas testing functions or related to the test suite Typing type annotations, mypy/pyright type checking labels Mar 3, 2021

simonjayhawkins mentioned this pull request Mar 3, 2021

TYP fixup overloads for reset_index #40200

Merged

1 task

Merge remote-tracking branch 'upstream/master' into test-typing

df0f0d2

linting

4689a9d

jreback modified the milestone: 1.3 Mar 5, 2021

This was referenced Mar 9, 2021

TYP: Update type hints for ExtensionArray and ExtensionDtype #39501

Closed

ENH: Using pyright to analyze missing type declarations #39813

Open

MarcoGorelli mentioned this pull request Mar 10, 2021

TYP test reset_index overloads #40349

Closed

simonjayhawkins marked this pull request as draft March 14, 2021 11:39

simonjayhawkins closed this Mar 21, 2021

simonjayhawkins mentioned this pull request Apr 2, 2021

TYP: groupby.pyi #40716

Merged

4 tasks

simonjayhawkins mentioned this pull request Jul 3, 2021

TYP: Fix typing in ExtensionDtype registry #41203

Merged

simonjayhawkins mentioned this pull request Jan 8, 2022

ENH: Add set of tests to type check the public APIs #45252

Open

simonjayhawkins deleted the test-typing branch January 8, 2022 14:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST/TYP: start testing public api types #40202

TST/TYP: start testing public api types #40202

simonjayhawkins commented Mar 3, 2021

jbrockmendel commented Mar 3, 2021

pep8speaks commented Mar 4, 2021 •

edited

Loading

simonjayhawkins commented Mar 4, 2021

MarcoGorelli commented Mar 4, 2021

MarcoGorelli commented Mar 4, 2021

twoertwein commented Mar 10, 2021

simonjayhawkins commented Mar 10, 2021

simonjayhawkins commented Mar 13, 2021

Dr-Irv commented Mar 13, 2021

jreback commented Mar 14, 2021

simonjayhawkins commented Mar 14, 2021

simonjayhawkins commented Mar 14, 2021 •

edited

Loading

simonjayhawkins commented Mar 21, 2021

simonjayhawkins commented Jan 8, 2022

TST/TYP: start testing public api types #40202

TST/TYP: start testing public api types #40202

Conversation

simonjayhawkins commented Mar 3, 2021

jbrockmendel commented Mar 3, 2021

pep8speaks commented Mar 4, 2021 • edited Loading

Comment last updated at 2021-03-04 12:28:38 UTC

simonjayhawkins commented Mar 4, 2021

MarcoGorelli commented Mar 4, 2021

MarcoGorelli commented Mar 4, 2021

twoertwein commented Mar 10, 2021

simonjayhawkins commented Mar 10, 2021

simonjayhawkins commented Mar 13, 2021

Dr-Irv commented Mar 13, 2021

jreback commented Mar 14, 2021

simonjayhawkins commented Mar 14, 2021

simonjayhawkins commented Mar 14, 2021 • edited Loading

simonjayhawkins commented Mar 21, 2021

simonjayhawkins commented Jan 8, 2022

pep8speaks commented Mar 4, 2021 •

edited

Loading

simonjayhawkins commented Mar 14, 2021 •

edited

Loading