Type annotations for RDA and other analysis related code #3023

fmagin · 2021-12-17T10:47:09Z

Lots of type annotations and fixups for various analysis related concepts and specifically the RDA.

Some of the annotations like 5a3b2cd might seem nearly useless, but they allow the IDE to infer that some access to this plugin is definitely of the type SimStateJNIReferences and methods/attributes that are the same as another class don't show up when searching for references to that member for the other class. All the annotations of networkx.DiGraph usage also serve this purpose, because often angr classes will have some member called successors which has a totally different meaning than DiGraph.successors.
Without annotations PyCharm can't realize that graph.successors is not a case where SimProcedure.successors is used, because graph might technically be of the type SimProcedure.

I am not sure how large the performance impact is to use exception based handling for the function handler. Instead of using hasattr. The massive issue with the hasattr/getattr approach is that it completely destroys the IDE analysis, and now it is at least possible to find every usage of FunctionHandler.handle_external_function and reason about the problem, instead of having to search the entire codebase for the string "handle_external_function"

rhelmot · 2021-12-19T13:16:44Z

So obvi we need to get CI passing, both lint and the testcase that seems to be failing because of the changes (I will not pretend to understand what's going on there), but beyond that, can you provide some sort of metric to indicate that the types you added are correct? Just any static analysis result that indicates the typing coverage has gone up or the number of type errors has gone down?

fmagin · 2021-12-20T07:55:51Z

My current solution for the FunctionHandler caused some testcases to fail still, but the issue that is left is fairly simple, I just didn't account for hasattr(obj, "attrname") also handling the case of obj being None, and forgot that I needed to add a check for that or handle that entirely different. I am going for entirely different. So the remaining testcases and lint issues aren't a problem.

A metric for the improvement is an interesting question, my main reason for writing the annotations in the first place was that it made it easier to work with the code in PyCharm, i.e. better type inference (and especially reducing false positives for some types) and fixing some lint issues PyCharm complains about by default. I'll look into setting up a mypy config that primarily detects wrong type annotations, so we can at least know if some type annotation leads to inconsistencies.

… field usages elsewhere)

fmagin · 2021-12-20T19:45:13Z

@rhelmot now the only linting issue (and CI failure at all) left is:

LINT FAILURE: angr/analyses/reaching_definitions/reaching_definitions.py regressed to 9.95/10.00
... angr/analyses/reaching_definitions/reaching_definitions.py:106:8: W0233: init method from a non direct base class 'ForwardAnalysis' is called (non-parent-init-called)

which seems llike a bug in PyLint pylint-dev/pylint#3505
Though supposedly that should be addressed already, so either the version in the CI is outdated or we encountered some other issue. My guess is that it is the linked bug, and the fix didn't make it into the CI yet.

rhelmot · 2021-12-21T19:09:12Z

Alright - I'll take you at your word that this is an improvement. You should collaborate with @twizmwazin (kevin phoenix on slack) on getting the mypy pass set up in CI!

ulugbekna · 2021-12-21T21:47:36Z

Thanks a lot for the PR! I love seeing to add type annotations improving both ide experience and making closer typechecking with mypy

As side notes: Am I the only one who’s slightly concerned with

The PR diverging from the common style of using single quotes by using double quotes?
Also it seemed slightly inconsistent in using commented type annotations vs ordinary type annotations, ie f(x: ‘int’) vs f(x: int)

fmagin · 2021-12-21T21:54:58Z

@ulugbekna you raise some valid points:

The PR diverging from the common style of using single quotes by using double quotes?

kinda valid, I actually don't make a conscious decision for that and just assumed that the linter would complain if it is something that was deemed relevant...

Also it seemed slightly inconsistent in using commented type annotations vs ordinary type annotations, ie f(x: ‘int’) vs f(x: int)
the uncommented version isn't always possible due to possible import cycles. I am not sure if the recommendation would be to always use the quoted version, so far I basically use unquoted if possible (i.e. the types are already available at runtime) or quoted if this isn't easily possible for one of the various reasons.

Overall, how concerned are you? I am not sure if those are slight stylistic things, or if there are more severe implications I am missing. Probably not for single vs double quotes, but maybe for the quoted vs unquoted types.

ulugbekna · 2021-12-21T22:02:43Z

I’m afraid I’m not expert either. Development can happen incrementally so this PR is definitely a good step forward :-)

…

On Tue, 21 Dec 2021 at 22:55, Florian Magin ***@***.***> wrote: @ulugbekna <https://github.com/ulugbekna> you raise some valid points: The PR diverging from the common style of using single quotes by using double quotes? kinda valid, I actually don't make a conscious decision for that and just assumed that the linter would complain if it is something that was deemed relevant... Also it seemed slightly inconsistent in using commented type annotations vs ordinary type annotations, ie f(x: ‘int’) vs f(x: int) the uncommented version isn't always possible due to possible import cycles. I am not sure if the recommendation would be to always use the quoted version, so far I basically use unquoted if possible (i.e. the types are already available at runtime) or quoted if this isn't easily possible for one of the various reasons. Overall, how concerned are you? I am not sure if those are slight stylistic things, or if there are more severe implications I am missing. Probably not for single vs double quotes, but maybe for the quoted vs unquoted types. — Reply to this email directly, view it on GitHub <#3023 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AD4YR6YEAL5KPNASJGEFSE3USDZTZANCNFSM5KIRCIEA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

fmagin changed the title ~~Fmagin/typing~~ Type annotations for RDA and other analysis related code Dec 17, 2021

fmagin force-pushed the fmagin/typing branch from 3544a3f to 9dc8004 Compare December 17, 2021 11:52

fmagin added 4 commits December 20, 2021 14:20

Full type annotation for ForwardAnalysis

087d2bb

Annotate various networkx.DiGraphs

7f23d2d

Various RDA and FunctionHandler annotations

bc1a873

misc annotations

25f065a

fmagin force-pushed the fmagin/typing branch 3 times, most recently from f65c81c to e17bae4 Compare December 20, 2021 17:28

fmagin added 5 commits December 20, 2021 18:40

Refactor FunctionHandler usage to work well with IDEs

b4ad3c8

Fix linting complaints in light/engine.py

ea9e979

Annotate the JNIReferences plugin (fixes a lot of false positives for…

e272883

… field usages elsewhere)

Misc annotatations

b6f0664

Linting fixes

b79d2b4

fmagin force-pushed the fmagin/typing branch from e17bae4 to b79d2b4 Compare December 20, 2021 17:43

rhelmot merged commit 7ebf017 into angr:master Dec 21, 2021

fmagin mentioned this pull request Dec 28, 2021

More type annotations to clean up mypy issues #3038

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Type annotations for RDA and other analysis related code #3023

Type annotations for RDA and other analysis related code #3023

fmagin commented Dec 17, 2021 •

edited

Loading

rhelmot commented Dec 19, 2021

fmagin commented Dec 20, 2021 •

edited

Loading

fmagin commented Dec 20, 2021

rhelmot commented Dec 21, 2021

ulugbekna commented Dec 21, 2021

fmagin commented Dec 21, 2021 •

edited

Loading

ulugbekna commented Dec 21, 2021 via email

Type annotations for RDA and other analysis related code #3023

Type annotations for RDA and other analysis related code #3023

Conversation

fmagin commented Dec 17, 2021 • edited Loading

rhelmot commented Dec 19, 2021

fmagin commented Dec 20, 2021 • edited Loading

fmagin commented Dec 20, 2021

rhelmot commented Dec 21, 2021

ulugbekna commented Dec 21, 2021

fmagin commented Dec 21, 2021 • edited Loading

ulugbekna commented Dec 21, 2021 via email

fmagin commented Dec 17, 2021 •

edited

Loading

fmagin commented Dec 20, 2021 •

edited

Loading

fmagin commented Dec 21, 2021 •

edited

Loading