every vuln chain #81

KevinHock · 2018-02-28T03:01:55Z

Motivation

What this PR does is change the way PyT finds vulnerabilities, as mentioned in the docs and the thesis, PyT originally used reaching definitions -- slightly modified to keep reassignments going (i.e. keeping the definition reaching) and, ran this slightly modified reaching definitions on every 'secondary' node in addition to the source.

The thing I wanted to do, was handle 'blackbox' function calls well, does url_for with a tainted argument yield a tainted value? What about os.path.join? Or some_random_web_framework_function? Having an answer would have helped us in our last evaluation.

There is a problem with only knowing that a definition reaches a sink, and not how. You can know arg = request.args.get('foo') reaches subprocess.check_output, but if there are multiple paths from source to sink, there may be one path that's vulnerable and one that's not. i.e. If there is a blackbox node that doesn't propagate taint or a sanitizer along the path we don't want to say it "might" not be vulnerable just because we don't know the path. You can see an example in the multi_chain.py test with corresponding mapping. I use 'path' and 'chain' interchangeably.

It's also confusing IMHO to just see all of the secondary nodes in the output as opposed to the ones that lead to the vulnerability, I tried to remedy this by implementing the trim option, which stepped backwards along one path, but that might not be the vulnerable path. Though I still left the default as secondary the nodes if you don't have the trim or interactive flag on.

This also gives us the opportunity to ask the user about if a blackbox function we haven't encountered propagates taint, (if they set the interactive command line option) and then save the mapping to a config, since we can't know all the web frameworks functions in the world.

How It Works

The way this works is by, after knowing that either a source or a secondary node reaches a sink (i.e. doing what we've always done), we construct a definition-use mapping, and traversing through the uses of the source, then the defs of those uses, and so on and so forth until the use is the sink. This was a great opportunity to use yield from and recursion, once we have a 'chain' we pass it to how_vulnerable which checks the chain against the blackbox mapping and sanitizers, if it comes back as FALSE we loop through the other chains and call how_vulnerable on until we find a vulnerable one.

We now use a vuln_factory and pass a dictionary, vuln_deets, as args.

Additional details

We can tell it's definitely sanitized when the sanitizer is an assignment node. Right now we can only say potentially if the sanitizer is an if statement, this is because we're currently not smart enough to tell if e.g. the condition leads to a return. I am okay with this.

Future Work

Being able to tell if the tainted argument is at the beginning or end of the string that ends up in the sink.

def client_passport():
    code = request.args.get('code')
    uri = 'http://localhost:5000/oauth?response_type=%s&client_id=%s&redirect_uri=%s' %(code,client_id,redirect_uri)
    return redirect(uri)

was a false-positive in our last evaluation due to this, this is also helpful for SSRF and other vulnerabilities.

Currently in this PR we stop and return a vulnerability if it's not FALSE, I'll change this in the future to be better and keep on going if the first thing we find is sanitized or unknown (and the e.g. 2nd thing we find is a TRUE vulnerability.)
JSON output
Maybe using textwrap for the vulnerability descriptions, only if it looks better.
I also might implement a command line option to list all of the vulnerable paths, though this is low-priority as it won't help us in our future evaluations.

…namedtuples

…ave somewhere to put the blackbox mapping, DRYd the args.startdate

…igure out UI

…ctory + alphabetize enums, make all end line comments have 2 spaces not 1, add AssignmentCallNode docstrings and DRYness, cleanup dead SinkArgsError code, rename vulnerability_log to vulnerability_helper, refactor all [0:..] to [:..]

…Visiting during an AssignmentCallNode, Rinsed build_def_use_chain, Wiped get_uses

…en we can default arg, Wipe old is_sanitised tests

…def_args_in_temp, make all tests pass, change double quotes to single quotes, cleanup stmt_star_handler/connect_if_allowed/comments/docstrings

…lVisitor stuff to node_types, comments

KevinHock · 2018-03-23T04:45:10Z

LGTM

KevinHock · 2018-03-27T02:03:42Z

👍

[Style] Cleanup def chains and save a little

6fed643

KevinHock mentioned this pull request Feb 28, 2018

pypi packaging #3

Closed

KevinHock added 3 commits March 1, 2018 18:53

[blackbox stuff] wrote get_vulnerability_chains, cleaned up def-use code

7df6529

Merge branch 'master' into every_vuln_chain

71ad9a1

[cleanup] remove if after merging in master that had the RHS vars fix

e075e93

KevinHock added the cool label Mar 2, 2018

nathanbraddock approved these changes Mar 2, 2018

View reviewed changes

KevinHock and others added 5 commits March 2, 2018 19:18

[blackbox stuff]Add interactive UI mode option, cleanup __main__ a bit

12ab2e1

[nothing significant] changing gears to tox branch

57878c9

[gotta go to work] made blackbox mapping file option, refactored all …

2dcd579

…namedtuples

[cleanup] Moved trigger_definitions to vulnerability_definitions to h…

7aa580b

…ave somewhere to put the blackbox mapping, DRYd the args.startdate

[blackbox stuff] start to check and save the mapping, still need to f…

e510739

…igure out UI

KevinHock mentioned this pull request Mar 9, 2018

[cleanup] vulnerabilities.py #92

Merged

KevinHock added 9 commits March 8, 2018 19:00

Merge branch 'master' into every_vuln_chain

1c80a0f

Disinfected all but 3 tests, Scrubbed ugly special case code for Vars…

e5d0b2c

…Visiting during an AssignmentCallNode, Rinsed build_def_use_chain, Wiped get_uses

Mop up reasoning for .args comment, Scour passing in an empty list wh…

a5be8be

…en we can default arg, Wipe old is_sanitised tests

Get path_traversal_sanitised_2 to work, remove a bad connect in save_…

84a25f2

…def_args_in_temp, make all tests pass, change double quotes to single quotes, cleanup stmt_star_handler/connect_if_allowed/comments/docstrings

Merge branch 'master' into every_vuln_chain

7643fbc

Made node types for If and Try statements statements, moved some Labe…

cfbefef

…lVisitor stuff to node_types, comments

Delete intra_cfg, same as in master

0dead7c

[DRY] Remove all 'line_number=node.lineno' for node def sites

1d11a05

KevinHock added 5 commits March 23, 2018 16:55

[DRY] Remove more 'line_number=node.lineno' from node def sites

6bb2a68

Added a comment to ConnectToExitNode

3df4c28

Add period at the end of docstring comment

8ee2e8a

[refactor] namedtuples changed field lists to tuples

d70133d

re-Add period at the end of docstring comment

4447c75

KevinHock merged commit 558d753 into master Mar 27, 2018

KevinHock mentioned this pull request Mar 29, 2018

[WIP] Fix false-negatives and false-positives #99

Closed

KevinHock deleted the every_vuln_chain branch March 29, 2018 03:35

This was referenced Apr 14, 2018

Feature Request: Whitelist lines ending in # nosec #108

Closed

Pagination vulnerabilities #11

Closed

Fix Ifatty line_number arg RaiseNode bug #117

Merged

KevinHock mentioned this pull request Jun 11, 2018

Add a "don't ask me anymore" option to the interactive mode #128

Closed

KevinHock mentioned this pull request Jul 24, 2018

Pathological code causes RecursionError #149

Open

KevinHock mentioned this pull request Nov 22, 2018

(Not an issue right now) Handle multiple returns #53

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

every vuln chain #81

every vuln chain #81

KevinHock commented Feb 28, 2018 •

edited

Loading

KevinHock commented Mar 23, 2018

KevinHock commented Mar 27, 2018

every vuln chain #81

every vuln chain #81

Conversation

KevinHock commented Feb 28, 2018 • edited Loading

Motivation

How It Works

Additional details

Future Work

KevinHock commented Mar 23, 2018

KevinHock commented Mar 27, 2018

KevinHock commented Feb 28, 2018 •

edited

Loading