Add hooks, fickling.load(), and a JSON output format for usability #79

suhacker1 · 2023-12-15T17:54:38Z

This PR adds several usability features: a fickling import hook, a global function hook, a fickling.load() function, and a JSON output format for the check_safety component of the CLI. Each of these makes it easier to integrate fickling into other codebases and tools.

This PR also updates the examples and tests to reflect these new features. Other notable changes:

Syncing is_likely_safe in fickle.py with check_safety in analysis.py: A new check_safety method was added to fickle.py as a wrapper, and is_likely_safe is now marked for deprecation.
Refactoring analysis.py: ProtoAnalysis was split for simplicity, and the analysis classes were given more structure so that they can report detailed results.
Adding new methods to pytorch.py: The PyTorchModelWrapper class now reports the identified file formats from the validation method.
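To make the is_likely_safe / check_safety relationship concrete, here is a small sketch of one common way to deprecate an old predicate in favor of a richer result. Everything in it is an assumption for illustration: the `PickleLike` and `Result` classes, the `LIKELY_SAFE` label, and the use of `DeprecationWarning` are not taken from the diff.

```python
import warnings
from dataclasses import dataclass


@dataclass
class Result:
    """Hypothetical result type: a severity label plus a readable analysis."""
    severity: str
    analysis: str


class PickleLike:
    """Hypothetical stand-in for the object that exposes the safety API."""

    def __init__(self, findings):
        self.findings = list(findings)

    def check_safety(self) -> Result:
        """New entry point: returns a structured result instead of a bool."""
        if self.findings:
            return Result("OVERTLY_MALICIOUS", "\n".join(self.findings))
        return Result("LIKELY_SAFE", "")

    def is_likely_safe(self) -> bool:
        """Old entry point: warns, then delegates so both paths stay in sync."""
        warnings.warn(
            "is_likely_safe is deprecated; use check_safety() instead",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.check_safety().severity == "LIKELY_SAFE"
```

Delegating from the old method to the new one means there is a single source of truth for the verdict, which is the point of the syncing described above.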

I would especially appreciate feedback on:

Whether the hook tailored for torch.load should be included
Whether the naming throughout is consistent and appropriate
How usable the interfaces now exposed by fickling are
How useful the current state of the JSON output is

Example JSON Output ("test_unused_variables.json"):

{
    "severity": "OVERTLY_MALICIOUS",
    "analysis": "Call to `eval(b'[5, 6, 7, 8]')` is almost certainly evidence of a malicious pickle file\nVariable `_var0` is assigned value `eval(b'[5, 6, 7, 8]')` but unused afterward; this is suspicious and indicative of a malicious pickle file",
    "detailed_results": {
        "AnalysisResult": {
            "OvertlyBadEval": "eval(b'[5, 6, 7, 8]')",
            "UnusedVariables": [
                "_var0",
                "eval(b'[5, 6, 7, 8]')"
            ]
        }
    }
}
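As a sketch of how a downstream tool might consume this output, the snippet below parses the example report with the standard json module. The key names come from the example above; the `summarize` helper is hypothetical, and whether the layout stays stable across versions is an assumption.

```python
import json

# The example report from above, embedded as a raw string so the JSON
# escape sequence \n is left for json.loads to decode.
REPORT = r"""
{
    "severity": "OVERTLY_MALICIOUS",
    "analysis": "Call to `eval(b'[5, 6, 7, 8]')` is almost certainly evidence of a malicious pickle file\nVariable `_var0` is assigned value `eval(b'[5, 6, 7, 8]')` but unused afterward; this is suspicious and indicative of a malicious pickle file",
    "detailed_results": {
        "AnalysisResult": {
            "OvertlyBadEval": "eval(b'[5, 6, 7, 8]')",
            "UnusedVariables": ["_var0", "eval(b'[5, 6, 7, 8]')"]
        }
    }
}
"""


def summarize(report_text: str) -> dict:
    """Hypothetical helper: pull out the severity, messages, and triggered checks."""
    report = json.loads(report_text)
    findings = report.get("detailed_results", {}).get("AnalysisResult", {})
    return {
        "severity": report["severity"],
        "messages": report["analysis"].splitlines(),
        "triggered": sorted(findings),
    }


summary = summarize(REPORT)
print(summary["severity"], summary["triggered"])
```

A CI job could, for example, fail the build whenever `summary["severity"]` is anything other than the value it treats as safe.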

Boyan-MILANOV

@suhacker1 good job on this! It needs some restructuring but the core idea is there :) I've left a bunch of comments, some are nits, but most of them are about code and architecture. Let's address all of them and then I'll make a second pass on the PR.

fickling/__init__.py

fickling/analysis.py

fickling/cli.py

fickling/loader.py

fickling/hook.py

example/context_manager.py

suhacker1 · 2023-12-19T03:40:39Z

Note: We decided not to include the PyTorch global hook in this PR. We also decided to remove the UNKNOWN severity type as we felt it was redundant.

example/context_manager.py

fickling/analysis.py

fickling/context.py

fickling/hook.py

fickling/fickle.py

fickling/loader.py

suhacker1 · 2023-12-19T17:49:49Z

@Boyan-MILANOV This is ready for another review!

Boyan-MILANOV

LGTM now! Good job! 🚀

suhacker1 added 14 commits December 13, 2023 10:33

Make PoC not dependent on the success of the other one

d7126a1

Initial prototype of import hook

44d2277

Encapsulate into function

21507a5

Switch order

b7b47d2

Refine import hook

5126558

Wrote a global function hook with an example

84900f0

Streamline hooks

ca6170e

Sync analysis and safety checks

8b2f690

Better error handling and tests for hook

8a6e4fd

Add context manager code

0870dba

Check torch.load with hook

6a2a5e1

Linting

e0196bb

Fix experiment files

be2488f

Attempt to create a hook for torch.load()

8e3d8d9

suhacker1 marked this pull request as ready for review December 15, 2023 18:32

suhacker1 requested a review from ESultanik as a code owner December 15, 2023 18:32

suhacker1 marked this pull request as draft December 15, 2023 18:32

suhacker1 added 5 commits December 15, 2023 14:14

Separate hook with torch from hook without

539749e

Better semantics for official loader

d4231a2

Add CLI output format for check_safety

f3dfe99

Bring back fickling.load()

f596d38

Lint

3ef6787

suhacker1 changed the title ~~Add hooks for usability~~ Add hooks, `fickling.load(), and a JSON output format for usability Dec 18, 2023

suhacker1 changed the title ~~Add hooks, `fickling.load(), and a JSON output format for usability~~ Add hooks, fickling.load(), and a JSON output format for usability Dec 18, 2023

suhacker1 added 2 commits December 17, 2023 21:44

Temporarily include draft torch hook

c43d135

Add more comments + Linting

3defa3a

Boyan-MILANOV requested changes Dec 18, 2023


suhacker1 added 2 commits December 18, 2023 12:31

Handle most comments

2e2b013

Handle stdout in check_safety

a9a0853

suhacker1 marked this pull request as ready for review December 18, 2023 20:03

suhacker1 added 4 commits December 18, 2023 23:01

Clean up code

4945b88

Remove UNKNOWN severity enum

f0c660c

Lint

dd17dff

Remove throwaway args

7557013

Boyan-MILANOV requested changes Dec 19, 2023


suhacker1 added 7 commits December 19, 2023 10:41

Expose fickling.check_safety()

1de8840

Fix default severity

baba3bd

Raise exception in loader

ef00d73

More changes

0cfd338

Refactor check_safety

2b2fd4a

Simplify hook

bb87af2

Lint and clean

36c1b96

Boyan-MILANOV approved these changes Dec 19, 2023


suhacker1 merged commit 08f98f2 into master Dec 19, 2023
12 checks passed

suhacker1 deleted the sh/usability branch January 4, 2024 03:57
