Create a structure-aware JavaScript fuzzer to find deep bugs #1902

addisoncrump · 2022-03-06T08:19:52Z

This PR adds two experimental fuzzers which generate valid JavaScript code from Arbitrary structs. These fuzzers (or variants thereof) were used to identify all of my previous PRs and issues. It does not generate identifiers which resolve to built-in types.

I will add documentation when possible, but I've been busy with work and wanted to offer this to y'all so I didn't have to back and forth every time a new PR was merged; possibly useful for OSS-Fuzz/CI in the future. It finds bugs very, very quickly.

If you want to test for yourself, I recommend using cargo fuzz run -s none interp_fuzzer -- -timeout=5.

codecov · 2022-03-06T08:58:31Z

Codecov Report

Merging #1902 (dd85b72) into main (6498216) will decrease coverage by 0.82%.
The diff coverage is 0.85%.

@@            Coverage Diff             @@
##             main    #1902      +/-   ##
==========================================
- Coverage   45.87%   45.05%   -0.83%     
==========================================
  Files         206      208       +2     
  Lines       17102    17445     +343     
==========================================
+ Hits         7846     7860      +14     
- Misses       9256     9585     +329

Impacted Files	Coverage Δ
boa_engine/src/context/mod.rs	`32.39% <0.00%> (-0.31%)`	⬇️
boa_engine/src/lib.rs	`79.31% <ø> (ø)`
boa_engine/src/syntax/ast/constant.rs	`42.85% <ø> (ø)`
boa_engine/src/syntax/ast/node/array/mod.rs	`28.57% <0.00%> (-4.77%)`	⬇️
boa_engine/src/syntax/ast/node/await_expr/mod.rs	`28.57% <0.00%> (-11.43%)`	⬇️
boa_engine/src/syntax/ast/node/block/mod.rs	`41.17% <0.00%> (-2.58%)`	⬇️
boa_engine/src/syntax/ast/node/call/mod.rs	`52.94% <0.00%> (-16.29%)`	⬇️
.../syntax/ast/node/conditional/conditional_op/mod.rs	`45.00% <0.00%> (-19.29%)`	⬇️
...ine/src/syntax/ast/node/conditional/if_node/mod.rs	`52.00% <0.00%> (-13.00%)`	⬇️
...ax/ast/node/declaration/arrow_function_decl/mod.rs	`25.92% <0.00%> (-5.90%)`	⬇️
... and 76 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6498216...dd85b72. Read the comment docs.

Razican

Looks very good now, just some minor things. I would also still like to have a file in the /docs folder with information on how to execute the fuzzer, how it works and so on.

boa_inputgen/Cargo.toml

boa_inputgen/src/ident_walk.rs

evil.js

Razican · 2022-03-17T18:04:04Z

boa_inputgen/src/ident_walk.rs

+fn extendo<T>(node: &mut T) -> &'static mut T {
+    unsafe { &mut *(node as *mut T) }
+}


Maybe it makes sense to make the function unsafe, and then use the unsafe block where it's used, to make sure that each call is safe (so, invariants are checked on each call)

Okay, I've revamped how extendo (now extend_lifetime -- the original name was more of a personal joke) handles lifetime extension so that it restricts the lifetime to a consistent, non-static lifetime that's decoupled from the original.

This is an unbounded lifetime. It's safe in this context, since we don't return the relevant Vec and all the references point to AST members.

jedel1043

Still missing a review for the fuzz crate, I'll submit my current suggestions to give you time to fix or answer :)

boa_inputgen/src/data.rs

jedel1043 · 2022-03-17T20:27:08Z

boa_inputgen/src/ident_walk.rs

+    }
+}
+
+fn replace_declpattern<'a>(


I'm curious, wouldn't a recursive function remove all instances of extend_lifetime? I'm asking in case you did try it and the borrow checker still complained

It was significantly slower. I originally implemented this as a recursive function which just entirely rebuilt the AST, then a recursive function which mutably descended the AST, and both were significantly slower -- to the point that it affected execution speed of the fuzzer. This walk is much faster.

Huh, that's weird, since rustc uses a recursive visitor. Approximately what was the performance difference between the two implementations?

A few ms per fuzzer test case, but enough to slow me down from about 500 inputs/s to 450 or less. I didn't measure it particularly thoroughly, swapped it over to test the performance difference and had a speed-up so I kept it. 😅

I'm gonna implement that visitor and it'll probably be quicker regardless. This is more slapped together, as you can tell from the use of extend_lifetime.

jedel1043 · 2022-03-17T20:52:31Z

boa_inputgen/src/ident_walk.rs

@@ -0,0 +1,576 @@
+//! Identifier and symbol walker for ensuring that generated inputs do not fail with "string


The function signatures on this file are extremely similar. I would recommend some alternatives:

Create a ReplaceSym trait with a replace method, then implement that only to the statements that transitively contain a Sym.

Put all the functions inside replace_inner so that you can easily see which procedure corresponds to which statement.

Create a fold and a map function for our AST. Personally, I would try to implement this first and fallback to a trait otherwise.

Whoof, implementing a type visitor would certainly be very helpful but I'm not sure if that should also be in this commit. ReplaceSym trait seems like the better option.

Whoof, implementing a type visitor would certainly be very helpful but I'm not sure if that should also be in this commit. ReplaceSym trait seems like the better option.

Yeah, pretty much. You can however open another PR with the change 😉
Jokes aside, we really appreciate code cleanups in our codebase, so if you have any ideas on how to improve our internal APIs, please open up an issue or a PR, we could guide you through our codebase if you need to 😊

jedel1043 · 2022-03-17T21:32:43Z

boa_interner/src/lib.rs

+    #[cfg(not(feature = "fuzzer"))]
    const fn as_raw(self) -> NonZeroUsize {
        self.value
    }
+
+    #[cfg(feature = "fuzzer")]
+    pub const fn as_raw(self) -> NonZeroUsize {
+        self.value
+    }


If users just need to use a feature to call a private function, I'd just expose it as public in the first place. It doesn't really matter in this case since getting a raw NonZeroUsize is not useful to interact with the interner.

This change comes from a place of "change as little as possible of the underlying implementation". :) I'll just replace it.

jedel1043 · 2022-03-17T21:42:57Z

I'm noticing you're merging instead of rebasing when conflicts occur. That's not ideal, since a deletion diff would reappear with a merge, and that has happened to this PR several times already. Unfortunately you cannot switch to rebasing in this PR, or the commit tree will implode and it will be a pain to fix (I speak from personal experience 😅), but I would advise you to rebase instead of merging on your next contributions 😁

addisoncrump · 2022-03-17T21:43:57Z

I'm noticing you're merging instead of rebasing when conflicts occur. That's not ideal, since a deletion diff would reappear with a merge, and that has happened to this PR several times already. Unfortunately you cannot switch to rebasing in this PR, or the commit tree would implode and it would be a pain to fix (I speak from personal experience sweat_smile), but I would advise you to rebase instead of merging on your next contributions grin

Good point. My git-fu needs training...

Co-authored-by: jedel1043 <[email protected]>

addisoncrump · 2022-03-18T01:56:25Z

Switching over to AST-walking based Sym replacement in a separate PR.

addisoncrump added 4 commits March 6, 2022 02:17

init fuzzer

95c8623

appease the mighty rustfmt

bba1252

fix clippy errors by simply allowing it!

6134df0

fix clippy + build by using spin and iter_mut

1550ce4

addisoncrump added 9 commits March 6, 2022 03:29

simplify fuzzer input generation

b7c7caa

fix compile errors when not in feature fuzzer

0e5ad66

add missing cfg

c6ed17b

better clippy lint control

377f55f

standardise arbitrary impl for Name

0d500b9

final touches

e56192c

fix issue caused by a previous bad merge

00cbbb1

update deps for inputgen

4b17781

Merge branch 'main' of github.com:boa-dev/boa into experimental-fuzzer

53ca1a6

jedel1043 requested review from raskad, Razican and HalidOdat March 7, 2022 19:06

jedel1043 added enhancement New feature or request test Issues and PRs related to the tests. labels Mar 7, 2022

jedel1043 modified the milestones: v0.14.0, v0.15.0 Mar 7, 2022

addisoncrump added 3 commits March 7, 2022 21:01

Merge branch 'main' of github.com:boa-dev/boa into experimental-fuzzer

186be92

fix various updates w.r.t. assigntarget

ea9540b

fix some of the tree walking (string disappeared fixes)

621838f

Razican requested review from jasonwilliams, jedel1043 and RageKnify March 12, 2022 10:45

addisoncrump added 2 commits March 12, 2022 13:47

Merge branch 'main' of github.com:boa-dev/boa into experimental-fuzzer

ca9d0ab

explicitly handle this and empty in case new cases are introduced later

30dc27c

addisoncrump added 8 commits March 17, 2022 09:24

mergeup

4f82142

update cargo information to be consistent with others

d38fef5

split inputgen

da91d86

explain max_insns

6abaa1b

docs

750d166

whoops, remove testing file

c1bd554

whoops, missed a docs addition

990740b

fix clippy warnings + errors

ed7eae6

Razican reviewed Mar 17, 2022

View reviewed changes

boa_inputgen/Cargo.toml Outdated Show resolved Hide resolved

boa_inputgen/src/ident_walk.rs Outdated Show resolved Hide resolved

boa_inputgen/src/ident_walk.rs Show resolved Hide resolved

evil.js Outdated Show resolved Hide resolved

addisoncrump added 3 commits March 17, 2022 12:54

Merge branch 'main' of github.com:boa-dev/boa into experimental-fuzzer

6b52c4d

remove unnecessary excludes

e053e29

fix clippy, again

61855a5

Razican reviewed Mar 17, 2022

View reviewed changes

addisoncrump added 8 commits March 17, 2022 13:20

Make extend_lifetime unsafe + give it explicit lifetimes

3ece1e4

whoops, explicit lifetime on replace_inner

47d484a

fix clippy lints

d4bfe84

remove irrelevant comment

7826bac

Add fuzzing docs

ba98b69

typo + link for de-arbitrary

0910729

update command

d18ed44

formatter time

a615901

jedel1043 requested changes Mar 17, 2022

View reviewed changes

clarify character generation in Name

dd85b72

Co-authored-by: jedel1043 <[email protected]>

addisoncrump closed this Mar 18, 2022

addisoncrump mentioned this pull request Mar 18, 2022

Create a structure-aware JavaScript fuzzer addisoncrump/boa#1

Closed

Razican removed this from the v0.15.0 milestone Jun 1, 2022

addisoncrump mentioned this pull request Jul 6, 2022

Create a structure-aware fuzzer (second try) #2169

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a structure-aware JavaScript fuzzer to find deep bugs #1902

Create a structure-aware JavaScript fuzzer to find deep bugs #1902

addisoncrump commented Mar 6, 2022 •

edited

Loading

codecov bot commented Mar 6, 2022 •

edited

Loading

Razican left a comment

Razican Mar 17, 2022

addisoncrump Mar 17, 2022

addisoncrump Mar 17, 2022 •

edited

Loading

jedel1043 left a comment

jedel1043 Mar 17, 2022

addisoncrump Mar 17, 2022

jedel1043 Mar 17, 2022

addisoncrump Mar 17, 2022

jedel1043 Mar 17, 2022

addisoncrump Mar 17, 2022

jedel1043 Mar 17, 2022

jedel1043 Mar 17, 2022 •

edited

Loading

addisoncrump Mar 17, 2022

jedel1043 commented Mar 17, 2022 •

edited

Loading

addisoncrump commented Mar 17, 2022

addisoncrump commented Mar 18, 2022

		@@ -0,0 +1,576 @@
		//! Identifier and symbol walker for ensuring that generated inputs do not fail with "string

Create a structure-aware JavaScript fuzzer to find deep bugs #1902

Create a structure-aware JavaScript fuzzer to find deep bugs #1902

Conversation

addisoncrump commented Mar 6, 2022 • edited Loading

codecov bot commented Mar 6, 2022 • edited Loading

Codecov Report

Razican left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

addisoncrump Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

jedel1043 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jedel1043 Mar 17, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jedel1043 commented Mar 17, 2022 • edited Loading

addisoncrump commented Mar 17, 2022

addisoncrump commented Mar 18, 2022

addisoncrump commented Mar 6, 2022 •

edited

Loading

codecov bot commented Mar 6, 2022 •

edited

Loading

addisoncrump Mar 17, 2022 •

edited

Loading

jedel1043 Mar 17, 2022 •

edited

Loading

jedel1043 commented Mar 17, 2022 •

edited

Loading