Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge 4-0-248 to backup_sync #176

Open
wants to merge 420 commits into
base: backup_sync
Choose a base branch
from
Open

Merge 4-0-248 to backup_sync #176

wants to merge 420 commits into from

Conversation

github-actions[bot]
Copy link

Created by Github action

xavierpinho and others added 30 commits September 24, 2024 16:21
There were some unused parser rules adding unnecessary complexity to the parser
Added `simpleCommandArgumentList` to allow multiple call args for `methodInvocationWithoutParentheses`
* fix assignmentModulo

* fix assignmentXor

* fix assignmentOr

* fix assignmentAnd

* fix assignmentExponentiation

* fix assignmentShiftLeft

* fix assignment{Logical,Arithmetic}ShiftRight

* scalafmt
…o#4955)

As soon as `FullNameSemantics` started logging duplicate semantics (in joernio#4954), it was observed that `SemanticTestCpg` was passing a fresh Semantics instance to `EngineContext` instead of passing the one used by `OssDataFlow`. By coincidence, however, they were the same. Nevertheless, this patch guarantees they are the same.
)

* [ruby] Fixed Argument Order on `ArgumentListContextHelper`
Use `line` and `column` information to order arguments correctly.

* Removed instances where sorting was done after the fact

* Fixed tests that validated incorrect behaviour
Using the CDT API (getContainingFilename) now.

For: joernio#4924
* [ruby] ANTLR Profiler Summary
* Introduced global profiling to summarize rule and parse performance across the project
* Added a shutdown hook to dump a summary of the profiler rules in a file `antlr_summary.log` at the root of the project with this information

* Check parent exists before dumping. May not be there during test cases

* Added note around shutdown hook
Sometimes AST trees would be thrown away, specifically when long chained expressions would be compressed into temporary cache variables. This only happens with expressions, and closures typically write directly to the diff graph.

This PR minimizes what is written directly to the diff graph from function creation, and extends `x2cpg.Ast` to also include `CAPTURE` edges. This also introduces `closureToRefs` map to track which `Block` nodes already have live method ASTs in the diff graph.
* [ruby] Changed handling of ForEach loops

* [ruby] fixed double _astIn reference to identifier

* [ruby] fix failing tests
…" (joernio#4985)

* Revert "Revert "[ruby] Ignore "Throwaway" AST Structures (joernio#4982)" (joernio#4983)"

This reverts commit 464480d.

* [ruby] Re-implemented "Ignore "Throwaway" AST Structures (joernio#4982)"
This correctly prevents re-use of nodes that are already being used elsewhere by ensuring deep copies.
 `RubyExpression` nodes don't inherently consider the `span` in the calculation of its hash, so when put into any hashed context, nodes that only rely on `span` alone will collide in these contexts.
… dir (joernio#4956)

The logic to guess the base dir of the installation is quite fiddly but
works for our use cases for astgen. PhpParser implemented something similar, but
not quite - and it failed for buildbot.

On buildbot the installation path for php2cpg is
`/worker/sptestV2-php2cpg/build/x2cpg-internal/php2cpg/target/universal/stage`
which (prior to this PR) leads to an invalid derived php-parser name
and the following error:
```
2024-09-25 09:30:08.623 ERROR Invalid path for PhpParserBin: /worker/sptestV2-/php2cpg/bin/php-parser/php-parser.php
```
…nse header (joernio#4995)

Turns out the HTTPServer expects an explicit "Connection: close" header which requests
the connection to be closed after the transaction ends. Otherwise, it would wait for a 10sec timeout
for the next thread to become available. In case we only allow for one thread (jimple2cpg) that would mean
additional waiting which renders the whole server approach useless. This in now fixes as we immediately close
the connection after the frontend is finished.
This happened for parameter and variable declarations in parentheses.
xavierpinho and others added 30 commits January 28, 2025 20:55
Just discovered the latest addition to joernio#5256 of the sarif config was `val` and would not be able to be redefined for tools extending the REPL.
* [sarif] Add Reporting Descriptors & More "Optionality"
* Added reporting descriptors which allow one to add more meta data to rules, and link findings to a given existing entry.
* Moved the sarif instantiation to the RunBeforeCode object which separates actions from tools deriving from Joern
* Using more "optional" properties where possible on properties which are not required by the sarif schema

* Moved parameter to back

* Test expectations
…o#5273)

GitHub code scanning has strict requirements on SARIF files that need to be enforced, such as minimum elements in an array, line number > 0, etc. Some validators emit warnings on this, but GH fails the pipeline.

Additionally, GitHub makes use of fingerprinting to avoid duplication between versions.

The SARIF conversion has been adapted to accommodate the above.
… file signature format (joernio#5274)

* First version of a JVM binary signature calculator for java parser classes.

* Add binary signatures to javasrc nodes

* Add generic signature for native foreach iterator tmp local

* Do some cleanup

* Add test for type parameter with multiple interface bounds

* Add generic tests description and add unspecified type to lambda type decls

---------

Co-authored-by: Markus Lottmann <[email protected]>
* [pysrc2cpg] Refactor convert methods for function lowering.

Use shared implementation.

* Separate method full names for redefined functions.

* Fmt
…151) (joernio#5286)

* handle errors gracefully

* use scala best practices

* trim dependency names and versions

* test for asserting if version is trimmed

Co-authored-by: Karan Batavia <[email protected]>
…ls (joernio#5290)

* [c#] initial support for replacing setter assignments with setter calls

* translateBinaryOperatorName -> binaryOperatorsMap
* Using Block Scope fix (#109)

* fix methodFullName inside using block

* code refactoring

* Try handling change

* added curly braces

* fix import

* review comments

---------

Co-authored-by: Ankit Kumar <[email protected]>
* fix - partial ast gen result in case of failure

---------

Co-authored-by: PallaviShreshtha <[email protected]>
In the case where an unexpected string in placed in the `filename` property, default to attempting URI construction within a `Try` and returning a `None` on failure
* [c#] fix synthetic set_* method call signature

* [c#] support setter assignments via += et al assignments
Lowering regex matches to reflect assignments made to global variables. The lowering becomes:
```ruby
 tmp = 'hello'.match(/h(el)lo/)
 if tmp
   $~ = tmp
   $& = tmp[0]
   tmp.begin(0)
 else 
   $~ = nil
   $& = nil
   nil
 end
```
Some heuristics to assign group matching global variables is next.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.