Performance monitoring; GitHub integration #1472

marshallward · 2021-08-18T17:36:13Z

This patch introduces two new testing targets to the verification suite
based on a small configuration based on the benchmark regression test.

The profile test is saved as p0 in .testing. Future tests can be
included if appropriate.

The new targets:

make profile: Run the model and record the FMS timings.
make perf: Run the model through the perf tool and record timings
for the resolvable functions (as symbols).

In both cases, the timings are converted to JSON output files and the
top results are reported to stdout, and readable in GitHub actions
output. It can also be run locally.

Support Python scripts have been included to do this work. This will
require a functional Python environment.

Some system and configuration data is logged alongside the timings, but
this is still rather incomplete and needs some further planning.

Times are compared to the target build (usually dev/gfdl). ANSI
terminal highlighting (i.e. color) is to used to highlight excessive
differences.

Current issues:

Model configuration
GitHub timings are still rather unreliable, and should currently only
be treated as crude estimates. This should be considered a work in
progress.
The GitHub profiling rule still builds the standard configuration,
evem though it is unused.
Additional tools are required to push the timings to some database,
either a local sqlite3 or an external one.

This patch introduces two new testing targets to the verification suite based on a small configuration based on the `benchmark` regression test. The profile test is saved a `p0` in `.testing`. Future tests can be included if appropriate. The new targets: * `make profile`: Run the model and record the FMS timings. * `make perf`: Run the model through the `perf` tool and record timings for the resolvable functions (as symbols). In both cases, the timings are converted to JSON output files and the top results are reported to stdout, and readable in GitHub actions output. It can also be run locally. Support Python scripts have been included to do this work. This will require a functional Python environment. Some system and configuration data is logged alongside the timings, but this is still rather incomplete and needs some further planning. Times are compared to the target build (usually dev/gfdl). ANSI terminal highlighting (i.e. color) is to used to highlight excessive differences. Current issues: - Model configuration - GitHub timings are still rather unreliable, and should currently only be treated as crude estimates. This should be considered a work in progress. - The GitHub profiling rule still builds the standard configuration, evem though it is unused. - Additional tools are required to push the timings to some database, either a local sqlite3 or an external one.

marshallward · 2021-08-18T17:38:41Z

Although this work does not yet resemble what we would like to see, I think it's is developed enough to at least be included in the repository.

Future considerations include the following:

Better tuning to report a "slow" or "fast" timing.
Better stability on GitHub Actions: (though this is probably not an option)
Integration with a database for automatic logging.

codecov · 2021-08-18T17:41:26Z

Codecov Report

Merging #1472 (e85f291) into dev/gfdl (e0e70e3) will not change coverage.
The diff coverage is n/a.

@@            Coverage Diff            @@
##           dev/gfdl    #1472   +/-   ##
=========================================
  Coverage     29.13%   29.13%           
=========================================
  Files           235      235           
  Lines         71061    71061           
=========================================
  Hits          20707    20707           
  Misses        50354    50354

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e0e70e3...e85f291. Read the comment docs.

Hallberg-NOAA

This looks reasonable enough, and it has passed the TC testing and the pipeline testing at https://gitlab.gfdl.noaa.gov/ogrp/MOM6/-/pipelines/13469.

marshallward added 2 commits June 18, 2021 12:33

Merge branch 'make_profile' into make_prof_merge

6044c8b

Merge branch 'dev/gfdl' into make_prof_merge

e85f291

Hallberg-NOAA approved these changes Aug 27, 2021

View reviewed changes

Hallberg-NOAA merged commit 79fcdfb into mom-ocean:dev/gfdl Aug 27, 2021

marshallward mentioned this pull request Oct 4, 2021

Dev gfdl main candidate 2021 10 04 #1507

Merged

marshallward deleted the make_prof_merge branch October 20, 2021 13:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance monitoring; GitHub integration #1472

Performance monitoring; GitHub integration #1472

marshallward commented Aug 18, 2021

marshallward commented Aug 18, 2021

codecov bot commented Aug 18, 2021 •

edited

Loading

Hallberg-NOAA left a comment

Performance monitoring; GitHub integration #1472

Performance monitoring; GitHub integration #1472

Conversation

marshallward commented Aug 18, 2021

marshallward commented Aug 18, 2021

codecov bot commented Aug 18, 2021 • edited Loading

Codecov Report

Hallberg-NOAA left a comment

Choose a reason for hiding this comment

codecov bot commented Aug 18, 2021 •

edited

Loading