
Adds system benchmarks command #1232

Merged (42 commits) on Jun 22, 2023

Conversation

@marc-gr (Contributor) commented on Apr 24, 2023:

This adds the ability to define system (end to end) benchmarks at the package level.

  • Refactor benchrunner to decouple reports presentation from pipeline benchmarks internals
  • Add benchmark system command
  • Collect indices, disk and ingest metrics from all nodes
  • Implement docker input services
  • Ability to generate data on the fly using the corpus generator
  • Report to STDOUT a summary of the benchmark metrics
  • Report to a file
  • Add documentation
  • Add tests

Depends on elastic/package-spec#512
Closes #1164

TODO in the future (in no specific order):

  • Collect metrics from elastic agents with system package
  • Seed generator for consistent data
  • Send benchmark runtime metrics to the ES Metricstore
  • Add command to compare two benchmark runs
  • Support more input service types (like system tests)
  • Support benchmark scenarios defined outside of the package scope (for easily running scheduled benchmarks)
  • Create benchmarks dashboard (maybe through a benchmark integration)

Example of usage:

With a benchmark scenario defined as

<package root>/_dev/benchmark/system/100mb-logs-benchmark.yml

---
description: Benchmark 100MiB of data ingested
input: filestream
vars: ~
data_stream.name: test
data_stream.vars.paths:
  - "{{SERVICE_LOGS_DIR}}/corpus-*"
warmup_time_period: 10s
corpora.generator.size: 100MiB
corpora.generator.template.path: ./100mb-logs-benchmark/template.log
corpora.generator.config.path: ./100mb-logs-benchmark/config.yml
corpora.generator.fields.path: ./100mb-logs-benchmark/fields.yml
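
The corpora.generator.*.path settings point at files that live in the test package next to the scenario definition. The layout below is inferred from the paths in the example above, assuming they are resolved relative to the scenario file; the template, config and fields file contents are not shown here:

<package root>/_dev/benchmark/system/
├── 100mb-logs-benchmark.yml
└── 100mb-logs-benchmark/
    ├── template.log
    ├── config.yml
    └── fields.yml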

We then run:

elastic-package benchmark system --benchmark 100mb-logs-benchmark -v

This generates a benchmark report:

--- Benchmark results for package: system_benchmarks - START ---
╭─────────────────────────────────────────────────────╮
│ info                                                │
├──────────────┬──────────────────────────────────────┤
│ benchmark    │                 100mb-logs-benchmark │
│ description  │    Benchmark 100MiB of data ingested │
│ run ID       │ d2960c04-0028-42c9-bafc-35e599563cb1 │
│ package      │                    system_benchmarks │
│ start ts (s) │                           1682320355 │
│ end ts (s)   │                           1682320355 │
│ duration     │                                 2m3s │
╰──────────────┴──────────────────────────────────────╯
╭───────────────────────────────────────────────────────────────────────╮
│ parameters                                                            │
├─────────────────────────────────┬─────────────────────────────────────┤
│ package version                 │                         999.999.999 │
│ input                           │                          filestream │
│ data_stream.name                │                                test │
│ data_stream.vars.paths          │        [/tmp/service_logs/corpus-*] │
│ warmup time period              │                                 10s │
│ benchmark time period           │                                  0s │
│ wait for data timeout           │                                  0s │
│ corpora.generator.size          │                              100MiB │
│ corpora.generator.template.path │ ./100mb-logs-benchmark/template.log │
│ corpora.generator.template.raw  │                                     │
│ corpora.generator.template.type │                                     │
│ corpora.generator.config.path   │   ./100mb-logs-benchmark/config.yml │
│ corpora.generator.config.raw    │                               map[] │
│ corpora.generator.fields.path   │   ./100mb-logs-benchmark/fields.yml │
│ corpora.generator.fields.raw    │                               map[] │
╰─────────────────────────────────┴─────────────────────────────────────╯
╭───────────────────────╮
│ cluster info          │
├───────┬───────────────┤
│ name  │ elasticsearch │
│ nodes │             1 │
╰───────┴───────────────╯
╭─────────────────────────────────────────────────────────────╮
│ data stream stats                                           │
├────────────────────────────┬────────────────────────────────┤
│ data stream                │ logs-system_benchmarks.test-ep │
│ approx total docs ingested │                         410127 │
│ backing indices            │                              1 │
│ store size bytes           │                      136310570 │
│ maximum ts (ms)            │                  1682320467448 │
╰────────────────────────────┴────────────────────────────────╯
╭───────────────────────────────────────╮
│ disk usage for index .ds-logs-system_ │
│ benchmarks.test-ep-2023.04.22-000001  │
│ (for all fields)                      │
├──────────────────────────────┬────────┤
│ total                        │ 99.8mb │
│ inverted_index.total         │ 31.3mb │
│ inverted_index.stored_fields │ 35.5mb │
│ inverted_index.doc_values    │   30mb │
│ inverted_index.points        │  2.8mb │
│ inverted_index.norms         │     0b │
│ inverted_index.term_vectors  │     0b │
│ inverted_index.knn_vectors   │     0b │
╰──────────────────────────────┴────────╯
╭───────────────────────────────────────────────────────────────────────────────────────────╮
│ pipeline logs-system_benchmarks.test-999.999.999 stats in node Qa9ujRVfQuWhqEESdt6xnw     │
├───────────────────────────────────────────────┬───────────────────────────────────────────┤
│ grok ()                                       │ Count: 407819 | Failed: 0 | Time: 16.615s │
│ user_agent ()                                 │   Count: 407819 | Failed: 0 | Time: 768ms │
│ pipeline (logs-system_benchmarks.test@custom) │    Count: 407819 | Failed: 0 | Time: 59ms │
╰───────────────────────────────────────────────┴───────────────────────────────────────────╯

--- Benchmark results for package: system_benchmarks - END   ---
Done

How to run this example locally:

$ make install
$ elastic-package stack up -v -d
$ eval "$(elastic-package stack shellinit)"
$ cd test/packages/benchmarks/system_benchmark/
$ elastic-package build
$ elastic-package install
$ elastic-package benchmark system --benchmark 20mb-logs-benchmark -v

@marc-gr marc-gr marked this pull request as ready for review April 24, 2023 09:23
@jsoriano jsoriano requested a review from mrodm May 15, 2023 22:42
@marc-gr marc-gr requested review from aspacca and mrodm June 12, 2023 08:17
@marc-gr marc-gr requested a review from ruflin June 12, 2023 08:45
@@ -0,0 +1,40 @@
- name: IP
A reviewer (Contributor) commented:

Is it by design that we don't use names "similar" to ECS here, like source.ip? I'm aware this is Schema A, so not ECS, but at the same time using ECS could make it easier to read (if possible).

This is just an example for benchmark tests; I'm mostly worried it sets a precedent for how it should be done.

@marc-gr (Contributor, Author) replied on Jun 20, 2023:

In this case, since this represents Schema A (the data as it comes from the source), I thought it made more sense to let the field names represent what they are in the original logs, since there will not always even be an ECS field to represent the original data (which often results from transformations or compositions of several fields, etc.). I do not have a strong opinion on this one, but I think in this case it is not too troublesome.
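
For illustration, the two naming options discussed in this thread would look like this in a fields definition (the field type below is assumed for the example, not taken from the PR):

- name: IP          # Schema A: keep the field name as it appears in the original logs
  type: ip

- name: source.ip   # ECS-style alternative raised in the review
  type: ip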

@ruflin (Contributor) commented on Jun 14, 2023:

I tested this with the commands provided above and also changed the corpora size for testing. elastic-package install is missing from the command list above (it is needed to make sure the package is set up), but that is only a testing issue.

Overall, this looks great and I think we should get this in rather soon. I did not review the code on my end, only whether the command works as expected. One thing I noticed: the report says the corpora was generated under [/tmp/service_logs/corpus-*], but it seems to be generated under /Users/ruflin/.elastic-package/tmp/service_logs (relative path). It also seems that old corpora are not cleaned up, so if I run it again, will it also ingest the previous ones?

@mrodm (Contributor) left a comment:

LGTM.
Just a note: the README would need to be re-generated; there is still a reference to pipeline-bench.

Pending a successful CI run; there are some lint errors related to the error messages:
https://buildkite.com/elastic/elastic-package/builds/968#0188aec7-c0b5-48ca-a612-9c62fe86407a/188-330

@bturquet bturquet mentioned this pull request Jun 15, 2023
@marc-gr (Contributor, Author) commented on Jun 20, 2023:

All comments in the PR are addressed now; it relies on some pending spec changes added in elastic/package-spec#526.

@elasticmachine (Collaborator) commented:

💚 Build Succeeded


cc @marc-gr

@mrodm (Contributor) left a comment:

Thanks for addressing all the comments!
LGTM

Namespace          string   `json:"namespace"`
Revision           int      `json:"revision,omitempty"`
MonitoringEnabled  []string `json:"monitoring_enabled"`
MonitoringOutputID string   `json:"monitoring_output_id"`
A reviewer (Member) commented:

@marc-gr it looks like this field is not available in old versions of Kibana, and it also seems that it is not used here. Was it needed?

Integrations CI is failing for something related to this field: elastic/integrations#6673.

Trying to fix it by adding omitempty in #1320.

cc @mrodm
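
For context, the follow-up in #1320 amounts to adding omitempty to that field's JSON tag so it is dropped from the payload when empty. A minimal sketch, with the surrounding struct name and shape assumed (only the field names and tags come from the hunk above):

type policy struct {
	Namespace          string   `json:"namespace"`
	Revision           int      `json:"revision,omitempty"`
	MonitoringEnabled  []string `json:"monitoring_enabled"`
	// omitempty keeps the field out of requests when unset, so older Kibana
	// versions that do not know monitoring_output_id do not reject the policy.
	MonitoringOutputID string `json:"monitoring_output_id,omitempty"`
}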

Labels: enhancement (New feature or request)

Successfully merging this pull request may close these issues: Add end-to-end benchmark command (#1164)

7 participants