<!-- Copyright 2020 The Chromium Authors. All rights reserved.
     Use of this source code is governed by a BSD-style license that can be
     found in the LICENSE file.
-->

# Chrome Benchmarking System

# Overview

This directory contains benchmarks and infrastructure to test Chrome and
Chromium and output performance measurements. These benchmarks are continuously
run on the [perf waterfall](https://ci.chromium.org/p/chrome/g/chrome.perf/console).

For more information on how Chrome measures performance, see
[here](/docs/speed/how_does_chrome_measure_performance.md).

# Using The Chrome Benchmarking System

## Analyzing Results From The Perf Waterfall

The [ChromePerf Dashboard](https://chromeperf.appspot.com/) is the destination
for all metrics generated by the perf waterfall. It provides tools to chart the
performance of a chosen set of tests and metrics over time, and it can launch a
bisection directly from a selected point on a chart.

## Running A Single Test

The Chrome Benchmarking System has two methods for manually running performance
tests: run_benchmark and Pinpoint.

run_benchmark is useful for creating and debugging benchmarks using local
devices. Run from the command line, it has a number of flags useful for
inspecting the internal state of the benchmark. For more information, see
[here](https://chromium.googlesource.com/catapult.git/+/HEAD/telemetry/docs/run_benchmarks_locally.md).
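
For example, a local debugging run might look like the following (a hedged
sketch: speedometer2 stands in for any benchmark name, and the optional
--story-filter and --pageset-repeat flags narrow the run while you iterate):

```
./tools/perf/run_benchmark run speedometer2 --browser=system \
    --story-filter=STORY_NAME --pageset-repeat=1
```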

[Pinpoint](https://pinpoint-dot-chromeperf.appspot.com/) wraps run_benchmark and
provides the ability to remotely run A/B benchmarks using any platform available
in our lab. It will run a benchmark for as many iterations as needed to get a
statistically significant result, then visualize it.

If you're trying to debug a test or figure out how the infrastructure works,
the easiest way is to set up the debugger in VSCode (guide
[here](../../docs/vscode_python.md)) and set a breakpoint in
/tools/perf/core/benchmark_runner.py.
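
If you prefer the terminal, a trace point works too: add a line like the one
below (standard library pdb, nothing repo-specific) at the point of interest in
/tools/perf/core/benchmark_runner.py and run run_benchmark normally.

```
# Pauses execution here and opens the interactive pdb debugger.
import pdb; pdb.set_trace()
```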

## Creating New Tests (stories)

[This document](https://chromium.googlesource.com/catapult.git/+/HEAD/telemetry)
provides an overview of how tests are structured and some of the underlying
technologies. After reading that doc, figure out whether your story fits into an
existing benchmark by checking
[here](https://goto.google.com/chrome-benchmarking-sheet) (or
[here](https://bit.ly/chrome-benchmarks) for non-Googlers).

* If it does, follow the instructions next to it. If there are no instructions,
  find the test type in src/tools/perf/page_sets.
* Otherwise, read [this](https://docs.google.com/document/d/1ni2MIeVnlH4bTj4yvEDMVNxgL73PqK_O9_NUm3NW3BA/edit).

After figuring out where your story fits, create a new one. There is a
considerable amount of variation between different benchmarks, so use a nearby
story as a model. You may also need to introduce custom JavaScript to drive
interactions on the page or to deal with nondeterminism. For an example, search
[this file](https://source.chromium.org/chromium/chromium/src/+/master:tools/perf/page_sets/system_health/browsing_stories.py?q=browsing_stories.py&ss=chromium)
for browse:tools:sheets:2019.
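
As a hedged illustration of the shape these stories tend to take (the class and
selectors below are hypothetical, though the method names follow telemetry's
page conventions; copy a nearby story in page_sets for the exact base class and
hooks your benchmark expects):

```
from telemetry.page import page as page_module

class ExampleBrowsingStory(page_module.Page):
  def RunPageInteractions(self, action_runner):
    # Wait for the app to render before interacting, to reduce flakiness.
    action_runner.WaitForElement(selector='#main')
    # Custom JavaScript can drive interactions that the built-in actions
    # cannot express.
    action_runner.ExecuteJavaScript(
        'document.querySelector("#main button").click()')
    # Scrolling is a typical interaction for a browsing story.
    action_runner.ScrollPage()
```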

Next, we need to use WPR (WebPageReplay) to record all of the content requested
by the test. By default, tests spin up a local webserver using these recordings,
removing one source of nondeterminism. To do that, run:

```
./tools/perf/record_wpr --browser=system --story-filter=STORY_NAME BENCHMARK_NAME
```

Next, we need to verify the recording works. To do so, run the test:

```
./tools/perf/run_benchmark run BENCHMARK_NAME --browser=system --story-filter=STORY_NAME
```

After running this, you will need to verify the following:

* Does the browser behave the same as it did when creating the recording? If
  not, is the difference in behavior acceptable?
* Are there any concerning errors generated by Chrome when running
  run_benchmark? These will appear in the output of run_benchmark.
* Check the results at the link generated by run_benchmark. Does everything
  look reasonable?

If any problems were encountered, review or add custom JavaScript as described
in the previous section. Alternatively, ask for help.

If everything looks good, upload your WPR archive by following the instructions
in [Upload the recording to Cloud Storage](https://sites.google.com/a/chromium.org/dev/developers/telemetry/record_a_page_set)
and create a CL.
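
The flow in that doc typically boils down to something like the following
(hedged: the bucket and file names here are placeholders and the linked
instructions are authoritative; upload_to_google_storage.py ships with
depot_tools):

```
upload_to_google_storage.py --bucket chrome-partner-telemetry \
    tools/perf/page_sets/data/my_story_000.wprgo
# Commit the generated .sha1 file with your CL, not the archive itself.
git add tools/perf/page_sets/data/my_story_000.wprgo.sha1
```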

# Tools In This Directory

This directory contains a variety of tools that can be used to run benchmarks,
interact with speed services, and manage performance waterfall configurations.
It also has commands for running functional unit tests.

## run_tests

This command allows you to run functional tests against the Python code in this
directory. For example, try:

```
./run_tests results_dashboard_unittest
```

Note that the positional argument can be any substring within the test name.

This may require you to run `gsutil config` first.

## run_benchmark

This command allows running benchmarks defined in the Chromium repository,
specifically in [tools/perf/benchmarks][benchmarks_dir]. If you need it,
documentation is available on how to [run benchmarks locally][run_locally] and
how to properly [set up your device][device_setup].

[benchmarks_dir]: https://cs.chromium.org/chromium/src/tools/perf/benchmarks/
[run_locally]: https://chromium.googlesource.com/catapult.git/+/HEAD/telemetry/docs/run_benchmarks_locally.md
[device_setup]: /docs/speed/benchmark/telemetry_device_setup.md
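
A couple of hedged examples (speedometer2 stands in for any benchmark name):

```
# List the benchmarks defined in tools/perf/benchmarks.
./tools/perf/run_benchmark list

# Run one of them against a locally installed Chrome.
./tools/perf/run_benchmark run speedometer2 --browser=system
```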

## update_wpr

A helper script to automate various tasks related to the update of
[Web Page Recordings][wpr] for our benchmarks. It can help create new
recordings from live websites, replay them to make sure they work, upload them
to cloud storage, and finally send a CL for review with the new recordings.

[wpr]: https://github.com/catapult-project/catapult/tree/master/web_page_replay_go

## pinpoint_cli

A command line interface to the [pinpoint][] service. It allows you to create
new jobs, check the status of jobs, and fetch their measurements as CSV files.

[pinpoint]: https://pinpoint-dot-chromeperf.appspot.com

## flakiness_cli

A command line interface to the [flakiness dashboard][].

[flakiness dashboard]: https://test-results.appspot.com/dashboards/flakiness_dashboard.html

## soundwave

Fetches data from the [Chrome Performance Dashboard][chromeperf] and stores it
locally in a SQLite database for further analysis and processing. It also
allows defining [studies][], presets of measurements a team is interested in
tracking, and uploads them to cloud storage to visualize with the help of
[Data Studio][]. This currently backs the [v8][v8_dashboard] and
[health][health_dashboard] dashboards.

[chromeperf]: https://chromeperf.appspot.com/
[studies]: https://cs.chromium.org/chromium/src/tools/perf/cli_tools/soundwave/studies/
[Data Studio]: https://datastudio.google.com/
[v8_dashboard]: https://datastudio.google.com/s/iNcXppkP3DI
[health_dashboard]: https://datastudio.google.com/s/jUXfKZXXfT8

## pinboard

Allows scheduling daily [pinpoint][] jobs to compare measurements with and
without a patch applied. This is useful for teams developing a new feature
behind a flag who want to track the effects on performance as development of
their feature progresses. Processed data for relevant measurements is uploaded
to cloud storage, where it can be read by [Data Studio][]. This also backs data
displayed on the [v8][v8_dashboard] dashboard.
164