Blame - docs/speed/perf_trybots.md - chromium/src

blob: 1990d34e0c9c15645e9e111d9df0165c68a17721 [file] [log] [blame] [view]

sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	1	# Perf Try Bots
				2
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	3	Chrome has a performance lab with dozens of device and OS configurations.
				4	[Pinpoint](https://pinpoint-dot-chromeperf.appspot.com) is the service that lets
				5	you run performance tests in the lab. With Pinpoint, you can run try jobs, which
				6	let you put in a Gerrit patch, and it will run tip-of-tree with and without the
				7	patch applied.
				8
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	9	[TOC]
				10
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	11	## Why perf try jobs?
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	12
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	13	* All of the devices exactly match the hardware and OS versions in the perf
				14	continuous integration suite.
				15	* The devices have the "maintenance mutex" enabled, reducing noise from
				16	background processes.
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	17	* Some regressions take multiple repeats to reproduce, and Pinpoint
				18	automatically runs multiple times and aggregates the results.
				19	* Some regressions reproduce on some devices but not others, and Pinpoint will
				20	run the job on multiple devices.
Leina Sun	f31a3e2e	2023-01-28 00:16:44	[diff] [blame]	21	* Each iteration runs both arms on the same device, eliminating confounding factors like across-device variability
Simon	dfc644a1	2018-01-12 21:59:03	[diff] [blame]	22
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	23	## Starting a perf try job
				24
John Chen	b2002a3d8	2020-03-25 03:14:35	[diff] [blame]	25	* Visit [Pinpoint](https://pinpoint-dot-chromeperf.appspot.com).
				26	* Check the upper-right corner of the page. If you see a "Sign in" link,
				27	click it and sign in with an account that has trybot access.
				28	(If the link shows "Sign out", then you are already signed in.)
				29	* Click the perf try button in the bottom right corner of the screen.
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	30
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	31	![Pinpoint Perf Try Button](images/pinpoint-perf-try-button.png)
				32
Simon	dfc644a1	2018-01-12 21:59:03	[diff] [blame]	33	You should see the following dialog popup:
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	34
Simon	dfc644a1	2018-01-12 21:59:03	[diff] [blame]	35	![Perf Try Dialog](images/pinpoint-perf-try-dialog.png)
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	36
Caleb Rouleau	c565effe	2020-03-23 22:37:09	[diff] [blame]	37	Benchmark Configuration\| Description
Simon	dfc644a1	2018-01-12 21:59:03	[diff] [blame]	38	--- \| ---
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	39	Bot \| The device type to run the test on. All hardware configurations in our perf lab are supported.
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	40	Benchmark \| A telemetry benchmark. E.g. `system_health.common_desktop`<br><br>All the telemetry benchmarks are supported by the perf trybots. To get a full list, run `tools/perf/run_benchmark list`<br><br>To learn more about the benchmarks, you can read about the [system health benchmarks](https://docs.google.com/document/d/1BM_6lBrPzpMNMtcyi2NFKGIzmzIQ1oH3OlNG27kDGNU/edit?ts=57e92782), which test Chrome's performance at a high level, and the [benchmark harnesses](https://docs.google.com/spreadsheets/d/1ZdQ9OHqEjF5v8dqNjd7lGUjJnK6sgi8MiqO7eZVMgD0/edit#gid=0), which cover more specific areas.
Caleb Rouleau	c565effe	2020-03-23 22:37:09	[diff] [blame]	41	Story \| (optional) A specific story from the benchmark to run. Note that if the story you want isn't on the dropdown it could be because the story is new and so the Chromeperf dashboard database doesn't know about it yet. In that case you can still free-form type the exact story name into the field.
				42	Story Tags \| (optional) A list of story tags. All stories in the given benchmark that match any of the tags will be run.
sullivan	10cf4db1	2017-06-17 01:11:37	[diff] [blame]	43
Caleb Rouleau	c565effe	2020-03-23 22:37:09	[diff] [blame]	44	Note that you must provide either a Story or a Story Tag for Pinpoint to run.
				45	Per [this explanation](https://bugs.chromium.org/p/chromium/issues/detail?id=1017811#c6), running an entire benchmark on Pinpoint can cause significant problems if the benchmark is large. For this reason, some small benchmarks have an 'all' tag available that applies to all the stories in the benchmark, so please use that tag to run all the stories for a small benchmark. Please see [this bug](https://bugs.chromium.org/p/chromium/issues/detail?id=1023451) for details on work to add the 'all' tag to more benchmarks. If you want to run a large benchmark, consider choosing one of the tags that benchmark provides to select a subset of the available stories for that benchmark.
				46
				47	<br><br>
				48
				49	Job Configuration\| Description
				50	--- \| ---
Leina Sun	f31a3e2e	2023-01-28 00:16:44	[diff] [blame]	51	Attempt Count \| The number of iterations Pinpoint will run on both arms. Pinpoint will spread iterations evenly across all available devices. Pinpoint will also randomize which arm runs first and ensure that the number of iterations going first are the same for both arms.
Leina Sun	8e72ca22	2023-02-06 20:28:49	[diff] [blame]	52	Base Git Hash \| The Git Hash of the control arm. This git hash must have already landed on main or a release branch and cannot be the git hash associated with a gerrit CL in flight. Default is `HEAD`.
				53	Exp Git Hash \| Same as Base Git Hash for the experiment arm. Default is `HEAD`.
				54	Base Patch \| (optional) The patch you want the control arm to run the benchmark on. Patches in dependent repos (e.g. v8, skia) are supported. Pinpoint will also post updates on the Gerrit comment list. Must be entered as a URL.
Leina Sun	f31a3e2e	2023-01-28 00:16:44	[diff] [blame]	55	Exp Patch \| (optional) Same as Base Patch for the experiment arm.
Funing Wang	7cc2750	2023-01-30 23:49:13	[diff] [blame]	56	Extra arguments on base commit \| (optional) Extra arguments for the test. E.g. `--extra-chrome-categories=foo,bar`<br>or`--enable-features=foo,bar`(shortening the args by omitting "--extra-browser-args" prefix)<br><br>To see all arguments, run `tools/perf/run_benchmark run --help`
Leina Sun	f31a3e2e	2023-01-28 00:16:44	[diff] [blame]	57	Extra arguments on experiment commit \| (optional) Same as base commit for the experiment arm. Note that some arguments will apply to both arms.
				58	Monorail Project \| The repo the Git hashes are from. Default is `chromium`.
Caleb Rouleau	c565effe	2020-03-23 22:37:09	[diff] [blame]	59	Bug ID \| (optional) A bug ID. Pinpoint will post updates on the bug.
Leina Sun	f31a3e2e	2023-01-28 00:16:44	[diff] [blame]	60	Batch ID \| (optional) A batch ID used to track relevant jobs for the Chrome Health Initiative. We recommend leaving this blank.
Caleb Rouleau	c565effe	2020-03-23 22:37:09	[diff] [blame]	61
Leina Sun	8e72ca22	2023-02-06 20:28:49	[diff] [blame]	62	### Example: Evaluating a CL's impact on the performance of Speedometer2 on Mac M1
				63
				64	Here is an example of a try job that would evaluate the impact of a [patch CL](https://chromium-review.googlesource.com/c/chromium/src/+/3498915/1) at the tip of main on Speedometer2 on Mac M1. This experiment set up can also be used to see if the patch CL can address a performance regression as long as the regression is visible at HEAD.
				65
				66	![Perf Try Dialog](images/example-try-job.png)
				67
Dave Tu	c7bc05f	2018-02-16 20:41:21	[diff] [blame]	68	## Interpreting the results
				69
				70	### Detailed results
				71
				72	On the Job result page, click the "Analyze benchmark results" link at the top. See the [metrics results UI documentation](https://github.com/catapult-project/catapult/blob/master/docs/metrics-results-ui.md) for more details on reading the results.
				73
				74	### Traces
				75
				76	On the Job result page, there is a chart containing two dots. The left dot represents HEAD and the right dot represents the patch. Clicking on the right dot reveals some colored bars; each box represents one benchmark run. Click on one of the runs to see trace links.
				77
				78	![Trace links](images/pinpoint-trace-links.png)
Leina Sun	8e72ca22	2023-02-06 20:28:49	[diff] [blame]	79
				80	# Contact
				81
				82	* For more questions, email browser-perf-engprod@google.com
				83	* Bugs on Pinpoint issues should have Component: [Speed>Bisection](https://bugs.chromium.org/p/chromium/issues/list?q=component:Speed%3EBisection).