Profiling Cargo Check and Analyzing Perf Profiles with `perf focus`

Go into the directory of a specific test; we'll use `clap-rs` as an example:

```bash
cd collector/compile-benchmarks/clap-3.1.6
```

Let's say we want to profile the `cargo check` performance. I would first run some basic commands to build the dependencies:

```bash
# Setup: first clean out any old results and build the dependencies:
cargo +<toolchain> clean
CARGO_INCREMENTAL=0 cargo +<toolchain> check
```

(Again, `<toolchain>` should be replaced with the name of the toolchain we made in the first step.)

Next, we want to record the execution time for *just* the clap-rs crate, running cargo check. I tend to use `cargo rustc` for this, since it also allows me to add explicit flags, which we'll do later on.

```bash
touch src/lib.rs
CARGO_INCREMENTAL=0 perf record -F99 --call-graph dwarf cargo rustc --profile check --lib
```

Note that final command: it's a doozy! It uses the `cargo rustc` command, which executes rustc with (potentially) additional options; the `--profile check` and `--lib` options specify that we are doing a `cargo check` execution, and that this is a library (not a binary).

At this point, we can use `perf` tooling to analyze the results. For example:

```bash
perf report
```

will open up an interactive TUI program. In simple cases, that can be helpful. For more detailed examination, the [`perf-focus` tool][pf] can be helpful; it is covered below.

**A note of caution.** Each of the rustc-perf tests is its own special snowflake. In particular, some of them are not libraries, in which case you would want to do `touch src/main.rs` and avoid passing `--lib`. I'm not sure how best to tell which test is which, to be honest.

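One rough way to guess whether a benchmark is a library or a binary is to look at which default entry-point file exists. The sketch below is only a heuristic, not an established rustc-perf convention: `pick_target` is a hypothetical helper, and it assumes the default Cargo layout (`src/lib.rs` or `src/main.rs`). Crates that declare custom target paths in `Cargo.toml` will not be detected, and a crate with both files is treated as a library.

```bash
# Hypothetical helper: guess which file to touch and whether to pass --lib.
# Assumes the default Cargo layout; prints "<file> [flag]" on one line.
pick_target() {
    dir="${1:-.}"
    if [ -f "$dir/src/lib.rs" ]; then
        # Library crate: touch src/lib.rs and pass --lib to `cargo rustc`.
        echo "src/lib.rs --lib"
    elif [ -f "$dir/src/main.rs" ]; then
        # Binary crate: touch src/main.rs and do not pass --lib.
        echo "src/main.rs"
    else
        echo "no default entry point found in $dir" >&2
        return 1
    fi
}
```

The first word of the output is the file to `touch`; the second word, if present, is the extra flag for the `cargo rustc` command.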
### Gathering NLL data

If you want to profile an NLL run, you can just pass extra options to the `cargo rustc` command, like so:

```bash
touch src/lib.rs
CARGO_INCREMENTAL=0 perf record -F99 --call-graph dwarf cargo rustc --profile check --lib -- -Z borrowck=mir
```

## Analyzing a perf profile with `perf focus`

Once you've gathered a perf profile, you'll want to get some information out of it. For this, I personally use [perf focus][pf]. It's a simple but useful tool that lets you answer queries like:

- "how much time was spent in function F" (no matter where it was called from)
- "how much time was spent in function F when it was called from G"
- "how much time was spent in function F *excluding* time spent in G"
- "what functions does F call and how much time does it spend in them"

To understand how it works, you have to know just a bit about perf. Basically, perf works by *sampling* your process on a regular basis (or whenever some event occurs). For each sample, perf gathers a backtrace. `perf focus` lets you write a regular expression that tests which functions appear in that backtrace, and then tells you which percentage of samples had a backtrace that met the regular expression. It's probably easiest to explain by walking through how I would analyze NLL performance.

### Installing `perf-focus`

You can install perf-focus using `cargo install`:

```bash
cargo install perf-focus
```

### Example: How much time is spent in MIR borrowck?

Let's say we've gathered the NLL data for a test. We'd like to know how much time it is spending in the MIR borrow-checker. The "main" function of the MIR borrowck is called `do_mir_borrowck`, so we can do this command:

```bash
$ perf focus '{do_mir_borrowck}'
Matcher    : {do_mir_borrowck}
Matches    : 228
Not Matches: 542
Percentage : 29%
```

The `'{do_mir_borrowck}'` argument is called the **matcher**. It specifies the test to be applied to the backtrace. Here, the `{X}` indicates that there must be *some* function on the backtrace that meets the regular expression `X`. That regex is just the name of the function we want (in fact, it's a subset of the name; the full name includes a bunch of other stuff, like the module path). In this mode, perf-focus just prints out the percentage of

This section details the process of profiling `cargo check` performance for a specific crate (clap-rs) within the rustc-perf environment. It covers the commands to set up the environment, record the execution time using `perf`, and analyze the results with the `perf report` tool. Additionally, it discusses how to gather data specifically for NLL (Non-Lexical Lifetimes) analysis by passing extra options to the `cargo rustc` command. The section then introduces `perf focus`, a tool for querying perf profiles, and provides a step-by-step example of how to use it to determine the time spent in the MIR borrow checker.
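
To make the sampling idea concrete, here is a toy model of what a `{X}` matcher computes. It is not how perf-focus is implemented: the `match_pct` function is hypothetical, and the input format (one sample per line, frames joined by `;`) is made up for this illustration. The function counts how many sample lines contain a frame matching the regex and reports that share of all samples.

```bash
# Toy model of a `{X}` matcher, not the real perf-focus logic.
# Each input line stands for one sample's backtrace
# (hypothetical format: frames joined by ';').
# Usage: match_pct REGEX FILE
match_pct() {
    # Count every line in the file: one line per sample.
    total=$(grep -c '' "$2")
    if [ "$total" -eq 0 ]; then
        echo "no samples in $2" >&2
        return 1
    fi
    # Count samples whose backtrace contains a match for the regex.
    # grep exits non-zero when nothing matches, so ignore its status.
    matches=$(grep -c -E -- "$1" "$2" || true)
    echo "Matches    : $matches"
    echo "Not Matches: $((total - matches))"
    # Integer division, so the result is truncated rather than rounded.
    echo "Percentage : $((matches * 100 / total))%"
}
```

With the counts from the `do_mir_borrowck` output (228 matches out of 228 + 542 = 770 samples), the same integer arithmetic gives `228 * 100 / 770 = 29`, which agrees with the 29% shown. Whether perf-focus itself truncates or rounds is not something this sketch establishes.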