Here are some recent and important revisions. 👉 Complete list of results.
Key: 📄: table, 📈: time plot, 🧠: memory plot
| date | fork/ref | hash/flags | vs. 3.11.0: | vs. 3.12.0: | vs. 3.13.0: | vs. base: |
|---|---|---|---|---|---|---|
| 2022-03-23 | python/v3.10.4 | 9d38120 |
| date | fork/ref | hash/flags | vs. 3.11.0: | vs. 3.12.0: | vs. 3.13.0: | vs. base: |
|---|---|---|---|---|---|---|
| 2025-10-19 | python/bedaea05987738c4c6b9 | bedaea0 | ||||
| 2025-10-12 | python/d6dd64ac654898b4ce71 | d6dd64a | ||||
| 2025-10-04 | python/880c9526f91960b9cba5 | 880c952 |
* indicates that the exact same versions of pyperformance was not used.
For the results above, the "faster/slower" result is a geometric mean of each of the benchmarks. The "reliability (rel)" number is the likelihood that the change is faster or slower based on the Hierarchical Performance Testing (HPT) method. For more details, visit each individual result's README.md.
Below are longitudinal timing results. There are also 🧠 longitudinal memory results.
Improvement of the geometric mean of key merged benchmarks, computed with pyperf compare.
The results have a resolution of 0.01 (1%).
There is also a longitudinal plot by benchmark.
Visit the 🔒 benchmark action and click the "Run Workflow" button.
The available parameters are:
fork: The fork of CPython to benchmark. If benchmarking a pull request, this would normally be your GitHub username.ref: The branch, tag or commit SHA to benchmark. If a SHA, it must be the full SHA, since finding it by a prefix is not supported.machine: The machine to run on. One oflinux-amd64(default),windows-amd64,darwin-arm64orall.benchmark_base: If checked, the base of the selected branch will also be benchmarked. The base is determined by runninggit merge-base upstream/main $ref.pystats: If checked, collect the pystats from running the benchmarks.
To watch the progress of the benchmark, select it from the 🔒 benchmark action page. It may be canceled from there as well. To show only your benchmark workflows, select your GitHub ID from the "Actor" dropdown.
When the benchmarking is complete, the results are published to this repository and will appear in the complete table. Each set of benchmarks will have:
- The raw
.jsonresults from pyperformance. - Comparisons against important reference releases, as well as the merge base of the branch if
benchmark_basewas selected. These include- A markdown table produced by
pyperf compare_to. - A set of "violin" plots showing the distribution of results for each benchmark.
- A set of plots showing the memory change for each benchmark (for immediate bases only, on non-Windows platforms).
- A markdown table produced by
The most convenient way to get results locally is to clone this repo and git pull from it.
To automate benchmarking runs, it may be more convenient to use the GitHub CLI.
Once you have gh installed and configured, you can run benchmarks by cloning this repository and then from inside it:
$ gh workflow run benchmark.yml -f fork=me -f ref=my_branchAny of the parameters described above are available at the commandline using the -f key=value syntax.
To collect Linux perf sampling profile data for a benchmarking run, run the _benchmark action and check the perf checkbox.
Follow this by a run of the _generate action to regenerate the plots.