
[PoC] [WiP Draft] Performance gate prototype #1825

Draft
wants to merge 16 commits into master
Conversation


vytas7 commented Dec 20, 2020

This is an early prototype of what implementing the https://github.com/pythonspeed/cachegrind-benchmarking approach could look like for Falcon.

The prototype builds upon this excellent article by @itamarst: https://pythonspeed.com/articles/consistent-benchmarking-in-ci/ (thanks to @njsmith for kindly pointing out this idea).
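For context, a minimal sketch of the core mechanism (not the actual code in this PR; the benchmark script name and output path are hypothetical): run the benchmark under Valgrind's Cachegrind tool and read the event counts from the summary line of the output file, which are far more reproducible than wall-clock timings.

```python
# Minimal sketch of the cachegrind-benchmarking idea; "benchmark.py"
# and the output filename are hypothetical placeholders.
import subprocess


def run_under_cachegrind(cmd):
    """Run cmd under Cachegrind and return its summary event counts."""
    subprocess.run(
        ['valgrind', '--tool=cachegrind', '--cache-sim=yes',
         '--cachegrind-out-file=cachegrind.out'] + cmd,
        check=True,
    )
    events = summary = None
    with open('cachegrind.out') as f:
        for line in f:
            # "events:" names the columns (Ir, I1mr, ILmr, Dr, D1mr, ...);
            # "summary:" holds the corresponding totals.
            if line.startswith('events:'):
                events = line.split()[1:]
            elif line.startswith('summary:'):
                summary = [int(n) for n in line.split()[1:]]
    return dict(zip(events, summary))


counts = run_under_cachegrind(['python', 'benchmark.py'])
print('instructions executed:', counts['Ir'])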

Closes #1450

To do if we want to proceed with this:

  • ASGI performance metric
  • Barebones "Hello, World!" performance metric (see the sketch after this list)
  • Media performance metric
  • Routing performance metric
  • URL params and headers performance metric
  • Clean up tox environments, requirements, etc.
  • Add Cython support
  • Run Cython gates for master only (?)
  • Add PyPy support, master only (?); probably only informational?
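For illustration, a hypothetical sketch of the barebones "Hello, World!" metric from the list above (the exact spelling differs between Falcon versions: `falcon.App()` on 3.x, `falcon.API()` on 2.x):

```python
# Hypothetical barebones "Hello, World!" benchmark: exercise a minimal
# Falcon app through the testing client so that the instruction count
# scales linearly with the number of iterations.
import sys

import falcon
import falcon.testing


class HelloResource:
    def on_get(self, req, resp):
        resp.content_type = falcon.MEDIA_TEXT
        resp.text = 'Hello, World!'  # resp.body on Falcon < 3.0


app = falcon.App()  # falcon.API() on Falcon < 3.0
app.add_route('/', HelloResource())

if __name__ == '__main__':
    iterations = int(sys.argv[1]) if len(sys.argv) > 1 else 1000
    client = falcon.testing.TestClient(app)
    for _ in range(iterations):
        client.simulate_get('/')
```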

@vytas7 marked this pull request as draft December 20, 2020 19:08

codecov bot commented Dec 20, 2020

Codecov Report

Merging #1825 (30d2886) into master (29b05ed) will not change coverage.
The diff coverage is n/a.


@@            Coverage Diff            @@
##            master     #1825   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           54        54           
  Lines         5154      5154           
  Branches       831       831           
=========================================
  Hits          5154      5154           


@itamarst

Excited to see this, and to see if it turns out to be useful. I'm going to be setting up something similar for one of my projects, so I'll share what I come up with if this is still in progress, or maybe steal from you, depending 😀

@itamarst

Note that I've found a bug in the cachegrind.py script calculation, so you'll want to pull a new version once I've updated it (tomorrow hopefully).


vytas7 commented Dec 21, 2020

Heh, thanks for the heads-up @itamarst!
I did notice that the cachegrind.py metric was a lot (1-2 orders of magnitude) noisier than the raw instruction count from valgrind, if that is what is being revised.
But maybe it is inherently a bit noisier, so I didn't pay too much attention to it.


vytas7 commented Dec 21, 2020

@itamarst Btw, another thing that really surprised me when toying with this approach was how much "rogue" Python hash seeds can affect performance 🙂

@itamarst

Yeah, you really want to set a fixed PYTHONHASHSEED for consistency.
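For example, something along these lines when launching the benchmark subprocess (the script name is hypothetical, and the seed value is arbitrary; it just has to be fixed):

```python
# Sketch: pin PYTHONHASHSEED so that dict/set layouts, and hence the
# instruction counts, are identical from one benchmark run to the next.
import os
import subprocess

env = dict(os.environ, PYTHONHASHSEED='1234')  # any fixed value works
subprocess.run(['python', 'benchmark.py'], env=env, check=True)
```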

The issue was that I was counting L3 hits wrong: if something hit RAM, I also counted it as hitting L3 (i.e., the L3 hit count was too high). I am not sure whether this will have much impact on the noisiness, though.
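Roughly, the corrected accounting looks like this. A sketch using Cachegrind's event names, taking the counts dict from the earlier sketch; the 1/5/35 cycle weights are an assumption here, approximating the costs suggested in the article:

```python
# Sketch of the corrected hit accounting (Cachegrind event names; the
# 1/5/35 cycle weights approximate the costs assumed in the article).
def combined_score(counts):
    # Last-level (LL/L3) *misses* are what actually hit RAM...
    ram_hits = counts['DLmr'] + counts['DLmw'] + counts['ILmr']
    # ...so the fix is to exclude them from the L3 hit count.
    l3_hits = counts['D1mr'] + counts['D1mw'] + counts['I1mr'] - ram_hits
    total_rw = counts['Ir'] + counts['Dr'] + counts['Dw']
    l1_hits = total_rw - l3_hits - ram_hits
    return l1_hits + 5 * l3_hits + 35 * ram_hits
```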

@itamarst

OK, https://github.com/pythonspeed/cachegrind-benchmarking has been updated.


vytas7 commented Dec 22, 2020

Thanks @itamarst , I'll check that out!


vytas7 commented Dec 26, 2020

@itamarst thanks again for the update.
The noisiness is gone, and the variation in the least-squares linear regression fitting error is now low beyond belief 💯.
In fact, I'm now getting exactly the same cost per iteration in two different CI runs (identical to at least 9 significant digits!).
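For the curious, the kind of fit in question, sketched with numpy over purely illustrative numbers: run the benchmark at several iteration counts and take the slope of the fitted line as the cost of one iteration (the intercept absorbs the fixed interpreter startup cost).

```python
# Sketch: estimate the cost of one iteration as the slope of a
# least-squares line over (iteration count, measured score) pairs;
# the numbers below are purely illustrative.
import numpy as np

iterations = np.array([1000, 2000, 4000, 8000])
scores = np.array([5.1e6, 9.9e6, 19.5e6, 38.7e6])

slope, intercept = np.polyfit(iterations, scores, 1)
residuals = scores - (slope * iterations + intercept)
print(f'cost per iteration: {slope:.9g}')
print('max fitting error:', abs(residuals).max())
```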

@itamarst

I got benchmarks working. Beyond what is in the original repository:

  1. I store the results as a pretty-printed, sorted JSON file on disk. The expectation is that the developer runs this locally and checks in the result, though mine is a single-person project.
  2. On every PR, I run the benchmarks again and add the diff as a GitHub comment: https://github.com/pythonspeed/filprofiler/blob/master/.github/workflows/main.yml#L120
  3. In addition to setting PYTHONHASHSEED and figuring out the equivalent fixed seed for Rust, I ended up using Conda environments to reduce noise between my machine and the GitHub Actions machines (so that, e.g., it is the same Python binary, rather than different dot-versions or different compilers). The result is quite consistent on my machine, and a little noisy between my machine and the VMs, but better than it would be without Conda.

Example output: pythonspeed/filprofiler#110
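A sketch of item 1 above (the filename and numbers are hypothetical); sorting the keys and pretty-printing keeps the checked-in file's diffs stable and reviewable:

```python
# Sketch of item 1: persist benchmark results as sorted, pretty-printed
# JSON so that the checked-in file yields stable, reviewable diffs.
import json

results = {'hello_world': 5123456, 'routing': 9876543}  # illustrative

with open('benchmarks.json', 'w') as f:
    json.dump(results, f, indent=2, sort_keys=True)
    f.write('\n')  # trailing newline keeps diffs clean
```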

@kgriffs mentioned this pull request Aug 4, 2021

vytas7 commented Mar 13, 2022

This has been rotting for so long due to my lack of bandwidth that I'm thinking of waiting a couple more weeks and then migrating this to Ubuntu 22.04 + CPython 3.10, so as to have the same gauge for a longer time.

I'm hoping to circle back on this shortly after we release 3.1.
