
Refactor benchmarks#1993

Merged
ricardoV94 merged 2 commits into pymc-devs:v3 from ricardoV94:cleanup_benchmarks
Mar 23, 2026

Conversation

@ricardoV94 (Member)

Work in #1122 and #1961 made me realize our test suite is a bit of a mess:

  • The CI job was running in "FAST_COMPILE"??
  • After Make Numba the default linker #1862, many benchmarks that were focused on "CVM" now ran in "NUMBA", sometimes even both
  • Tests were duplicated across the codebase, and in odd places, making it hard to see what was already being benchmarked.

This PR collects all benchmarks under tests/benchmark. This will be changed again by #1122, but those changes should be kept separate from it.

@ricardoV94 ricardoV94 changed the title Cleanup benchmarks Refactor benchmarks Mar 20, 2026
@ricardoV94 ricardoV94 marked this pull request as ready for review March 20, 2026 22:12
ctx = (
    config.change_flags(numba__cache=cache) if cache is not None else nullcontext()
)
with ctx:
    benchmark.pedantic(compile_and_call_once, rounds=5, iterations=1)
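The conditional context-manager pattern above can be sketched in isolation. The `flags` dict, `change_flags`, and `run` below are stand-ins for illustration, not PyTensor's actual config API:

```python
from contextlib import contextmanager, nullcontext

flags = {"numba__cache": True}  # stand-in for the real config flags


@contextmanager
def change_flags(**kwargs):
    # Temporarily override flag values, restoring them on exit.
    old = {k: flags[k] for k in kwargs}
    flags.update(kwargs)
    try:
        yield
    finally:
        flags.update(old)


def run(cache):
    # nullcontext() keeps the `with` statement uniform when cache is unset.
    ctx = change_flags(numba__cache=cache) if cache is not None else nullcontext()
    with ctx:
        return flags["numba__cache"]
```

With `cache=None` the no-op `nullcontext()` leaves the flag untouched; with an explicit value the flag is overridden inside the `with` block and restored afterwards.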
Member

why rounds >2?

Member Author

less noise?

Comment on lines +34 to +42
def test_convolve1d_benchmark_c(batch, convolve_mode, benchmark):
    _test_convolve1d_benchmark(
        mode="CVM", batch=batch, convolve_mode=convolve_mode, benchmark=benchmark
    )


@pytest.mark.parametrize("convolve_mode", ("full", "valid"), ids=lambda x: f"mode={x}")
@pytest.mark.parametrize("batch", (False, True), ids=lambda x: f"batch={x}")
def test_convolve1d_benchmark_numba(batch, convolve_mode, benchmark):
Member

why not parameterize mode?

Member Author

I did that at first, but it makes it hard to isolate the mode I am interested in benchmarking, and we usually try to optimize one mode at a time, so the others are just noise

y = vector("z")
out = exp(2 * x * y + y)

rng = np.random.default_rng(42)
Member

I put "use sum(map(ord)) for seed" in my instructions file, I hate seeing 42 everywhere (HAHA NERD NUMBER)

Member

also for the purpose of ASV I'm not sure using a seed is sufficient. I guess the reason you want to seed it is to have better comparability between runs, but we can't be sure that the generator state will be advanced in the same way between different PRs

Member Author

Why isn't a seed sufficient? This is local to the test. default_rng(42) yields the same state (for the same os/numpy version)

@jessegrabowski (Member) Mar 23, 2026

Yeah but the benchmarks will run across os/numpy versions, that's what I was thinking about

It's an extremely minor point
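The name-derived seed trick mentioned above can be sketched as follows; the string `"test_convolve1d_benchmark"` is just an illustrative test name, and the cross-version caveat from the thread still applies:

```python
import numpy as np

# Derive a seed from the test's name: deterministic, but not "42 everywhere".
seed = sum(map(ord, "test_convolve1d_benchmark"))

# Two generators built from the same seed produce identical streams
# (on the same OS / NumPy version).
rng_a = np.random.default_rng(seed)
rng_b = np.random.default_rng(seed)
draws_a = rng_a.standard_normal(4)
draws_b = rng_b.standard_normal(4)
```

This makes runs within one environment comparable, while sidestepping the objection that a single magic number shows up in every test.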


def _test_join_benchmark(mode, ndim, axis, memory_layout, gc, benchmark):
    if ndim == 1 and not (memory_layout == "C-contiguous" and axis == 0):
        pytest.skip("Redundant parametrization")
Member

we do this in linalg too, is there not a more idiomatic way to handle this in pytest?

Member Author

You can make a fancy fixture that skips it programmatically, but then you have to go read the logic elsewhere
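One sketch of that programmatic alternative: build the parametrize list up front with `pytest.param(..., marks=pytest.mark.skip(...))`, so redundant combinations are marked at collection time rather than skipped inside the test body. The parameter names and the redundancy condition are taken from the snippet above; as noted, the trade-off is that the skip logic now lives away from the test:

```python
import pytest


def join_params():
    # Attach skip marks to the redundant ndim=1 combinations up front.
    params = []
    for ndim in (1, 2):
        for memory_layout in ("C-contiguous", "F-contiguous"):
            for axis in (0, 1):
                marks = ()
                if ndim == 1 and not (memory_layout == "C-contiguous" and axis == 0):
                    marks = pytest.mark.skip(reason="Redundant parametrization")
                params.append(pytest.param(ndim, memory_layout, axis, marks=marks))
    return params


@pytest.mark.parametrize("ndim, memory_layout, axis", join_params())
def test_join_benchmark(ndim, memory_layout, axis):
    ...
```

The marked cases still show up as skips in the report, which keeps the parametrization visible instead of silently shrinking it.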

@pytest.mark.parametrize("rewrite", [True, False], ids=["rewrite", "no_rewrite"])
@pytest.mark.parametrize("size", [10, 100, 1000], ids=["small", "medium", "large"])
def test_block_diag_dot_benchmark(benchmark, size, rewrite):
    rng = np.random.default_rng()
Member

to seed or not to seed, that is the question

Member Author

Seems fine unseeded? It shouldn't affect anything.

@ricardoV94 ricardoV94 merged commit c311830 into pymc-devs:v3 Mar 23, 2026
66 checks passed
@ricardoV94 ricardoV94 deleted the cleanup_benchmarks branch March 23, 2026 18:27