
CI: testing python 3.14 and sphinx9 and docutils 0.22 #704

Open
bsipocz wants to merge 14 commits into executablebooks:main from bsipocz:CI_maintenance

Conversation

@bsipocz (Member) commented Dec 12, 2025

This is to close #680, along with other cleanups.

@bsipocz added the maintanence (Maintanence and cleanup) label on Dec 12, 2025
@bsipocz force-pushed the CI_maintenance branch 2 times, most recently from afc0b8d to a43342b on December 12, 2025 06:48
@bsipocz force-pushed the CI_maintenance branch 3 times, most recently from 49b335d to 428a4ef on January 17, 2026 03:17
@bsipocz changed the title from "CI: matrix maintenance" to "CI: testing python 3.14 and sphinx9" on Jan 17, 2026
@bsipocz (Member, Author) commented Jan 17, 2026

OK, so we have some docutils 0.22 incompatibilities in the tests. I've just merged something for sphinx-tabs that may be a good enough workaround here, too; I'm planning to get back to trying it next week.

@bsipocz changed the title from "CI: testing python 3.14 and sphinx9" to "CI: testing python 3.14 and sphinx9 and docutils 0.22" on Jan 24, 2026
@bsipocz (Member, Author) commented Jan 24, 2026

It looks like this is done, except that some of the jobs are picking up a newer Pillow (12.x) or are not compatible with the version we pin (py3.14 and Pillow 11.0), so I'm a little bit stuck.

@choldgraf - would you help out? Should I just xfail the few tests that are affected when it's not the correct Pillow version, or update the pin to 12.0.0 everywhere? If the latter, what is the way to update the expected outputs?
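
A minimal sketch of the xfail option, assuming the baselines were generated against Pillow 11.x; the marker, constant, and test names are hypothetical, not this PR's actual code:

```python
import PIL
import pytest

# Hypothetical: the Pillow major version the expected outputs were generated with.
BASELINE_PILLOW_MAJOR = 11

# Non-strict xfail for baseline-sensitive tests when the installed Pillow
# doesn't match the version the expected images were produced with.
pillow_mismatch = pytest.mark.xfail(
    int(PIL.__version__.split(".")[0]) != BASELINE_PILLOW_MAJOR,
    reason="image baselines were generated with a different Pillow major version",
    strict=False,
)

@pillow_mismatch
def test_image_output():
    ...
```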

Comment on lines +322 to +326
```python
# docutils < 0.22 serializes these boolean attributes as "True"/"False";
# rewrite them to the "0"/"1" form that docutils 0.22 emits, so the output
# matches the expected files.
if docutils.__version_info__ < (0, 22):
    data = data.replace('linenos="False"', 'linenos="0"')
    data = data.replace('nowrap="False"', 'nowrap="0"')
    data = data.replace('linenos="True"', 'linenos="1"')
    data = data.replace('internal="True"', 'internal="1"')
```
A Contributor commented:

Why not do it the other way around? Then you don't need to change all the XML, and you can just remove this block once 0.22 is the lowest supported version.
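
A minimal sketch of the suggested inversion, keeping the expected XML files in the pre-0.22 form; the helper name is hypothetical:

```python
import docutils

def normalize_boolean_attrs(data: str) -> str:
    """Rewrite the "0"/"1" boolean attributes emitted by docutils >= 0.22
    back to the pre-0.22 "True"/"False" spelling, so the expected XML
    files can stay unchanged."""
    if docutils.__version_info__ >= (0, 22):
        data = data.replace('linenos="0"', 'linenos="False"')
        data = data.replace('nowrap="0"', 'nowrap="False"')
        data = data.replace('linenos="1"', 'linenos="True"')
        data = data.replace('internal="1"', 'internal="True"')
    return data
```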

Member Author replied:

Because this way the files are docutils 0.22 compatible, and this workaround can be removed when the older versions are dropped.

The problem with this PR no longer has anything to do with docutils; we're stuck on the image tests.

@bsipocz (Member, Author) commented Feb 4, 2026

Well, I'm not happy: I cannot reproduce the failing builds locally; the same version combo just passes all the tests. Could this be macOS vs Linux related?

@choldgraf (Member) commented:

I have a Mac, I can test it out this week - would that help?

@bsipocz (Member, Author) commented Feb 4, 2026

After changing the OS we're down to only 2 failing tests, as opposed to 6. I'm really on the verge of just ignoring those image outputs, as this is getting ridiculous.

@choldgraf (Member) commented:

IMO - ignore them, don't worry about it. This feels too defensive to be worth the hassle.

@bsipocz (Member, Author) commented Feb 4, 2026

> I have a Mac, I can test it out this week - would that help?

No, we need the opposite: a Linux box to generate the outputs, as I couldn't do that on my Mac (and frankly, some of these were always failing locally, so that was kind of the status quo).

@choldgraf (Member) commented:

Got it - IMO we shouldn't worry about this and should just skip the tests on some platforms.

@bsipocz (Member, Author) commented Feb 4, 2026

Yes, I'll do some skipping/rearranging, but it has to wait until tomorrow.
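
A minimal sketch of the per-platform skip being discussed; the marker and test names are hypothetical:

```python
import sys

import pytest

# Skip baseline image comparisons everywhere except Linux, where the
# expected outputs are generated.
linux_only_baselines = pytest.mark.skipif(
    sys.platform != "linux",
    reason="image baselines are generated on Linux and differ elsewhere",
)

@linux_only_baselines
def test_rendered_image_matches_baseline():
    ...
```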

@flying-sheep (Contributor) commented:

Yeah, this image stuff is a huge problem for our tests too, and we don’t even test on multiple platforms.

matplotlib.testing.setup() helps a little, but in the end, each tiny font rendering difference makes pixel-wise comparisons a fool’s errand.

Matplotlib even has a freetype_version parameter for its @matplotlib.testing.decorators.image_comparison, but nobody seems to be using that.

I’d love to help if we get a conversation going in the community about how to fix this once and for all. (Maybe use a font renderer compiled to wasm in a fully reproducible setup, so we can keep the same version as long as we want and get pixel-perfect results.)
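
For reference, a minimal sketch of the Matplotlib facilities mentioned above; the baseline name, figure, and version values are hypothetical. `freetype_version` ties the comparison to the FreeType build the baselines were rendered with:

```python
import matplotlib.pyplot as plt
import matplotlib.testing
from matplotlib.testing.decorators import image_comparison

matplotlib.testing.setup()  # reset rcParams for reproducible rendering

@image_comparison(
    baseline_images=["simple_plot"],  # hypothetical baseline PNG name
    extensions=["png"],
    tol=0.1,                   # small RMS tolerance for rendering drift
    freetype_version="2.6.1",  # FreeType build the baseline was rendered with
)
def test_simple_plot():
    fig, ax = plt.subplots()
    ax.plot([0, 1, 2], [0, 1, 4])
```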

@bsipocz (Member, Author) commented Feb 4, 2026

@flying-sheep - Well, other projects do use pytest-mpl, which gives the usual relative and absolute tolerance options, but I don't immediately see how it could be plugged into this package, and I certainly don't have enough time to make it happen.
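
For context, a minimal sketch of pytest-mpl usage (run with `pytest --mpl`); the baseline directory and test are hypothetical:

```python
import matplotlib.pyplot as plt
import pytest

# pytest-mpl compares the returned figure against a stored baseline image,
# within the given RMS tolerance.
@pytest.mark.mpl_image_compare(baseline_dir="baseline", tolerance=5)
def test_line_plot():
    fig, ax = plt.subplots()
    ax.plot([1, 2, 3], [1, 4, 9])
    return fig
```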

@flying-sheep (Contributor) commented:

pytest-mpl just wraps the APIs I mentioned; the tolerance options come from them.

But that's exactly what I've been talking about: tolerance isn't very meaningful when everything shifts around by a pixel every few months, forcing you to bump the tolerance again and again until the test is meaningless.

I don't think we can do anything here; I'm just shouting out in case someone has a rigorous solution.


Labels

maintanence Maintanence and cleanup


Development

Successfully merging this pull request may close these issues.

CI: fix codecov or remove the ci job

3 participants