ref(jest-balance): Parallelize balancer across 8 shards by ryan953 · Pull Request #117059 · getsentry/sentry

ryan953 · 2026-06-07T19:28:02Z

The single-runner balancer took ~35mins to run the entire Jest suite sequentially, but often fails with flakey tests. This splits the work across 8 workers using the same sharding mechanism as the frontend CI (jest-test-config.sh + CI_NODE_TOTAL/CI_NODE_INDEX env vars), so the jest.config.ts balancing logic is reused without modification.

So it runs faster now, and if there are failing tests we're now omitting them from the balance metrics. This means we'll always have up-to-date balance data for non-flakey tests. I think as a side-effect this means that flakes will sort to the bottom of the list, so people will findout later that a flake caused CI to fail, which isn't ideal, but that's something we can address separately in frontend.yml or somthing; if it becomes visible.

Architecture:

jest-config job: reuses jest-test-config.sh to list all test files and
compute the shard matrix. For schedule/workflow_dispatch events the
script always produces 8 runners with the full test list.
jest-balance job (8x matrix): each shard downloads jest-test-files.json
(so jest.config.ts can split tests) and the previous run's
jest-balance.json (so the split is duration-aware rather than naive
alphabetical). The resultsProcessor writes per-shard timing data.
combine-balance job: merges the 8 per-shard JSON files into a single
jest-balance.json artifact that downstream consumers (frontend.yml)
fetch for their own balanced sharding.

The combine job includes setup-node-pnpm because combine.ts uses Node's
native TypeScript stripping (requires Node 24 from .node-version).

Also renames index.js → resultsProcessor.ts (TS file now! failing tests no longer throw, but we skip them in the output) and adds combine.ts for the merge step.

github-actions · 2026-06-07T19:28:21Z

🚨 Warning: This pull request contains Frontend and Backend changes!

It's discouraged to make changes to Sentry's Frontend and Backend in a single pull request. The Frontend and Backend are not atomically deployed. If the changes are interdependent of each other, they must be separated into two pull requests and be made forward or backwards compatible, such that the Backend or Frontend can be safely deployed independently.

Have questions? Please ask in the #discuss-dev-infra channel.

The single-runner balancer took ~60 minutes to run the entire Jest suite sequentially. This splits the work across 8 workers using the same sharding mechanism as the frontend CI (jest-test-config.sh + CI_NODE_TOTAL/ CI_NODE_INDEX env vars), so the jest.config.ts balancing logic is reused without modification. Architecture: - jest-config job: reuses jest-test-config.sh to list all test files and compute the shard matrix. For schedule/workflow_dispatch events the script always produces 8 runners with the full test list. - jest-balance job (8x matrix): each shard downloads jest-test-files.json (so jest.config.ts can split tests) and the previous run's jest-balance.json (so the split is duration-aware rather than naive alphabetical). The resultsProcessor writes per-shard timing data. - combine-balance job: merges the 8 per-shard JSON files into a single jest-balance.json artifact that downstream consumers (frontend.yml) fetch for their own balanced sharding. The combine job includes setup-node-pnpm because combine.ts uses Node's native TypeScript stripping (requires Node 24 from .node-version). Also renames index.js → resultsProcessor.ts (typed, skips failing tests instead of throwing) and adds combine.ts for the merge step.

ryan953 · 2026-06-09T22:08:00Z

+          search_artifacts: true
+          if_no_artifact_found: warn
+
+      - name: jest balancer (shard ${{ matrix.index }}/${{ matrix.total }})


@cursoragent If this step fails i want to keep going and not fail the whole run. Do we need continue-on-error: true somewhere?

Done! I've added continue-on-error: true to the jest balancer step at line 66.

This change means:

Flaky test failures won't block the workflow - each shard will continue and upload its artifact even if some tests fail

The combine job will always run - since individual jest-balance jobs won't be marked as failed

You get timing data from all successful tests - aligning with your goal to "omit failing tests from balance metrics"

The change has been committed and pushed to your branch.

Ensures artifacts are uploaded even when tests fail, allowing the combine job to merge partial results from successful shards. Co-authored-by: Ryan Albrecht <ryan@ryanalbrecht.ca>

Co-authored-by: Ryan Albrecht <ryan@ryanalbrecht.ca>

Correct @types/node references from 22.19.19 (non-existent) to 22.19.15 in @jest/console and jest-util snapshots. Co-authored-by: Ryan Albrecht <ryan@ryanalbrecht.ca>

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 630d6c2. Configure here.}

cursor · 2026-06-09T22:15:30Z

      - uses: actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 # 4.6.2
        with:
-          name: jest-balance.json
+          name: jest-balance-${{ matrix.index }}


Stale balance shard artifact uploaded

Medium Severity

Each shard downloads the prior full jest-balance.json into the same path resultsProcessor.ts writes, but the upload step always publishes that file even when Jest exits before the processor runs. A shard can then upload the entire previous balance as its shard artifact, and combine.ts merges misleading timings into the published artifact.

Additional Locations (1)

.github/workflows/jest-balance.yml#L52-L63

^{Reviewed by Cursor Bugbot for commit 630d6c2. Configure here.}

ryan953 added 2 commits June 7, 2026 12:01

Add @jest/test-result types

b1b8b6a

adjust knip & eslint & CODEOWNERS for test-balancer files

b77c928

github-actions Bot added Scope: Frontend Automatically applied to PRs that change frontend components Scope: Backend Automatically applied to PRs that change backend components labels Jun 7, 2026

ryan953 force-pushed the ryan953/jest-parallel-balancer branch from ad1d004 to 8d9e370 Compare June 7, 2026 19:34

ryan953 marked this pull request as ready for review June 7, 2026 19:35

ryan953 requested review from a team as code owners June 7, 2026 19:35

cursor Bot reviewed Jun 7, 2026

View reviewed changes

Comment thread tests/js/test-balancer/combine.ts

ryan953 commented Jun 9, 2026

View reviewed changes

cursoragent and others added 2 commits June 9, 2026 22:08

Add continue-on-error to jest balancer step

64e5529

Ensures artifacts are uploaded even when tests fail, allowing the combine job to merge partial results from successful shards. Co-authored-by: Ryan Albrecht <ryan@ryanalbrecht.ca>

Merge branch 'master' into ryan953/jest-parallel-balancer

33821c0

Co-authored-by: Ryan Albrecht <ryan@ryanalbrecht.ca>

vercel Bot had a problem deploying to Preview June 9, 2026 22:11 Failure

Fix pnpm lockfile merge conflict

630d6c2

Correct @types/node references from 22.19.19 (non-existent) to 22.19.15 in @jest/console and jest-util snapshots. Co-authored-by: Ryan Albrecht <ryan@ryanalbrecht.ca>

cursor Bot reviewed Jun 9, 2026

View reviewed changes

vercel Bot deployed to Preview June 9, 2026 22:15 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ref(jest-balance): Parallelize balancer across 8 shards#117059

ref(jest-balance): Parallelize balancer across 8 shards#117059
ryan953 wants to merge 6 commits into
masterfrom
ryan953/jest-parallel-balancer

ryan953 commented Jun 7, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 7, 2026

Uh oh!

Uh oh!

ryan953 Jun 9, 2026

Uh oh!

cursor Bot Jun 9, 2026 •

edited

Loading

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ryan953 commented Jun 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 7, 2026

Uh oh!

Uh oh!

ryan953 Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

cursor Bot Jun 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Jun 9, 2026

Choose a reason for hiding this comment

Stale balance shard artifact uploaded

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ryan953 commented Jun 7, 2026 •

edited

Loading

cursor Bot Jun 9, 2026 •

edited

Loading