[SPARK-56327][PYTHON][TESTS] Fix grouped map pandas tests for pandas 3 by ueshin · Pull Request #55146 · apache/spark

ueshin · 2026-04-01T22:06:44Z

What changes were proposed in this pull request?

This PR updates python/pyspark/sql/tests/pandas/test_pandas_grouped_map.py for pandas 3 behavior in grouped map pandas UDF tests.

The changes are:

- Update the expected boolean inversion in test_supported_types to use ~pdf.bool on pandas 3, keeping the existing behavior on older pandas versions.
- Update several pandas-side expected-value paths so they no longer group directly by the same in-DataFrame column on pandas 3.
- Use copied groupers such as pdf.id.copy() so the grouped pandas input keeps the grouping columns while preserving the original grouping semantics.
- Add comments explaining why the pandas 3 branch uses copied groupers.
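A version branch like the one described in the list could be sketched as follows. This is only an illustration: the helper name `grouper_for`, the flag `PANDAS_3_OR_LATER`, and the sample data are hypothetical and are not taken from the test file.

```python
import pandas as pd

# Assumption: checking the major version is enough to tell pandas 3 from older releases.
PANDAS_3_OR_LATER = int(pd.__version__.split(".")[0]) >= 3


def grouper_for(pdf: pd.DataFrame, column: str):
    """Hypothetical helper: pick the grouping key for a pandas-side expected value."""
    if PANDAS_3_OR_LATER:
        # A copied Series is not treated as an in-DataFrame column,
        # so the groups keep `column`.
        return pdf[column].copy()
    # Older pandas: group by the column label.
    return column


pdf = pd.DataFrame({"id": [1, 1, 2], "v": [1.0, 2.0, 3.0]})

# The group membership is the same whichever key is chosen.
group_sizes = pdf.groupby(grouper_for(pdf, "id")).size()
```

Either branch yields the same groups; only the set of columns handed to an applied function differs.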

Why are the changes needed?

In pandas 3, GroupBy.apply no longer passes the grouping columns to the applied function when the grouping key is a column of the DataFrame being grouped. These tests build their expected results by applying the Python function to pandas-grouped data, so the previous expectations no longer match the grouped map input that Spark provides in cases where the function relies on the grouping columns being present.
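As a minimal sketch of the pattern, the snippet below builds an expected value with a function that reads the grouping column inside each group. The function `add_scaled` and the sample data are made up for illustration. Grouping by a copy of the column is assumed to keep `id` in every group on both older pandas and pandas 3.

```python
import pandas as pd

pdf = pd.DataFrame({"id": [1, 1, 2], "v": [1.0, 2.0, 3.0]})


def add_scaled(group: pd.DataFrame) -> pd.DataFrame:
    # Illustrative function that needs the grouping column to be present.
    return group.assign(scaled=group["v"] * group["id"])


# Group by a copy of the column so "id" stays in each group frame.
expected = (
    pdf.groupby(pdf["id"].copy(), group_keys=False)
    .apply(add_scaled)
    .reset_index(drop=True)
)
```

If the function received groups without `id`, the lookup `group["id"]` would fail, which is the kind of mismatch the copied grouper avoids.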

The boolean expectation also needs a pandas-3-specific branch because the old scalar-style inversion logic does not match the pandas 3 object being operated on in these tests.
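The difference between elementwise and scalar-style inversion can be illustrated with a small sketch. It does not reproduce the test's actual expression. It assumes only that the `bool` column is available as a pandas Series, and it uses bracket access and made-up data.

```python
import pandas as pd

pdf = pd.DataFrame({"id": [1, 1, 2], "bool": [True, False, True]})

# Elementwise inversion of a boolean column.
inverted = ~pdf["bool"]

# Scalar-style truth testing of a multi-element Series is ambiguous and raises.
try:
    flipped = False if pdf["bool"] else True
except ValueError:
    flipped = None
```

The `~` operator returns a Series of the same length, which is what a per-row expected value needs.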

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Updated the related tests.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Codex (GPT-5)

ueshin · 2026-04-01T22:07:04Z

cc @gaogaotiantian @HyukjinKwon @zhengruifeng

ueshin · 2026-04-02T20:03:23Z

Thanks! merging to master.

Fix grouped map pandas tests for pandas 3

aaca5af

zhengruifeng approved these changes Apr 2, 2026

Fix.

65ad01f

ueshin closed this in e9a348e Apr 2, 2026

ueshin wants to merge 2 commits into apache:master from ueshin:issues/SPARK-56327/test_pandas_grouped_map
