C.mze cq 105 by Martozar · Pull Request #1489 · gooddata/gooddata-python-sdk

Martozar · 2026-03-30T08:02:12Z

No description provided.

Add arrow_convertor.py with convert_arrow_table_to_dataframe(), which turns the GoodData /binary execution endpoint response into a pandas DataFrame matching the JSON-path output (MultiIndex rows/columns, totals uppercased, transposition handled via x-gdc-view-v1.isTransposed). Wire it into DataFrameFactory.for_exec_def_arrow() replacing the previous .to_pandas() stub.

Add compute_row_totals_indexes() which derives row_totals_indexes from the Arrow table metadata and execution dimension headers, matching the DataFrameMetadata produced by the JSON path. Update for_exec_def_arrow() to return (DataFrame, DataFrameMetadata) for API parity with for_exec_def(), making the two paths interchangeable. primary_labels_from_index/columns are empty dicts as the Arrow path does not support use_primary_labels_in_attributes.

- Compute primary attribute labels from Arrow table metadata and populate DataFrameMetadata.primary_labels_from_index/columns (was returning empty dicts previously)
- Add DataFrameFactory.for_arrow_table() for callers who hold a pa.Table already (raw export REST path, future Flight RPC); accepts optional BareExecutionResponse for accurate row_totals_indexes
- Make DataFrameMetadata.execution_response Optional to support the no-execution-response path
- Export convert_arrow_table_to_dataframe from gooddata_pandas.__init__

Add a thin method that polls the raw export endpoint for an already-submitted export and returns Arrow IPC bytes. Reuses the existing _get_exported_content() polling loop; the caller is responsible for submitting the execution and export request.
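The polling pattern this commit describes can be sketched roughly as follows. This is only an illustration under stated assumptions: `poll_export`, `make_fake_endpoint`, and the retry parameters are hypothetical names, and the SDK's actual `_get_exported_content()` loop may behave differently.

```python
import time
from typing import Callable, Optional


def poll_export(
    fetch_once: Callable[[], Optional[bytes]],
    max_attempts: int = 10,
    delay_s: float = 0.0,
) -> bytes:
    """Call fetch_once until it returns bytes or the attempts run out."""
    for _ in range(max_attempts):
        payload = fetch_once()
        if payload is not None:
            return payload
        time.sleep(delay_s)
    raise TimeoutError(f"export not ready after {max_attempts} attempts")


def make_fake_endpoint(ready_after: int, payload: bytes) -> Callable[[], Optional[bytes]]:
    """Simulate an endpoint that reports 'not ready' for the first few calls."""
    calls = {"n": 0}

    def fetch_once() -> Optional[bytes]:
        calls["n"] += 1
        return payload if calls["n"] > ready_after else None

    return fetch_once
```

As in the commit message, the caller stays responsible for submitting the execution and the export request; the sketch only covers the wait-and-fetch step.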

- read_result_arrow(): drain response into BytesIO before releasing the connection, eliminating the fragile try/finally ordering
- Remove unused _FULL_TYPES_MAPPER dead code from arrow_convertor.py
- Remove stale docstring claiming primary_labels are always empty dicts (compute_primary_labels() now populates them)
- Add for_arrow_table() to DataFrameFactory class docstring
- convert_arrow_table_to_dataframe stub in __init__.py now raises ImportError with a helpful pyarrow install hint instead of being silently absent
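The "drain into BytesIO before releasing the connection" idea from the first item can be illustrated with a minimal sketch. `FakeResponse` and `drain_to_buffer` are hypothetical stand-ins, not the SDK's classes; the real read_result_arrow() may be structured differently.

```python
import io


class FakeResponse:
    """Hypothetical stand-in for a streaming HTTP response."""

    def __init__(self, data: bytes) -> None:
        self._stream = io.BytesIO(data)
        self.released = False

    def read(self, size: int = -1) -> bytes:
        if self.released:
            raise ValueError("connection already released")
        return self._stream.read(size)

    def release_conn(self) -> None:
        self.released = True


def drain_to_buffer(response: FakeResponse, chunk_size: int = 4) -> io.BytesIO:
    """Copy the whole body into memory, then release the connection."""
    buffer = io.BytesIO()
    try:
        # Read until the stream returns an empty bytes object.
        while chunk := response.read(chunk_size):
            buffer.write(chunk)
    finally:
        response.release_conn()
    buffer.seek(0)  # rewind so a downstream reader starts at byte 0
    return buffer
```

Once the bytes are held in memory, any later parsing step no longer depends on the connection still being open, which is what makes the ordering less fragile.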

[project.optional-dependencies] was incorrectly placed between dependencies and classifiers in both gooddata-pandas and gooddata-sdk pyproject.toml files, causing TOML to parse classifiers as belonging to optional-dependencies instead of [project]. This caused hatchling to fail with "Dependency of option 'classifiers' is invalid". Regenerate uv.lock after fixing the TOML structure.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The keyword form (executionResponse=) breaks the generated API client at runtime because __init__ expects execution_response as a positional arg. Revert to the original call and suppress the ty false-positive on the __new__ signature with type: ignore[invalid-argument-type].
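The failure mode described here can be shown with a small stand-in class. `FakeGeneratedModel` and the payload keys are hypothetical; the sketch only assumes, as the commit message states, that __init__ takes execution_response as a required positional parameter and swallows other keywords.

```python
class FakeGeneratedModel:
    """Hypothetical stand-in for a generated API client model."""

    def __init__(self, execution_response, *args, **kwargs):
        self.execution_response = execution_response
        self.extra = kwargs


payload = {"links": {"executionResult": "abc"}}  # illustrative payload

# Positional form: binds to execution_response as intended.
ok = FakeGeneratedModel(payload)

# camelCase keyword form: falls into **kwargs, so the required
# positional parameter is left unfilled and Python raises TypeError.
try:
    FakeGeneratedModel(executionResponse=payload)
    keyword_error = None
except TypeError as exc:
    keyword_error = str(exc)
```

A static checker that only sees a differently shaped __new__ signature can flag the positional call even though it is the form that works at runtime, which is the situation the suppression addresses.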

codecov · 2026-03-30T09:31:36Z

Codecov Report

❌ Patch coverage is 89.23611% with 31 lines in your changes missing coverage. Please review.
✅ Project coverage is 77.55%. Comparing base (d6472f7) to head (9111af3).

Files with missing lines	Patch %	Lines
...s/gooddata-pandas/src/gooddata_pandas/dataframe.py	46.42%	15 Missing ⚠️
...ta-sdk/src/gooddata_sdk/compute/model/execution.py	35.29%	11 Missing ⚠️
...es/gooddata-pandas/src/gooddata_pandas/__init__.py	50.00%	3 Missing ⚠️
...data-pandas/src/gooddata_pandas/arrow_convertor.py	99.12%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #1489      +/-   ##
==========================================
+ Coverage   77.32%   77.55%   +0.22%     
==========================================
  Files         227      229       +2     
  Lines       14768    15054     +286     
==========================================
+ Hits        11420    11675     +255     
- Misses       3348     3379      +31

gooddata_api_client is installed as a wheel in CI so ty cannot resolve its .pyi stubs, making the suppressions both ineffective and unnecessary there. Remove them to keep the code clean.

… paths

- Parametrize test_compute_primary_labels against all 36 fixtures: verifies compute_primary_labels output against ground truth stored in meta.json, covering _compute_primary_labels_from_inline (identity branch) and _compute_primary_labels_from_fields across all cases.
- Add test_primary_labels_from_inline_separate_column: exercises the branch where primaryLabelId != labelId and a separate column exists.
- Add test_primary_labels_from_inline_fallback_identity: exercises the fallback branch where the primary label column is absent.
- Add test_primary_labels_from_fields_skips_non_string: exercises the continue branch for non-string label_values / primary_label_values.
- Add test_for_arrow_table_without_execution_response: tests DataFrameFactory.for_arrow_table with no execution_response, covering the no-server-needed path and verifying empty metadata fields.
- Add test_arrow_converter_unknown_types_mapper: tests the ValueError raised for unrecognised types_mapper values.

hkad98 · 2026-03-30T11:49:44Z

packages/gooddata-sdk/pyproject.toml

 ]

+[project.optional-dependencies]
+arrow = ["pyarrow>=16.1.0"]


Nitpick: this is a new dependency. Consider setting the threshold higher, e.g., pyarrow>=23.0.1

Jira: CQ-105 risk: low

no23reason · 2026-03-31T08:30:03Z

packages/gooddata-pandas/src/gooddata_pandas/dataframe.py

            grand_totals_position=grand_totals_position,
        )

+    def for_exec_def_arrow(


Hm, I would probably do this as a parameter to the existing functions. Because ideally, we would like to use Arrow only moving forward. So some opt-in parameter that we can then flip the default value of would be a better migration experience. Wdyt?

But the mappers make this a bit awkward 🤔
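The opt-in flag suggested in this review comment could look roughly like the sketch below. Everything here is hypothetical: `for_exec_def_sketch`, `use_arrow`, and the two loader helpers are placeholders, and the real for_exec_def() takes more parameters and returns a (DataFrame, DataFrameMetadata) pair.

```python
from typing import Any


def _load_via_json(exec_def: Any) -> str:
    """Placeholder for the existing JSON-based path."""
    return f"json:{exec_def}"


def _load_via_arrow(exec_def: Any) -> str:
    """Placeholder for the new Arrow-based path."""
    return f"arrow:{exec_def}"


def for_exec_def_sketch(exec_def: Any, *, use_arrow: bool = False) -> str:
    """Single entry point; callers opt in to Arrow with a keyword flag.

    Flipping the default of use_arrow later would migrate every caller
    that did not pass the flag explicitly.
    """
    loader = _load_via_arrow if use_arrow else _load_via_json
    return loader(exec_def)
```

Arrow-only options such as the types mapper would need to be accepted by the shared signature as well, which is the awkwardness the follow-up remark points at.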

Jira: CQ-105 risk: low

Martozar requested review from hkad98, jaceksan, lupko and pcerny as code owners March 30, 2026 08:02

Martozar force-pushed the c.mze-cq-105 branch from ee912b2 to a77ddb9 Compare March 30, 2026 08:04

Martozar added 10 commits March 30, 2026 10:26

feat: read arrow table

ad633a2

test: add ground-truth fixture tests for Arrow converter

9a87fbc

refactor: simplify

9f168ba

refactor: optional arrow

314d1c2

feat: add type mapper

e943384

Martozar force-pushed the c.mze-cq-105 branch from 0f482ec to 0423ca9 Compare March 30, 2026 08:29

Martozar and others added 4 commits March 30, 2026 10:43

fix: align with repo structure

737064a

fix: resolve ty type-check errors in gooddata-pandas

3a0c402

fix: add pyarrow to test deps

0380d40

Martozar force-pushed the c.mze-cq-105 branch from 7453528 to 0380d40 Compare March 30, 2026 09:22

Martozar added 2 commits March 30, 2026 11:35

fix: remove unnecessary type: ignore suppressions from dataframe.py

f670c8a

Martozar marked this pull request as draft March 30, 2026 10:47

test: improve code coverage

990ecfd

hkad98 reviewed Mar 30, 2026

Martozar added 2 commits March 31, 2026 10:28

refactor: remove magic strings

a1d2dc1

Jira: CQ-105 risk: low

test: test missing metadata keys

19938cf

Jira: CQ-105 risk: low

no23reason reviewed Mar 31, 2026

refactor: increase pyarrow version

9111af3

Jira: CQ-105 risk: low

Martozar wants to merge 21 commits into gooddata:master from Martozar:c.mze-cq-105

3 participants
