Skip to content

[MNT] Dockerized tests for CI runs using localhost#1629

Open
satvshr wants to merge 94 commits intoopenml:mainfrom
satvshr:i1614
Open

[MNT] Dockerized tests for CI runs using localhost#1629
satvshr wants to merge 94 commits intoopenml:mainfrom
satvshr:i1614

Conversation

@satvshr
Copy link
Contributor

@satvshr satvshr commented Jan 29, 2026

Metadata

Details

  • What does this PR implement/fix? Explain your changes.
    This PR implements the setting up of the v1 and v2 test servers in CI using docker via localhost.

PGijsbers and others added 11 commits January 20, 2026 12:35
Locally, MinIO already has more parquet files than on the test server.
Note that the previously strategy didn't work anymore if the server
returned a parquet file, which is the case for the new local setup.
This means it is not reliant on the evaluation engine processing the
dataset. Interestingly, the database state purposely seems to keep
the last task's dataset in preparation explicitly (by having
processing marked as done but having to dataset_status entry).
@codecov-commenter
Copy link

codecov-commenter commented Jan 29, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 52.72%. Comparing base (39daaef) to head (27d3aba).

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1629      +/-   ##
==========================================
- Coverage   53.07%   52.72%   -0.35%     
==========================================
  Files          37       37              
  Lines        4381     4381              
==========================================
- Hits         2325     2310      -15     
- Misses       2056     2071      +15     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@satvshr satvshr marked this pull request as ready for review January 31, 2026 16:13
@satvshr satvshr marked this pull request as draft January 31, 2026 16:14
Copy link
Collaborator

@geetu040 geetu040 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think any failing test is coming from this PR. It would be better to conditionally skip them and link #1657. If there is new failure message which is not already mentioned there, then please comment down the failure with the failing tests so it could be tracked there. Also if some tests are failing because of pandas, create a separate issue for that, skip and link to these then.

@satvshr satvshr requested a review from geetu040 February 24, 2026 15:14
Copy link
Collaborator

@geetu040 geetu040 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, this has not been addressed yet #1629 (comment)

Copy link
Collaborator

@geetu040 geetu040 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are so many changes not related to the issue. These must be coming from upgraded pre-commit in your local setting. Can you undo these changes? This PR should touch only the docker related part.

@geetu040
Copy link
Collaborator

The PR looks good, tests are also passing. I will approve this as #1629 (review) is addressed.

Copy link
Collaborator

@geetu040 geetu040 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it complete from your side? Looks fine to me.

Also are we sure we have skipped all the right tests?

I will also take a look at the skips, match them with reported tests locally and on the issue #1657

@satvshr
Copy link
Contributor Author

satvshr commented Feb 27, 2026

Is it complete from your side? Looks fine to me.

Yes, it is.

Also are we sure we have skipped all the right tests?

I did not use your issue as reference but rather the failures which I observed in this PR alone, in the links mentioned here in the comment I left on the issue itself.

I will also take a look at the skips, match them with reported tests locally and on the issue #1657

I have updated the comment linked above on #1657 but feel free to cross-verify if you want to.

Copy link
Collaborator

@geetu040 geetu040 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work on this.

Tests that pass on the remote test server but fail on the local test server are currently skipped, and we're tracking those in #1657

FYI: running the docker servers in CI takes less than 3 minutes and the tests that previously relied on remote-test-server now using the local-test-server are much faster and reliable since the probability for race-conditions is highly reduced.

@fkiraly @PGijsbers please review and merge.

@PGijsbers
Copy link
Collaborator

I looked at this PR just now and I think we should first investigate the reason for the failure of the tests. Has anyone tried this locally? Because when I try it locally it works (spin up services, point python to localhost:8000 and run tests). There are a few small exceptions, but the vast majority of marked tests pass. This makes me think it is rather an issue with the deployment on GH CI, perhaps there are issues with write permissions or so.

@geetu040
Copy link
Collaborator

geetu040 commented Mar 4, 2026

Has anyone tried this locally?

@PGijsbers Yes, they fail locally as well.

I have added some details on the failures in #1657

Quoting from above

Could it be a problem with the arff-to-pq service in docker-compose of openml/services? Since the image mentioned openml-arff-to-pq-converter-cron is not available as a public frozen docker image on docker-hub and therefore the service fallbacks to use the context given in config/arff-to-pq-converter for normal user (non openml-dev team with access to the private image).
But this is just my guess, it can be totally irrelevant to the failure.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[MNT] Intermediate test plan

4 participants