Rework tests to facilitate parametrization and finer scale testing of changes #161

sckott · 2025-04-10T21:28:31Z

#83
#139

- Reworked the global job-submission fixture so that each test submits its own job. The submit_wdls fixture is removed; tests that used it now call a new utility function, submit_wdl, to submit jobs inside the test. These tests can now use @pytest.mark.parametrize to parametrize over paths to WDL dirs.
- The change above adds a bunch of new yml cassette files. PLEASE IGNORE THESE!
- Tests now run in parallel with the Python library pytest-xdist, which matters more now that there are more distinct tests to run. It is listed in our pyproject.toml, and you enable it by passing -n auto to pytest.
- The downside of replacing the global submit_wdls fixture with per-test job submission is that a full test suite run takes longer now: the last full run I tried on GH Actions took 45 min and then failed, so we may need to find ways to speed this up. However, the next change should help:
- Added a new GH Action, api-tests-changed.yml, which looks for changed WDL dirs and runs only their tests. It uses pytest's -k flag to select every test that uses a changed WDL dir, including parametrized tests.
- The same approach works locally, so folks can tweak a WDL repo, run just its tests with the -k flag, and not have to run the entire test suite.
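As a rough illustration of the per-test submission pattern described above, here is a minimal sketch. The body of submit_wdl is a stand-in (the real helper talks to the API and its signature is not shown in this conversation), and someOtherWdl is a made-up directory name used only for the example.

```python
import pytest

# Assumption: WDL directory names. helloHostname is mentioned in this PR's
# discussion; the other entry is a placeholder for illustration only.
WDL_DIRS = ["helloHostname", "someOtherWdl"]


def submit_wdl(wdl_dir: str) -> dict:
    """Hypothetical stand-in for the PR's submit_wdl utility.

    The real helper submits a job to the API; this stub only fabricates a
    job record so the sketch runs without a server.
    """
    return {"id": f"job-{wdl_dir}", "wdl_dir": wdl_dir}


# Each WDL dir becomes its own test case, for example
# test_submit_returns_job_id[helloHostname], so a failure names the WDL.
@pytest.mark.parametrize("wdl_dir", WDL_DIRS)
def test_submit_returns_job_id(wdl_dir):
    job = submit_wdl(wdl_dir)  # submitted inside the test, no shared fixture
    assert job["id"]
    assert job["wdl_dir"] == wdl_dir
```

Because the parametrized test id contains the directory name, running pytest with -k helloHostname would select only that case.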

TODO:

After merging we'll need to check the new gh action with some changes to WDL repos
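For anyone checking the new action, the selection step it performs might look roughly like the sketch below. This is not the workflow's actual logic, which is not shown in this conversation. It assumes WDL dirs sit at the repository root and that the set of dirs containing WDLs is already known; both function names are hypothetical.

```python
from pathlib import PurePosixPath


def changed_wdl_dirs(changed_files, wdl_dirs):
    """Top-level dirs touched by a change, limited to dirs known to hold WDLs."""
    # Files at the repository root (no "/") belong to no WDL dir, so skip them.
    touched = {PurePosixPath(f).parts[0] for f in changed_files if "/" in f}
    return sorted(touched & set(wdl_dirs))


def k_expression(dirs):
    """Join dir names into a pytest -k expression such as 'a or b'."""
    return " or ".join(dirs)


if __name__ == "__main__":
    changed = [
        "helloHostname/helloHostname.wdl",
        "tests/cromwellapi/test-call.py",  # not a WDL dir, ignored
        "README.md",                       # repository root, ignored
    ]
    dirs = changed_wdl_dirs(changed, {"helloHostname", "someOtherWdl"})
    print(k_expression(dirs))
```

If no WDL dir changed, the expression is empty, and the caller should skip the run rather than hand pytest an empty -k value.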


seankross · 2025-04-10T22:50:37Z

Looking extremely reasonable so far.

sckott · 2025-04-15T21:12:34Z

I'm trying this out by running the rewrite cassettes workflow with this branch

I'm also getting retry failures when running tests in parallel. I checked whether that changes when we do not run in parallel, and it does: we get back to our failures due to the globbing thing. My guess is that we were running out of connections, so we either need fewer simultaneous connections or no parallel runs at all. These errors were masking other errors. All set now: we just can't have too many workers.
Now trying fewer workers than auto gives, to see what the optimal number is.
- The last try used 24 workers and took 45 min, which I think is the fastest we're going to get. 48 workers gave errors again, so the number of workers should be 24 or maybe a bit higher. Let's just go with 24 for now.
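The worker choice described above (a fixed count on GH Actions, auto elsewhere) can be sketched as follows. In this PR the switch lives in the makefile, so the Python below is only an illustration, and the function names are hypothetical. It relies on GitHub Actions setting the GITHUB_ACTIONS environment variable to "true" on its runners.

```python
import os


def xdist_workers(env=None) -> str:
    """Pick the value for pytest-xdist's -n option."""
    env = os.environ if env is None else env
    # GitHub Actions sets GITHUB_ACTIONS=true; 48 workers produced errors
    # in this PR's trials, so cap CI at 24.
    if env.get("GITHUB_ACTIONS") == "true":
        return "24"
    # On a personal machine, let xdist size the pool itself.
    return "auto"


def pytest_args(env=None) -> list:
    """Build the parallelism arguments to pass to pytest."""
    return ["-n", xdist_workers(env)]


if __name__ == "__main__":
    print(" ".join(["pytest", *pytest_args()]))
```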


tefirman

Really like this approach of spelling out which WDLs failed during test-successes.py, test-failures.py, etc. I think it'll help a ton when reading GHA check failures.

Also a huge fan of the individual cassette approach for each WDL: changes are much more isolated when updating one of the unit tests. api-tests-changed.yml looks good, just one small note about -n auto. We'll definitely want to give it a test run once it's merged.

My only other suggestion is to update the function docstrings accordingly, specifically the input argument descriptions. I'm pretty sure I understand what each of them does now, but I don't want us to confuse ourselves in the future.

tests/cromwellapi/test-call.py

tests/cromwellapi/submit_wdl.py

tests/cromwellapi/test-call.py

.github/workflows/api-tests-changed.yml

sckott · 2025-04-16T22:06:40Z

For the failing test check, I'm not sure if it's all due to #160 and the globbing issue or not. 🤷🏽

sckott · 2025-04-16T22:07:45Z

Thanks for looking @tefirman - Made changes. Yeah I think we'll benefit from seeing what actual WDLs are failing rather than having to manually go in and find which one is failing.

tefirman · 2025-04-16T22:30:22Z

> For the failing test check, I'm not sure if it's all due to #160 and the globbing issue or not. 🤷🏽

I'm seeing a lot of "connection refused" errors for the validation tests (helloHostname is a good example), feels unrelated...

seankross · 2025-04-17T14:14:39Z

Given the changes on dev, what do we need to do here to make sure the tests pass appropriately?

sckott · 2025-04-17T16:05:07Z

@seankross I think we'd just need to do a rewrite of the cassettes locally and then push up on this branch, because the test that's failing is doing a cached run. I'll do that.

tefirman · 2025-04-17T16:28:48Z

> I think we'd just need to do a rewrite of the cassettes locally and then push up on this branch, because the test that's failing is doing a cached run. I'll do that.

Ahhhh, that makes total sense. I was going to say, I ran things locally and things seemed to be working fine on my end. Glad to know it's just a re-run situation.

sckott · 2025-04-17T17:21:48Z

@seankross @tefirman okay, rewrite locally done, and running now, hopefully we get green

sckott · 2025-04-17T17:31:43Z

we're all set on the cached api tests now https://github.com/FredHutch/wdl-unit-tests/actions/runs/14521232867/job/40742413310?pr=161

but I see I need to fiddle with api-tests-changed.yml now, just a sec

sckott · 2025-04-17T17:34:38Z

@seankross @tefirman okay, we're all green on tests, good to merge?

sckott · 2025-04-17T17:34:47Z

I think I need one more approval @tefirman

tefirman

Thanks @sckott !

sckott added 10 commits March 13, 2025 13:51

use parametrization approch #83 for just two tests: failures and call

2e3eb7a

merge from main

f992777

use pytest-xdist for parallel tests

36a8cb1

merge

b54a4f1

remove submit_wdls fixture; replace it with separate job submission for each test

9bc1ba0

remove mocked_submissions.json that was a global cache of submitted job ids with use with submit_wdls fixture thats now removed

57aae2d

record new cassettes and submission job id mocks

c5c4e68

more files

fa0be96

use utility fxn in validate test

35a8973

add new gh action for running rewrite tests for each changed WDL dir

1e59066

sckott added the infrastructure label (Infrastructure fix to execute WDL GitHub Actions) Apr 10, 2025

sckott requested review from seankross and tefirman April 10, 2025 21:28

sckott and others added 2 commits April 15, 2025 10:32

Merge branch 'main' into parametrize-paths

2762361

dont use paralell tests, see if that fixes the retry failures

9027fc7

sckott added 5 commits April 15, 2025 14:33

pytest paralell workers: detect if running on gh actions, use 12 workers if so, else auto for personal machine use

eee7859

remove testing line in makefile

62987fe

makefile: 24 workers on gh actions

a317fcb

makefile: 48 workers on gh actions

aaf021f

makefile: back to 24 workers on gh actions

7b8d632

tefirman requested changes Apr 16, 2025

sckott added 3 commits April 16, 2025 14:34

docstring improvements

29e8579

docstrings for submit_wdl

e3bb228

set workers to 24 for pytest xdist

8d7b2dc

seankross approved these changes Apr 17, 2025

seankross added this to the v0.2 test infrastructure milestone Apr 17, 2025

rewrite cassettes after change to dev api for globs

f1ac2e3

sckott added 2 commits April 17, 2025 10:31

gitignore ruff cache

1e95472

modify changed-files rule to ignore directories that do not have WDLs

e9d1eb0

seankross requested review from tefirman and seankross April 17, 2025 17:36

seankross approved these changes Apr 17, 2025

tefirman approved these changes Apr 17, 2025

sckott merged commit 9a7580e into main Apr 17, 2025
45 checks passed

This was referenced Apr 17, 2025

Test new GHA for running on detected WDL file changes #164

Closed

Document new workflow for tests on a more granular level #165

Closed

tefirman mentioned this pull request Apr 24, 2025

158 test symlinks alone #159

Merged
