Skip to content

[CB][Tests] Check output of scheduling tests on Spyre #337

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 31 commits into from
Jul 30, 2025

Conversation

sducouedic
Copy link
Collaborator

@sducouedic sducouedic commented Jul 25, 2025

This code adds some logic to check the ouput of the scheduling steps tests. The output checking is done only on Spyre to save some compute for the cpu tests.

[Important note]
The output comparison checking of scheduling tests don't pass on CPU neither do they on Spyre. Exact reasons why are not sure, on cpu it might be because of high entropy of the randomly generated prompt tokens.

Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
This reverts commit 0868f1e.

Signed-off-by: Sophie du Couédic <[email protected]>
This reverts commit 843e7ca.

Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
Signed-off-by: Sophie du Couédic <[email protected]>
Copy link

👋 Hi! Thank you for contributing to vLLM support on Spyre.
Just a reminder: Make sure that your code passes all the linting checks, otherwise your PR won't be able to be merged. To do so, first install the linting requirements, then run format.sh and commit the changes. This can be done with uv directly:

uv sync --frozen --group lint --active --inexact

Or this can be done with pip:

uv pip compile --group lint > requirements-lint.txt
pip install -r requirements-lint.txt
bash format.sh

Now you are good to go 🚀

@sducouedic sducouedic force-pushed the scheduling_tests_check_output branch from 877ac8e to 44b630d Compare July 25, 2025 14:34
Signed-off-by: Sophie du Couédic <[email protected]>
@sducouedic sducouedic force-pushed the scheduling_tests_check_output branch from 44b630d to 71315bd Compare July 25, 2025 15:06
@sducouedic sducouedic marked this pull request as ready for review July 25, 2025 23:34


@pytest.fixture
def set_random_seed(request):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add a type hint for the request parameter?

Copy link
Collaborator

@maxdebayser maxdebayser left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree with the other reviews that the compare with HF block can be factored out into a function. Other than that, this PR LGTM.

Copy link
Collaborator

@prashantgupta24 prashantgupta24 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The output comparison checking of scheduling tests don't pass on CPU neither do they on Spyre. Exact reasons why are not sure, on cpu it might be because of high entropy of the randomly generated prompt tokens.

That's concerning. Blocking this PR until we get to the bottom of that

Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24
Copy link
Collaborator

I'm also not sure if we want to go down the route of comparing outputs against transformers or do we want to switch to comparing outputs against static batching?

Copy link
Collaborator

@prashantgupta24 prashantgupta24 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All tests passed on eager + spyre!

Copy link
Collaborator

@rafvasq rafvasq left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the updates @prashantgupta24!

Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
Signed-off-by: Prashant Gupta <[email protected]>
@prashantgupta24 prashantgupta24 merged commit f0b3ead into main Jul 30, 2025
17 checks passed
@prashantgupta24 prashantgupta24 deleted the scheduling_tests_check_output branch July 30, 2025 17:34
yannicks1 added a commit that referenced this pull request Aug 5, 2025
…scheduler step tests (#360)

### [docs][CB] remove warning that no output correctness is asserted for
scheduler step tests


we now do assert output correctness for the scheduling test since #337 ,
hence this warning should be removed.

Signed-off-by: Yannick Schnider <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants