[CB][Tests] Check output of scheduling tests on Spyre #337
Signed-off-by: Sophie du Couédic <[email protected]>
This reverts commit 0868f1e. Signed-off-by: Sophie du Couédic <[email protected]>
This reverts commit 843e7ca. Signed-off-by: Sophie du Couédic <[email protected]>
…d returns it Signed-off-by: Sophie du Couédic <[email protected]>
👋 Hi! Thank you for contributing to vLLM support on Spyre.
Or this can be done with
Now you are good to go 🚀
877ac8e to 44b630d
44b630d to 71315bd
@pytest.fixture
def set_random_seed(request):
Can you add a type hint for the request parameter?
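A minimal sketch of what the requested annotation could look like, assuming pytest 6.2 or later, where `pytest.FixtureRequest` is part of the public API. The fixture body is not shown on this page, so the seeding logic and the `seed_for_test` helper below are hypothetical and only illustrate where the type hint goes.

```python
import random
import zlib

import pytest


def seed_for_test(test_name: str) -> int:
    """Derive a stable 32-bit seed from a test name (hypothetical helper)."""
    return zlib.crc32(test_name.encode("utf-8"))


@pytest.fixture
def set_random_seed(request: pytest.FixtureRequest) -> None:
    # Assumption: the fixture seeds Python's RNG once per test.
    # The real body of this fixture is not visible in the review thread.
    random.seed(seed_for_test(request.node.name))
```

With the annotation in place, editors and type checkers can resolve attributes such as `request.node` instead of treating `request` as untyped.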
I agree with the other reviews that the compare with HF block can be factored out into a function. Other than that, this PR LGTM.
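One possible shape for such a helper, as a sketch only: the `Completion` container and the `find_output_mismatches` name are hypothetical and not taken from this PR. The function takes the outputs under test and a reference set (for example, outputs produced with HF transformers) and returns a list of mismatch descriptions, so each test can assert on a single call.

```python
from dataclasses import dataclass


@dataclass
class Completion:
    """Hypothetical container for one generated sequence."""
    text: str
    token_ids: list[int]


def find_output_mismatches(
    actual: list[Completion],
    reference: list[Completion],
) -> list[str]:
    """Return a human-readable description of every mismatch found."""
    if len(actual) != len(reference):
        return [f"expected {len(reference)} completions, got {len(actual)}"]

    mismatches = []
    for i, (got, want) in enumerate(zip(actual, reference)):
        # Compare token ids first: they are the more precise signal.
        if got.token_ids != want.token_ids:
            mismatches.append(f"request {i}: token ids differ")
        elif got.text != want.text:
            mismatches.append(f"request {i}: text differs")
    return mismatches
```

A test would then reduce the comparison block to `assert not find_output_mismatches(actual, reference)`, and the returned list doubles as the failure message.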
Signed-off-by: Prashant Gupta <[email protected]>
The output comparison checks of the scheduling tests pass neither on CPU nor on Spyre. The exact reasons are unclear; on CPU it might be due to the high entropy of the randomly generated prompt tokens.
That's concerning. Blocking this PR until we get to the bottom of that.
I'm also not sure whether we want to go down the route of comparing outputs against transformers, or whether we want to switch to comparing outputs against static batching.
All tests passed on eager + spyre!
Thanks for the updates @prashantgupta24!
…scheduler step tests (#360)
### [docs][CB] remove warning that no output correctness is asserted for scheduler step tests
We now assert output correctness for the scheduling tests since #337, hence this warning should be removed.
Signed-off-by: Yannick Schnider <[email protected]>
This PR adds logic to check the output of the scheduling step tests. The output check runs only on Spyre, to save compute in the CPU tests.
[Important note]
The output comparison checks of the scheduling tests pass neither on CPU nor on Spyre. The exact reasons are unclear; on CPU it might be due to the high entropy of the randomly generated prompt tokens.