Skip to content

Conversation

maxreciprocate
Copy link
Collaborator

This PR fixes an indexing error when using a single q-head instead of two.
To reproduce the error on main: python examples/randomwalks/ilql_randomwalks.py '{"method": {"two_qs": false}}'

https://wandb.ai/sorry/trlx/reports/Single-Q-vs-two-Qs-on-randomwalks--Vmlldzo0MzA2NTc2

@maxreciprocate maxreciprocate requested a review from cat-state May 9, 2023 14:16
@Dahoas
Copy link
Collaborator

Dahoas commented Jun 14, 2023

Looks good to me. We can merge, unless this was addressed in a more recent pr? @maxreciprocate

@maxreciprocate
Copy link
Collaborator Author

@Dahoas It was not addressed in any of recent prs

@Dahoas
Copy link
Collaborator

Dahoas commented Jun 15, 2023

Let's merge then

@Dahoas Dahoas merged commit 65aafdd into main Jun 15, 2023
@maxreciprocate maxreciprocate deleted the fix-ilql-single-q-head branch June 15, 2023 16:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants