def sample(self, batch_size: int) -> Tuple[TEN, TEN, TEN, TEN, TEN, TEN]:
    ids = th.randint(self.cur_size - 1, size=(batch_size,), requires_grad=False)
    return (
        self.states[ids],
        self.actions[ids],
        self.rewards[ids],
        self.undones[ids],
        self.unmasks[ids],
        self.states[ids + 1],
    )
Is this a good way to handle it? I think other people use dictionaries for this.
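
For comparison, here is a rough sketch of what a dictionary-based `sample` might look like. `DictReplayBuffer`, its constructor arguments, and the key names are hypothetical and only for illustration; they are not taken from the library. The sketch assumes the same layout as the code above, where the next state is stored at index `ids + 1`.

```python
from typing import Dict

import torch as th


class DictReplayBuffer:
    """Hypothetical buffer for illustration only; not the library's class."""

    def __init__(self, max_size: int, state_dim: int, action_dim: int):
        self.cur_size = 0  # number of transitions currently stored
        self.states = th.zeros((max_size, state_dim))
        self.actions = th.zeros((max_size, action_dim))
        self.rewards = th.zeros((max_size, 1))
        self.undones = th.zeros((max_size, 1))
        self.unmasks = th.zeros((max_size, 1))

    def sample(self, batch_size: int) -> Dict[str, th.Tensor]:
        # Draw ids from [0, cur_size - 1) so that ids + 1 is always a valid index.
        ids = th.randint(self.cur_size - 1, size=(batch_size,))
        return {
            "state": self.states[ids],
            "action": self.actions[ids],
            "reward": self.rewards[ids],
            "undone": self.undones[ids],
            "unmask": self.unmasks[ids],
            "next_state": self.states[ids + 1],
        }
```

With this version the caller reads fields by name, for example `batch["reward"]`, instead of relying on the position of each tensor in the returned tuple.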