Skip to content

Conversation

vwxyzjn
Copy link
Owner

@vwxyzjn vwxyzjn commented Apr 13, 2022

Description

This PR sets the total-timesteps of ppo_continuous_action.py to be 1M, which matches the same timesteps budget in the original paper. We will defer the documentation of this change when we do the ppo benchmark soon (see #121)

Types of changes

  • Bug fix
  • New feature
  • New algorithm
  • Documentation

@vercel
Copy link

vercel bot commented Apr 13, 2022

This pull request is being automatically deployed with Vercel (learn more).
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/vwxyzjn/cleanrl/4Zs7uiDa8EB3aRXPmmi1X6y8PLpk
✅ Preview: https://cleanrl-git-align-ppo-timesteps-vwxyzjn.vercel.app

@gitpod-io
Copy link

gitpod-io bot commented Apr 13, 2022

@vwxyzjn vwxyzjn merged commit 443bb14 into master Apr 14, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant