Skip to content
View awestover's full-sized avatar

Block or report awestover

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. how-bad-can-ai-be how-bad-can-ai-be Public

    Python

  2. misalignment-by-default misalignment-by-default Public

    Python 2 1

  3. DQN-maze-solver DQN-maze-solver Public

    Investigating whether or not RL agents can acausally collaborate with other instances of themselves.

    Python 1

  4. transformer-shortest-paths transformer-shortest-paths Public

    Experimentally evaluating transformer's generalization on a synthetic task

    HTML 1

  5. activation-steering-vs-prompting activation-steering-vs-prompting Public

    Is activation steering more powerful than prompting at mitigating deception in some current reasoning LLMs?

    Jupyter Notebook 1

  6. theland theland Public

    theland

    JavaScript 2