Reinforcement Learner, Researcher
-
ByteDance
- Singapore
- richardli.xyz
- @RichardYRLi
Pinned Loading
-
hijkzzz/Awesome-LLM-Strawberry
hijkzzz/Awesome-LLM-Strawberry PublicA collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
-
HyperAgent
HyperAgent PublicThe official code repo for HyperAgent algorithm published in ICML 2024.
Python 7
-
anthropics/claude-code
anthropics/claude-code PublicClaude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



