Skip to content
View yang-su2000's full-sized avatar
👀
You found me!
👀
You found me!

Block or report yang-su2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
yang-su2000/README.md

Welcome to Yang's GitHub

Hi there! I am a leading explorer on fundamental agentic research at Qwen Team.

Some topics I am interested in include

  • code/tool centric agentic rl (collaborative/adversarial training)
  • context/memory rl (online learning)
  • environment alignment (self-evolve and scaling algorithms)

I am happy to chat and discuss potential collaborations, feel free to reach out by

Linkedin Twitter Gmail WeChat

🌟 Studying Zone

(2025.06) My thought about agentic rl training.

(2024.09-) I joined Qwen Team as a researcher 🥝!

(2024.01-) I am part-time collaborating with Cornell ICPC and Millennium to build LLMs for code and data generation.

  • This work is called ALICE (Aligning Language models for Interactive Code Execution), find more about it at alicellm.github.io.
  • ALICE is a meta-agent collaboration system that generates high-quality data through multi-turn interactions and feedback without human intervention.
  • It produces multimodal data with traces from agent strategies like ReAct and Reflexion, which are scarce but offer potential for aligning advanced LLMs.

(2023-2024) I I led the prior work of ALICE called Voice2Action with Cornell XRC, an Unity Package for real-time code execution in VR; and studied on large-scale generation augmented retrieval systems (opposed to RAG) at Cornell NLP.

(2021-2022) I interned on graph machine learning at AWS AI Lab and contributed to the open source Deep Graph Library.

👀 Chilling Zone

I like programming! I lead the "Cornell Tech" Group at Cornell ICPC and won the Top 20% in 2023 Regional!

I enjoy cooking, listening to music of all forms, playing ping-pong, reading science fiction, and more!

LeetCode CodeForces Visitors

⚡ Developing Zone

Pinned Loading

  1. Voice2Action Voice2Action Public

    ALICE and its prior work, Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality

    C# 41 6

  2. RageAgainstThePixel/com.openai.unity RageAgainstThePixel/com.openai.unity Public

    A Non-Official OpenAI Rest Client for Unity (UPM)

    C# 575 85

  3. boson-ai/RPBench-Auto boson-ai/RPBench-Auto Public

    An automated pipeline for evaluating LLMs for role-playing.

    Python 202 10

  4. luyug/GradCache luyug/GradCache Public

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

    Python 417 26

  5. Authorship-Identification-with-NLP Authorship-Identification-with-NLP Public

    Large-scale user portarit ranking and generation augmented retrieval systems.

    Jupyter Notebook 6 1

  6. dmlc/dgl dmlc/dgl Public

    Python package built to ease deep learning on graph, on top of existing DL frameworks.

    Python 14.1k 3.1k