Skip to content
@hkust-nlp

NLP Group @ HKUST

We are a group of NLP researchers in the Hong Kong University of Science and Technology

Pinned Loading

  1. simpleRL-reason simpleRL-reason Public

    Simple RL training for reasoning

    Python 3.8k 279

  2. CodeIO CodeIO Public

    [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

    Python 559 32

  3. deita deita Public

    Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

    Python 575 32

  4. ceval ceval Public

    Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

    Python 1.8k 83

  5. AgentBoard AgentBoard Public

    An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]

    SAS 362 37

  6. Toolathlon Toolathlon Public

    The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

    Python 135 9

Repositories

Showing 10 of 26 repositories
  • Toolathlon Public

    The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

    hkust-nlp/Toolathlon’s past year of commit activity
    Python 135 9 2 1 Updated Nov 13, 2025
  • COMP4901B-LLMs Public

    "Large Language Models" Course (COMP4901B) offered in HKUST

    hkust-nlp/COMP4901B-LLMs’s past year of commit activity
    Python 7 8 0 0 Updated Nov 12, 2025
  • deepsearch-tts Public

    Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification

    hkust-nlp/deepsearch-tts’s past year of commit activity
    Python 20 1 1 0 Updated Oct 8, 2025
  • RL-Verifier-Robustness Public

    From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.

    hkust-nlp/RL-Verifier-Robustness’s past year of commit activity
    Python 23 MIT 1 0 0 Updated Oct 7, 2025
  • WebExplorer Public

    The official repo of "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"

    hkust-nlp/WebExplorer’s past year of commit activity
    Python 87 1 0 0 Updated Sep 29, 2025
  • model-task-align-rl Public

    The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".

    hkust-nlp/model-task-align-rl’s past year of commit activity
    Python 15 MIT 0 0 0 Updated Sep 3, 2025
  • simpleRL-reason Public

    Simple RL training for reasoning

    hkust-nlp/simpleRL-reason’s past year of commit activity
    Python 3,786 MIT 279 29 1 Updated Aug 3, 2025
  • ceval Public

    Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

    hkust-nlp/ceval’s past year of commit activity
    Python 1,780 MIT 83 6 0 Updated Jul 27, 2025
  • mstar Public

    [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning

    hkust-nlp/mstar’s past year of commit activity
    69 MIT 3 1 0 Updated Jul 13, 2025
  • Laser Public

    Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping

    hkust-nlp/Laser’s past year of commit activity
    Python 59 4 3 0 Updated May 22, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…