NLP Group @ HKUST

simpleRL-reason Public

Simple RL training for reasoning

Python 3.8k 279

CodeIO Public

[ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 559 32

deita Public

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 575 32

ceval Public

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1.8k 83

AgentBoard Public

An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]

Toolathlon Public

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Python 135 9

Provide feedback