-
Northeastern University
- Shengyang
Pinned
-
DeepSpeed-Chat-Extension (Public)
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
-
NiuTrans/Vision-LLM-Alignment (Public)
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
-
NiuTrans/GRAM (Public)
Code for the ICML 2025 paper "GRAM: A Generative Foundation Reward Model for Reward Generalization".