Skip to content

Conversation

yusun-nlp
Copy link
Contributor

Motivation

Add 0-shot evaluation methods and postprocessor for Smolinstruct benchmark in the chemistry science domain.

Modification

Added the 0-shot configs python files, and modify the smolinstruct postprocessor file.

Notes

Already tested on Deepseek-R1, Deepseek-V3-0324.

yusun-nlp added 3 commits May 26, 2025 21:20
Add 0-shot evaluation and postprocess functions for Smolinstruct
Copy link
Contributor

@MaiziXiao MaiziXiao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor

@MaiziXiao MaiziXiao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Collaborator

@liushz liushz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@MaiziXiao MaiziXiao merged commit d572761 into open-compass:main May 29, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants