Reproduce experimental results

Hi, thank you for your great work. I am trying to reproduce the experimental results in the paper. Using the ToolLLM v2 model with the CoT method for reasoning, I obtained results of around 30%, which is close to the 32.3% originally reported in your paper. However, you have since updated the reasoning code, and the reported result has now reached 51.8%. Could you provide more details on the improvements you made? I am also wondering whether an incomplete cache might be the reason for my poorer results. I look forward to your reply. Thank you.

Reproduce experimental results #26

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Reproduce experimental results #26

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions