-
Notifications
You must be signed in to change notification settings - Fork 17
Open
Description
Hi
It seems the pretrained checkpoint has execution accuracy of 98.4 on Spider test set. Is this desired? It is significantly higher than table 1 (70.0).
The full result is
easy medium hard extra all
count 470 857 463 357 2147
===================== EXECUTION ACCURACY =====================
execution 0.996 0.974 0.983 0.994 0.984
The result is the same for CLLM and Pretrained LLM
Metadata
Metadata
Assignees
Labels
No labels