hi authors,
may I know have you include the experimental results and related code for the setting of VLM's Interactive Evaluation , under the chapter of 4.3. Comprehensive Ability of VLMs ?
I guess the Figure 5 radar chart is for Non-interactive Evaluation setting, right ?
thanks.