Hello,
I was wondering whether there is any difference between Answer Relevance and Answer Faithfulness. Conceptually there is, of course, but the code for training the LLM judges and for judging samples seems to be exactly the same for both. Would it make sense for the Answer Relevance metric to ignore the context during training and judging?
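
To make the suggestion concrete, here is a minimal sketch of what I have in mind. The function names and the "Question / Document / Answer" input format are hypothetical and are not taken from the project's code; they only illustrate that the faithfulness judge would see the context while the relevance judge would not.

```python
from typing import Optional


def build_judge_input(query: str, answer: str, context: Optional[str] = None) -> str:
    """Assemble the text a judge model would see (hypothetical format)."""
    parts = [f"Question: {query}"]
    # Only include the retrieved context when the metric depends on it.
    if context is not None:
        parts.append(f"Document: {context}")
    parts.append(f"Answer: {answer}")
    return "\n".join(parts)


def faithfulness_input(query: str, context: str, answer: str) -> str:
    # Faithfulness asks whether the answer is grounded in the context,
    # so the context has to be part of the judge input.
    return build_judge_input(query, answer, context=context)


def relevance_input(query: str, context: str, answer: str) -> str:
    # Relevance asks whether the answer addresses the question,
    # so in this sketch the context is deliberately left out.
    return build_judge_input(query, answer)
```

With something like this, the two judges would be trained and evaluated on different inputs, rather than both receiving the query, context, and answer.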