primeqa.mrc.metrics.nq_f1.nq_eval.score_long_answer#
- primeqa.mrc.metrics.nq_f1.nq_eval.score_long_answer(gold_label_list, pred_label, long_non_null_threshold=None)#
Scores a long answer as correct or not.
First decide if there is a gold long answer with LONG_NO_NULL_THRESHOLD.
The prediction will get a match if: a. There is a gold long answer. b. The prediction span match exactly with one of the non-null gold
long answer span.
- Parameters
gold_label_list – A list of NQLabel, could be None.
pred_label – A single NQLabel, could be None.
long_non_null_threshold – Min number of non null spans in the annotations to consider the question as having a non null answer
- Returns
gold_has_answer, pred_has_answer, is_correct, score