primeqa.mrc.metrics.nq_f1.nq_eval.compute_final_f1#
- primeqa.mrc.metrics.nq_f1.nq_eval.compute_final_f1(long_answer_stats, short_answer_stats)#
Computes overall F1 given long and short answers, ignoring scores.
Note: this assumes that the answers have been thresholded.
- Parameters
long_answer_stats – List of long answer scores.
short_answer_stats – List of short answer scores.
- Returns
Dictionary of name (string) -> score.