primeqa.mrc.metrics.nq_f1.nq_eval.compute_optimal_metrics

primeqa.mrc.metrics.nq_f1.nq_eval.compute_optimal_metrics(long_answer_stats: List[List[Union[bool, float]]], short_answer_stats: List[List[Union[bool, float]]]) → Dict[str, float]

Computes overall metrics for long and short answers at their respective optimal thresholds.

Parameters

long_answer_stats: List of long answer scores.
short_answer_stats: List of short answer scores.

Returns

Ordered dictionary mapping each metric name (string) to its score.
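To illustrate what picking an "optimal threshold" from such stats can look like, here is a minimal, self-contained sketch. It does not call PrimeQA and is not the library's implementation. It assumes each stats entry has the layout `[has_gold, has_pred, is_correct, score]`, as in the original Natural Questions evaluation script; that layout and the helper name `best_f1_threshold` are assumptions for illustration, not something this page confirms.

```python
from typing import List, Tuple, Union

# Assumed entry layout (not confirmed by this page):
# [has_gold, has_pred, is_correct, score]
Stat = List[Union[bool, float]]


def best_f1_threshold(stats: List[Stat]) -> Tuple[float, float, float, float]:
    """Hypothetical helper: return (f1, precision, recall, threshold) for the
    score cutoff that maximizes F1 when predictions scoring at or above the
    cutoff are kept."""
    total_has_gold = sum(1 for has_gold, _, _, _ in stats if has_gold)
    # Walk predictions from most to least confident.
    ordered = sorted(stats, key=lambda s: s[3], reverse=True)

    best = (0.0, 0.0, 0.0, float("inf"))
    correct = 0
    predicted = 0
    for i, (_, has_pred, is_correct, score) in enumerate(ordered):
        correct += int(bool(is_correct))
        predicted += int(bool(has_pred))
        # Tied scores share a cutoff, so evaluate only after the last tie.
        if i + 1 < len(ordered) and ordered[i + 1][3] == score:
            continue
        precision = correct / predicted if predicted else 0.0
        recall = correct / total_has_gold if total_has_gold else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        if f1 > best[0]:
            best = (f1, precision, recall, score)
    return best


# Made-up example data, for illustration only.
example_stats = [
    [True, True, True, 0.9],
    [True, True, False, 0.8],
    [False, True, False, 0.3],
    [True, True, True, 0.7],
]
f1, precision, recall, threshold = best_f1_threshold(example_stats)
print(f"f1={f1:.3f} precision={precision:.3f} recall={recall:.3f} threshold={threshold}")
```

Under these assumptions, the same search would be run once over the long answer stats and once over the short answer stats, with the results collected into the returned dictionary.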