primeqa.mrc.metrics.tydi_f1.tydi_eval.score_answers#

primeqa.mrc.metrics.tydi_f1.tydi_eval.score_answers(gold_annotation_dict, pred_dict, passage_non_null_threshold, span_non_null_threshold, verbose, skip_missing_example_ids=True, minimal_offsets_per_passage=False)#

Scores all answers for all documents.

Parameters

gold_annotation_dict – a dict from example id to list of `TyDiLabel`s.
pred_dict – a dict from example id to list of `TyDiLabel`s.
passage_non_null_threshold –
span_non_null_threshold – minimal number of non-null annotations per example to be considered non-null
verbose – whether to enable verbose logging
skip_missing_example_ids – skip missing examples
minimal_offsets_per_passage – whether minimal answer offsets are per passage (as opposed to per document)

Returns

List of scores for passage answers. minimal_answer_stats: List of scores for minimal answers.

Return type

passage_answer_stats