BioASQ Participants Area
Task 10b: Test Results of Phase B
The test results are presented in separate tables for each type of annotation. The "System Description" of each system is used.The evaluation measures that are used in Task B are presented here .
Warning: For ideal answers, good ROUGE results do not always imply good manual scores.
Test batch 1
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| bio-answerfinder | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3824 | 0.4118 | 0.3897 | 0.5552 | 0.6107 | 0.5639 |
| bio-answerfinder-2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3824 | 0.4118 | 0.3897 | 0.5552 | 0.6107 | 0.5639 |
| Knowledge graph QA | 0.6087 | 0.7097 | 0.4000 | 0.5548 | 0.2647 | 0.4412 | 0.3480 | 0.4481 | 0.3964 | 0.3540 |
| orpheus_kg | 0.9565 | 0.9714 | 0.9091 | 0.9403 | 0.1471 | 0.1471 | 0.1471 | 0.4233 | 0.4321 | 0.4240 |
| LaRSA | 0.9565 | 0.9697 | 0.9231 | 0.9464 | 0.2941 | 0.5588 | 0.4118 | 0.5714 | 0.4821 | 0.4959 |
| UDEL-LAB1 | 0.9565 | 0.9714 | 0.9091 | 0.9403 | 0.3529 | 0.5882 | 0.4397 | 0.6974 | 0.8226 | 0.7346 |
| UDEL-LAB2 | 0.9565 | 0.9714 | 0.9091 | 0.9403 | 0.3824 | 0.5588 | 0.4534 | 0.7201 | 0.8405 | 0.7469 |
| UDEL-LAB3 | 0.9565 | 0.9714 | 0.9091 | 0.9403 | 0.2941 | 0.4412 | 0.3578 | 0.6893 | 0.7429 | 0.6731 |
| UDEL-LAB4 | 0.9565 | 0.9714 | 0.9091 | 0.9403 | 0.3235 | 0.4412 | 0.3775 | 0.6762 | 0.8464 | 0.7229 |
| UDEL-LAB5 | 0.9565 | 0.9714 | 0.9091 | 0.9403 | 0.3824 | 0.5882 | 0.4706 | 0.4714 | 0.8310 | 0.5810 |
| MQ-1 | 0.7391 | 0.8500 | - | 0.4250 | - | - | - | - | - | - |
| MQ-2 | 0.7391 | 0.8500 | - | 0.4250 | - | - | - | - | - | - |
| AUEB-System1 | 0.7391 | 0.8500 | - | 0.4250 | 0.2941 | 0.3529 | 0.3235 | 0.1143 | 0.0786 | 0.0905 |
| AUEB-System2 | 0.7391 | 0.8500 | - | 0.4250 | 0.2941 | 0.3824 | 0.3333 | 0.0536 | 0.0429 | 0.0476 |
| AUEB-System3 | 0.7391 | 0.8500 | - | 0.4250 | 0.2941 | 0.3824 | 0.3221 | 0.0571 | 0.0786 | 0.0633 |
| AUEB-System4 | 0.7391 | 0.8500 | - | 0.4250 | 0.2941 | 0.4118 | 0.3392 | 0.0607 | 0.0786 | 0.0667 |
| simple baseline solr | 0.6522 | 0.7895 | - | 0.3947 | 0.2941 | 0.2941 | 0.2941 | - | - | - |
| KU-AAA637-system1 | 0.9130 | 0.9412 | 0.8333 | 0.8873 | 0.3529 | 0.5294 | 0.4167 | 0.5629 | 0.5774 | 0.5310 |
| Ir_sys1 | 0.9565 | 0.9697 | 0.9231 | 0.9464 | 0.4118 | 0.5000 | 0.4559 | 0.5714 | 0.4702 | 0.4887 |
| Ir_sys2 | 0.9130 | 0.9412 | 0.8333 | 0.8873 | 0.3529 | 0.5000 | 0.4176 | 0.5595 | 0.4738 | 0.4948 |
| Ir_sys3 | 0.6957 | 0.8000 | 0.3636 | 0.5818 | 0.4118 | 0.5294 | 0.4559 | 0.6224 | 0.4881 | 0.5238 |
| lalala | 0.9565 | 0.9697 | 0.9231 | 0.9464 | 0.3824 | 0.5000 | 0.4363 | 0.6046 | 0.7286 | 0.6459 |
| bioinfo-0 | 0.4783 | 0.4545 | 0.5000 | 0.4773 | - | - | - | - | - | - |
| Ir_sys4 | 0.9565 | 0.9697 | 0.9231 | 0.9464 | 0.2353 | 0.4118 | 0.3039 | 0.7143 | 0.5619 | 0.6048 |
| bioinfo-1 | 0.6957 | 0.7407 | 0.6316 | 0.6862 | - | - | - | - | - | - |
| BioASQ_Baseline | 0.4783 | 0.4545 | 0.5000 | 0.4773 | 0.0588 | 0.2647 | 0.1186 | 0.3016 | 0.3750 | 0.2650 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| bio-answerfinder | 0.5057 | 0.3894 | 0.5063 | 0.3851 | 4.40 | 4.42 | 4.31 | 4.62 |
| bio-answerfinder-2 | 0.4303 | 0.3913 | 0.4339 | 0.3894 | 4.57 | 4.29 | 4.36 | 4.92 |
| Knowledge graph QA | 0.2491 | 0.1732 | 0.2526 | 0.1715 | 2.07 | 2.31 | 2.19 | 2.34 |
| orpheus_kg | 0.0807 | 0.0970 | 0.0808 | 0.0961 | 4.42 | 4.18 | 4.56 | 4.92 |
| LaRSA | - | - | - | - | - | - | - | - |
| UDEL-LAB1 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB2 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB3 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB4 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB5 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| MQ-1 | 0.5665 | 0.3511 | 0.5705 | 0.3418 | 4.14 | 4.51 | 4.01 | 4.29 |
| MQ-2 | 0.5526 | 0.3406 | 0.5561 | 0.3316 | 4.13 | 4.49 | 4.02 | 4.27 |
| AUEB-System1 | - | - | - | - | - | - | - | - |
| AUEB-System2 | - | - | - | - | - | - | - | - |
| AUEB-System3 | - | - | - | - | - | - | - | - |
| AUEB-System4 | - | - | - | - | - | - | - | - |
| simple baseline solr | 0.0694 | 0.1103 | 0.0508 | 0.0829 | 4.07 | 3.44 | 4.41 | 4.94 |
| KU-AAA637-system1 | - | - | - | - | - | - | - | - |
| Ir_sys1 | 0.5791 | 0.3512 | 0.5833 | 0.3414 | 4.13 | 4.56 | 4.07 | 4.28 |
| Ir_sys2 | 0.5569 | 0.3383 | 0.5614 | 0.3299 | 4.16 | 4.57 | 4.03 | 4.28 |
| Ir_sys3 | 0.0877 | 0.1284 | 0.0652 | 0.0954 | 4.04 | 3.80 | 4.37 | 4.91 |
| lalala | 0.5748 | 0.3464 | 0.5773 | 0.3367 | 4.17 | 4.47 | 4.10 | 4.23 |
| bioinfo-0 | - | - | - | - | - | - | - | - |
| Ir_sys4 | 0.2651 | 0.2396 | 0.2620 | 0.2329 | 4.06 | 3.41 | 3.69 | 4.70 |
| bioinfo-1 | - | - | - | - | - | - | - | - |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 2
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| simple baseline solr | 0.6111 | 0.7200 | 0.3636 | 0.5418 | 0.4118 | 0.4118 | 0.4118 | 0.2000 | 0.0611 | 0.0902 |
| AUEB-System1 | 0.6667 | 0.8000 | - | 0.4000 | 0.3824 | 0.5000 | 0.4363 | 0.3967 | 0.2324 | 0.2663 |
| AUEB-System2 | 0.6667 | 0.8000 | - | 0.4000 | 0.3235 | 0.5000 | 0.3995 | 0.1400 | 0.1574 | 0.1299 |
| AUEB-System3 | 0.6667 | 0.8000 | - | 0.4000 | 0.4118 | 0.4706 | 0.4363 | 0.2533 | 0.2407 | 0.2191 |
| AUEB-System4 | 0.6667 | 0.8000 | - | 0.4000 | 0.2647 | 0.4706 | 0.3578 | 0.2867 | 0.2204 | 0.2172 |
| NCU-IISR/AS-GIS-1 | 0.6667 | 0.8000 | - | 0.4000 | - | - | - | - | - | - |
| NCU-IISR/AS-GIS-2 | 0.6667 | 0.8000 | - | 0.4000 | - | - | - | - | - | - |
| NCU-IISR/AS-GIS-3 | 0.6667 | 0.8000 | - | 0.4000 | - | - | - | - | - | - |
| bio-answerfinder | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.4118 | 0.4706 | 0.4412 | 0.5254 | 0.4157 | 0.4485 |
| bio-answerfinder-2 | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.4118 | 0.4706 | 0.4412 | 0.5254 | 0.4157 | 0.4485 |
| orpheus_kg | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3529 | 0.3529 | 0.3529 | 0.3833 | 0.3546 | 0.3657 |
| LaRSA | 0.3889 | 0.1538 | 0.5217 | 0.3378 | 0.4412 | 0.5882 | 0.5098 | 0.6143 | 0.4281 | 0.4490 |
| MQ-1 | 0.6667 | 0.8000 | - | 0.4000 | - | - | - | - | - | - |
| MQ-2 | 0.6667 | 0.8000 | - | 0.4000 | - | - | - | - | - | - |
| UDEL-LAB1 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5882 | 0.6471 | 0.6103 | 0.6707 | 0.6530 | 0.6393 |
| UDEL-LAB2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5882 | 0.6471 | 0.6176 | 0.6914 | 0.7193 | 0.6787 |
| UDEL-LAB3 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5882 | 0.6765 | 0.6235 | 0.7042 | 0.7400 | 0.7051 |
| UDEL-LAB4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5882 | 0.7059 | 0.6206 | 0.6859 | 0.7530 | 0.7011 |
| KU-AAA637-system1 | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.5294 | 0.6471 | 0.5745 | 0.4975 | 0.4170 | 0.3949 |
| KU-AAA637-system2 | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.5294 | 0.6176 | 0.5686 | 0.5324 | 0.4337 | 0.4210 |
| KU-AAA637-system3 | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.5294 | 0.6471 | 0.5745 | 0.5324 | 0.4337 | 0.4210 |
| KU-AAA637-system4 | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.5588 | 0.6471 | 0.5907 | 0.5324 | 0.4337 | 0.4210 |
| KU-AAA637-system5 | 0.9444 | 0.9600 | 0.9091 | 0.9345 | 0.5588 | 0.6176 | 0.5833 | 0.5324 | 0.4337 | 0.4210 |
| bioinfo-0 | 0.5000 | 0.5263 | 0.4706 | 0.4985 | - | - | - | - | - | - |
| bioinfo-1 | 0.5000 | 0.5263 | 0.4706 | 0.4985 | - | - | - | - | - | - |
| bioinfo-2 | 0.6667 | 0.7273 | 0.5714 | 0.6494 | - | - | - | - | - | - |
| bioinfo-3 | 0.4444 | 0.5000 | 0.3750 | 0.4375 | - | - | - | - | - | - |
| bioinfo-4 | 0.5000 | 0.5263 | 0.4706 | 0.4985 | - | - | - | - | - | - |
| NCU-1 | 0.7778 | 0.8571 | 0.5000 | 0.6786 | - | - | - | - | - | - |
| simple truncation | 0.6667 | 0.8000 | - | 0.4000 | - | - | - | - | - | - |
| Ir_sys1 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5588 | 0.6176 | 0.5833 | 0.5785 | 0.4004 | 0.4247 |
| Ir_sys2 | 0.8333 | 0.8889 | 0.6667 | 0.7778 | 0.5588 | 0.6176 | 0.5882 | 0.5893 | 0.4763 | 0.4956 |
| Ir_sys3 | 0.8333 | 0.8889 | 0.6667 | 0.7778 | 0.5882 | 0.6471 | 0.6103 | 0.6226 | 0.4281 | 0.4504 |
| Ir_sys4 | 0.7222 | 0.7826 | 0.6154 | 0.6990 | 0.5882 | 0.6176 | 0.5980 | 0.6149 | 0.4504 | 0.4722 |
| lalala | 0.7222 | 0.7826 | 0.6154 | 0.6990 | 0.5882 | 0.6471 | 0.6039 | 0.4955 | 0.6067 | 0.5177 |
| BioASQ_Baseline | 0.4444 | 0.3750 | 0.5000 | 0.4375 | 0.0882 | 0.2647 | 0.1382 | 0.2892 | 0.4019 | 0.3053 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| simple baseline solr | 0.4464 | 0.2606 | 0.4547 | 0.2515 | 3.63 | 4.06 | 3.46 | 3.87 |
| AUEB-System1 | - | - | - | - | - | - | - | - |
| AUEB-System2 | - | - | - | - | - | - | - | - |
| AUEB-System3 | - | - | - | - | - | - | - | - |
| AUEB-System4 | - | - | - | - | - | - | - | - |
| NCU-IISR/AS-GIS-1 | 0.4704 | 0.4151 | 0.4772 | 0.4040 | 4.61 | 4.49 | 4.48 | 4.94 |
| NCU-IISR/AS-GIS-2 | 0.4840 | 0.4283 | 0.4907 | 0.4177 | 4.53 | 4.48 | 4.46 | 4.90 |
| NCU-IISR/AS-GIS-3 | 0.4824 | 0.4276 | 0.4897 | 0.4173 | 4.52 | 4.48 | 4.44 | 4.89 |
| bio-answerfinder | 0.5531 | 0.3683 | 0.5600 | 0.3609 | 4.27 | 4.59 | 4.19 | 4.46 |
| bio-answerfinder-2 | 0.4749 | 0.4031 | 0.4805 | 0.3963 | 4.51 | 4.44 | 4.37 | 4.89 |
| orpheus_kg | 0.1411 | 0.1714 | 0.1332 | 0.1636 | 4.47 | 4.29 | 4.44 | 4.92 |
| LaRSA | - | - | - | - | - | - | - | - |
| MQ-1 | 0.6053 | 0.3387 | 0.6184 | 0.3256 | 4.29 | 4.70 | 4.08 | 4.34 |
| MQ-2 | 0.6079 | 0.3387 | 0.6207 | 0.3258 | 4.26 | 4.70 | 4.08 | 4.37 |
| UDEL-LAB1 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB2 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB3 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| UDEL-LAB4 | - | - | - | - | 0.38 | 0.38 | 0.38 | 0.38 |
| KU-AAA637-system1 | - | - | - | - | - | - | - | - |
| KU-AAA637-system2 | - | - | - | - | - | - | - | - |
| KU-AAA637-system3 | - | - | - | - | - | - | - | - |
| KU-AAA637-system4 | - | - | - | - | - | - | - | - |
| KU-AAA637-system5 | - | - | - | - | - | - | - | - |
| bioinfo-0 | - | - | - | - | - | - | - | - |
| bioinfo-1 | - | - | - | - | - | - | - | - |
| bioinfo-2 | - | - | - | - | - | - | - | - |
| bioinfo-3 | - | - | - | - | - | - | - | - |
| bioinfo-4 | - | - | - | - | - | - | - | - |
| NCU-1 | 0.4881 | 0.4187 | 0.4943 | 0.4073 | 4.57 | 4.52 | 4.48 | 4.93 |
| simple truncation | 0.7085 | 0.3414 | 0.7128 | 0.3251 | 4.04 | 4.90 | 3.82 | 4.08 |
| Ir_sys1 | 0.6085 | 0.3421 | 0.6169 | 0.3272 | 4.28 | 4.77 | 4.09 | 4.33 |
| Ir_sys2 | 0.6128 | 0.3441 | 0.6282 | 0.3317 | 4.24 | 4.70 | 4.10 | 4.33 |
| Ir_sys3 | 0.6026 | 0.3388 | 0.6153 | 0.3254 | 4.23 | 4.76 | 4.10 | 4.31 |
| Ir_sys4 | 0.1114 | 0.1654 | 0.0864 | 0.1318 | 3.86 | 3.39 | 3.92 | 4.73 |
| lalala | 0.3131 | 0.2698 | 0.3210 | 0.2699 | 4.12 | 3.46 | 3.68 | 4.44 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 3
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| AUEB-System1 | 0.7600 | 0.8636 | - | 0.4318 | 0.3750 | 0.5313 | 0.4349 | 0.1818 | 0.1584 | 0.1292 |
| AUEB-System2 | 0.7600 | 0.8636 | - | 0.4318 | 0.4688 | 0.4688 | 0.4688 | 0.1409 | 0.1662 | 0.1421 |
| AUEB-System3 | 0.7600 | 0.8636 | - | 0.4318 | 0.3438 | 0.5313 | 0.4323 | 0.1152 | 0.1351 | 0.1065 |
| AUEB-System4 | 0.7600 | 0.8636 | - | 0.4318 | 0.5000 | 0.5313 | 0.5156 | 0.1273 | 0.0766 | 0.0669 |
| AUEB-System5 | 0.7600 | 0.8636 | - | 0.4318 | 0.3750 | 0.5625 | 0.4505 | 0.1227 | 0.1403 | 0.1128 |
| bio-answerfinder | 0.8800 | 0.9189 | 0.7692 | 0.8441 | 0.3750 | 0.4063 | 0.3906 | 0.6273 | 0.4472 | 0.4843 |
| bio-answerfinder-2 | 0.8800 | 0.9189 | 0.7692 | 0.8441 | 0.3750 | 0.4063 | 0.3906 | 0.6273 | 0.4472 | 0.4843 |
| LaRSA | 0.9200 | 0.9500 | 0.8000 | 0.8750 | 0.5313 | 0.6875 | 0.5990 | 0.4923 | 0.4128 | 0.4052 |
| MQ-1 | 0.7600 | 0.8636 | - | 0.4318 | - | - | - | - | - | - |
| MQ-2 | 0.7600 | 0.8636 | - | 0.4318 | - | - | - | - | - | - |
| NCU-1 | 0.8800 | 0.9268 | 0.6667 | 0.7967 | 0.4063 | 0.4063 | 0.4063 | 0.3030 | 0.1608 | 0.1971 |
| NCU-IISR/AS-GIS-1 | 0.8800 | 0.9268 | 0.6667 | 0.7967 | 0.4063 | 0.4063 | 0.4063 | 0.3030 | 0.1608 | 0.1971 |
| NCU-IISR/AS-GIS-2 | 0.8800 | 0.9268 | 0.6667 | 0.7967 | 0.4063 | 0.4063 | 0.4063 | 0.3030 | 0.1608 | 0.1971 |
| NCU-IISR/AS-GIS-3 | 0.8800 | 0.9268 | 0.6667 | 0.7967 | 0.4063 | 0.4063 | 0.4063 | 0.3030 | 0.1608 | 0.1971 |
| BioASQ-2022_UNCC | 0.8800 | 0.9231 | 0.7273 | 0.8252 | 0.5000 | 0.6875 | 0.5714 | - | - | - |
| UDEL-LAB1 | 0.9600 | 0.9730 | 0.9231 | 0.9480 | 0.5625 | 0.6563 | 0.6042 | 0.5442 | 0.6742 | 0.5655 |
| UDEL-LAB2 | 0.9600 | 0.9730 | 0.9231 | 0.9480 | 0.5313 | 0.6563 | 0.5885 | 0.5174 | 0.6591 | 0.5558 |
| UDEL-LAB4 | 0.9600 | 0.9730 | 0.9231 | 0.9480 | 0.5313 | 0.6250 | 0.5729 | 0.5263 | 0.5985 | 0.5188 |
| UDEL-LAB3 | 0.9600 | 0.9730 | 0.9231 | 0.9480 | 0.5000 | 0.6250 | 0.5469 | 0.5293 | 0.6439 | 0.5255 |
| UDEL-LAB5 | 0.9600 | 0.9730 | 0.9231 | 0.9480 | 0.5625 | 0.6563 | 0.5948 | 0.5447 | 0.5682 | 0.5094 |
| KU-AAA637-system1 | 0.8800 | 0.9189 | 0.7692 | 0.8441 | 0.5000 | 0.6875 | 0.5729 | 0.5203 | 0.4794 | 0.4201 |
| KU-AAA637-system2 | 0.9200 | 0.9444 | 0.8571 | 0.9008 | 0.5313 | 0.6875 | 0.5911 | 0.5152 | 0.4794 | 0.4192 |
| KU-AAA637-system3 | 0.9200 | 0.9444 | 0.8571 | 0.9008 | 0.5313 | 0.6875 | 0.5885 | 0.5152 | 0.4794 | 0.4192 |
| KU-AAA637-system4 | 0.8800 | 0.9189 | 0.7692 | 0.8441 | 0.5313 | 0.6875 | 0.5964 | 0.5195 | 0.4794 | 0.4212 |
| KU-AAA637-system5 | 0.8800 | 0.9189 | 0.7692 | 0.8441 | 0.4375 | 0.6875 | 0.5323 | 0.5126 | 0.4613 | 0.4083 |
| Fleming-3 | 0.8000 | 0.8837 | 0.2857 | 0.5847 | - | - | - | - | - | - |
| bioinfo-0 | 0.7200 | 0.7879 | 0.5882 | 0.6881 | - | - | - | - | - | - |
| bioinfo-1 | 0.8000 | 0.8649 | 0.6154 | 0.7401 | - | - | - | - | - | - |
| bioinfo-2 | 0.6800 | 0.7500 | 0.5556 | 0.6528 | - | - | - | - | - | - |
| bioinfo-3 | 0.7600 | 0.8421 | 0.5000 | 0.6711 | - | - | - | - | - | - |
| bioinfo-4 | 0.7200 | 0.7879 | 0.5882 | 0.6881 | - | - | - | - | - | - |
| new kgqa for yesno | 0.7200 | 0.8293 | 0.2222 | 0.5257 | 0.3438 | 0.5625 | 0.4245 | 0.1955 | 0.2333 | 0.2030 |
| extractive | 0.7600 | 0.8636 | - | 0.4318 | - | - | - | - | - | - |
| Ir_sys1 | 0.9200 | 0.9444 | 0.8571 | 0.9008 | 0.4688 | 0.6875 | 0.5573 | 0.5325 | 0.4552 | 0.4567 |
| Ir_sys2 | 0.9200 | 0.9444 | 0.8571 | 0.9008 | 0.4375 | 0.6563 | 0.5151 | 0.5189 | 0.3697 | 0.4000 |
| Ir_sys3 | 0.8400 | 0.9000 | 0.6000 | 0.7500 | 0.5000 | 0.6563 | 0.5625 | 0.5038 | 0.3652 | 0.3758 |
| Ir_sys4 | 0.8400 | 0.9000 | 0.6000 | 0.7500 | 0.5313 | 0.6250 | 0.5703 | 0.5195 | 0.3392 | 0.3740 |
| lalala | 0.9200 | 0.9444 | 0.8571 | 0.9008 | 0.5000 | 0.6250 | 0.5573 | 0.4418 | 0.6076 | 0.4769 |
| BioASQ_Baseline | 0.2400 | 0.0952 | 0.3448 | 0.2200 | 0.1563 | 0.4063 | 0.2526 | 0.1338 | 0.3478 | 0.1752 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| AUEB-System1 | - | - | - | - | - | - | - | - |
| AUEB-System2 | - | - | - | - | - | - | - | - |
| AUEB-System3 | - | - | - | - | - | - | - | - |
| AUEB-System4 | - | - | - | - | - | - | - | - |
| AUEB-System5 | - | - | - | - | - | - | - | - |
| bio-answerfinder | 0.4721 | 0.3219 | 0.4879 | 0.3190 | 4.30 | 4.64 | 4.26 | 4.58 |
| bio-answerfinder-2 | 0.3752 | 0.3471 | 0.3925 | 0.3474 | 4.61 | 4.61 | 4.43 | 4.97 |
| LaRSA | 0.4935 | 0.3222 | 0.4977 | 0.3086 | 4.54 | 4.40 | 3.90 | 4.73 |
| MQ-1 | 0.5446 | 0.3253 | 0.5508 | 0.3119 | 4.14 | 4.74 | 4.07 | 4.26 |
| MQ-2 | 0.5328 | 0.3179 | 0.5392 | 0.3046 | 4.17 | 4.76 | 4.10 | 4.23 |
| NCU-1 | 0.4072 | 0.3696 | 0.4106 | 0.3613 | 4.58 | 4.48 | 4.49 | 4.94 |
| NCU-IISR/AS-GIS-1 | 0.3787 | 0.3508 | 0.3786 | 0.3395 | 4.62 | 4.40 | 4.46 | 4.99 |
| NCU-IISR/AS-GIS-2 | 0.4054 | 0.3690 | 0.4083 | 0.3611 | 4.58 | 4.46 | 4.49 | 4.97 |
| NCU-IISR/AS-GIS-3 | 0.3910 | 0.3553 | 0.3959 | 0.3480 | 4.62 | 4.42 | 4.44 | 4.97 |
| BioASQ-2022_UNCC | - | - | - | - | - | - | - | - |
| UDEL-LAB1 | - | - | - | - | 0.36 | 0.36 | 0.36 | 0.36 |
| UDEL-LAB2 | - | - | - | - | 0.36 | 0.36 | 0.36 | 0.36 |
| UDEL-LAB4 | - | - | - | - | 0.36 | 0.36 | 0.36 | 0.36 |
| UDEL-LAB3 | - | - | - | - | 0.36 | 0.36 | 0.36 | 0.36 |
| UDEL-LAB5 | - | - | - | - | 0.36 | 0.36 | 0.36 | 0.36 |
| KU-AAA637-system1 | - | - | - | - | - | - | - | - |
| KU-AAA637-system2 | - | - | - | - | - | - | - | - |
| KU-AAA637-system3 | - | - | - | - | - | - | - | - |
| KU-AAA637-system4 | - | - | - | - | - | - | - | - |
| KU-AAA637-system5 | - | - | - | - | - | - | - | - |
| Fleming-3 | - | - | - | - | 1.28 | 0.96 | 1.19 | 1.39 |
| bioinfo-0 | - | - | - | - | - | - | - | - |
| bioinfo-1 | - | - | - | - | - | - | - | - |
| bioinfo-2 | - | - | - | - | - | - | - | - |
| bioinfo-3 | - | - | - | - | - | - | - | - |
| bioinfo-4 | - | - | - | - | - | - | - | - |
| new kgqa for yesno | 0.1960 | 0.1455 | 0.2060 | 0.1430 | 1.89 | 2.19 | 1.94 | 2.11 |
| extractive | 0.1304 | 0.0979 | 0.1274 | 0.0939 | 0.80 | 1.09 | 1.01 | 1.08 |
| Ir_sys1 | 0.5603 | 0.3408 | 0.5585 | 0.3238 | 4.11 | 4.72 | 4.06 | 4.28 |
| Ir_sys2 | 0.5677 | 0.3403 | 0.5778 | 0.3258 | 4.13 | 4.80 | 4.08 | 4.29 |
| Ir_sys3 | 0.2892 | 0.2512 | 0.3020 | 0.2548 | 4.09 | 3.81 | 3.63 | 4.64 |
| Ir_sys4 | 0.5241 | 0.3178 | 0.5334 | 0.3060 | 4.11 | 4.74 | 4.08 | 4.24 |
| lalala | 0.5677 | 0.3403 | 0.5778 | 0.3258 | 4.13 | 4.80 | 4.08 | 4.29 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 4
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| AUEB-System1 | 0.7083 | 0.8293 | - | 0.4146 | 0.2903 | 0.4516 | 0.3495 | 0.2917 | 0.1285 | 0.1571 |
| AUEB-System2 | 0.7083 | 0.8293 | - | 0.4146 | 0.3226 | 0.4516 | 0.3710 | 0.1000 | 0.0917 | 0.0933 |
| AUEB-System3 | 0.7083 | 0.8293 | - | 0.4146 | 0.2903 | 0.4516 | 0.3602 | 0.1694 | 0.1424 | 0.1446 |
| AUEB-System4 | 0.7083 | 0.8293 | - | 0.4146 | 0.2903 | 0.4839 | 0.3710 | 0.2250 | 0.2118 | 0.2032 |
| AUEB-System5 | 0.7083 | 0.8293 | - | 0.4146 | 0.2903 | 0.5484 | 0.3855 | 0.2222 | 0.1563 | 0.1741 |
| extractive | 0.7083 | 0.8293 | - | 0.4146 | - | - | - | - | - | - |
| Fleming-3 | 0.7500 | 0.8500 | 0.2500 | 0.5500 | - | - | - | - | - | - |
| bio-answerfinder | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.3548 | 0.4194 | 0.3871 | 0.3727 | 0.2701 | 0.2733 |
| bio-answerfinder-2 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.3548 | 0.4194 | 0.3871 | 0.3727 | 0.2701 | 0.2733 |
| bioinfo-0 | 0.7917 | 0.8387 | 0.7059 | 0.7723 | - | - | - | - | - | - |
| bioinfo-1 | 0.7917 | 0.8718 | 0.4444 | 0.6581 | - | - | - | - | - | - |
| bioinfo-2 | 0.8750 | 0.9032 | 0.8235 | 0.8634 | - | - | - | - | - | - |
| bioinfo-3 | 0.8333 | 0.8947 | 0.6000 | 0.7474 | - | - | - | - | - | - |
| bioinfo-4 | 0.7917 | 0.8387 | 0.7059 | 0.7723 | - | - | - | - | - | - |
| LaRSA | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.4516 | 0.6129 | 0.5129 | 0.4736 | 0.3104 | 0.3048 |
| BioASQ-2022_UNCC | 0.9167 | 0.9412 | 0.8571 | 0.8992 | 0.5161 | 0.6129 | 0.5645 | - | - | - |
| KU-AAA637-system1 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.4839 | 0.6452 | 0.5495 | 0.4491 | 0.3326 | 0.3068 |
| KU-AAA637-system2 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.4839 | 0.6129 | 0.5430 | 0.4750 | 0.3535 | 0.3268 |
| KU-AAA637-system3 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.4839 | 0.6452 | 0.5495 | 0.4491 | 0.3118 | 0.2846 |
| KU-AAA637-system4 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.5161 | 0.6452 | 0.5656 | 0.4819 | 0.3660 | 0.3537 |
| KU-AAA637-system5 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.4516 | 0.5806 | 0.5108 | 0.5083 | 0.4146 | 0.3669 |
| MQ-1 | 0.7083 | 0.8293 | - | 0.4146 | - | - | - | - | - | - |
| MQ-2 | 0.7083 | 0.8293 | - | 0.4146 | - | - | - | - | - | - |
| UDEL-LAB1 | 0.9583 | 0.9697 | 0.9333 | 0.9515 | 0.4839 | 0.6129 | 0.5387 | 0.5799 | 0.5017 | 0.4950 |
| UDEL-LAB2 | 0.9583 | 0.9697 | 0.9333 | 0.9515 | 0.4839 | 0.6129 | 0.5484 | 0.5834 | 0.5844 | 0.5386 |
| UDEL-LAB3 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5161 | 0.6129 | 0.5484 | 0.5584 | 0.4438 | 0.4501 |
| UDEL-LAB4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5484 | 0.6129 | 0.5613 | 0.6162 | 0.4753 | 0.4752 |
| NCU-1 | 0.8333 | 0.8947 | 0.6000 | 0.7474 | 0.3548 | 0.3548 | 0.3548 | 0.4500 | 0.2801 | 0.3190 |
| UDEL-LAB5 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5161 | 0.5806 | 0.5484 | 0.6132 | 0.4426 | 0.4434 |
| NCU-IISR/AS-GIS-1 | 0.8333 | 0.8947 | 0.6000 | 0.7474 | 0.3548 | 0.3548 | 0.3548 | 0.4500 | 0.2801 | 0.3190 |
| NCU-IISR/AS-GIS-2 | 0.8333 | 0.8947 | 0.6000 | 0.7474 | 0.3548 | 0.3548 | 0.3548 | 0.4500 | 0.2801 | 0.3190 |
| simple truncation | 0.7083 | 0.8293 | - | 0.4146 | - | - | - | - | - | - |
| Ir_sys1 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4839 | 0.6452 | 0.5495 | 0.4444 | 0.2410 | 0.2747 |
| NCU-IISR-AS-GIS-4 | 0.8333 | 0.8947 | 0.6000 | 0.7474 | 0.3226 | 0.3226 | 0.3226 | 0.3778 | 0.3122 | 0.3224 |
| NCU-IISR-AS-GIS-5 | 0.8333 | 0.8947 | 0.6000 | 0.7474 | 0.3226 | 0.3226 | 0.3226 | 0.3778 | 0.3053 | 0.3200 |
| Ir_sys2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4516 | 0.5161 | 0.4839 | 0.3889 | 0.2847 | 0.2718 |
| Ir_sys3 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.5161 | 0.6774 | 0.5806 | 0.4583 | 0.3243 | 0.3314 |
| Ir_sys4 | 0.9583 | 0.9714 | 0.9231 | 0.9473 | 0.4516 | 0.5806 | 0.5161 | 0.5083 | 0.2826 | 0.2841 |
| lalala | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5806 | 0.6452 | 0.5995 | 0.4089 | 0.4507 | 0.3835 |
| BioASQ_Baseline | 0.2917 | 0.1905 | 0.3704 | 0.2804 | 0.1613 | 0.3226 | 0.2177 | 0.2163 | 0.4035 | 0.2582 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| AUEB-System1 | - | - | - | - | - | - | - | - |
| AUEB-System2 | - | - | - | - | - | - | - | - |
| AUEB-System3 | - | - | - | - | - | - | - | - |
| AUEB-System4 | - | - | - | - | - | - | - | - |
| AUEB-System5 | - | - | - | - | - | - | - | - |
| extractive | 0.1809 | 0.1414 | 0.1811 | 0.1386 | 1.14 | 1.24 | 1.12 | 1.23 |
| Fleming-3 | - | - | - | - | 1.10 | 0.90 | 1.11 | 1.33 |
| bio-answerfinder | 0.5208 | 0.3812 | 0.5183 | 0.3703 | 4.37 | 4.44 | 4.37 | 4.42 |
| bio-answerfinder-2 | 0.4459 | 0.4249 | 0.4411 | 0.4142 | 4.63 | 4.36 | 4.61 | 4.98 |
| bioinfo-0 | - | - | - | - | - | - | - | - |
| bioinfo-1 | - | - | - | - | - | - | - | - |
| bioinfo-2 | - | - | - | - | - | - | - | - |
| bioinfo-3 | - | - | - | - | - | - | - | - |
| bioinfo-4 | - | - | - | - | - | - | - | - |
| LaRSA | 0.5568 | 0.4127 | 0.5584 | 0.4013 | 4.41 | 4.68 | 4.27 | 4.59 |
| BioASQ-2022_UNCC | - | - | - | - | - | - | - | - |
| KU-AAA637-system1 | - | - | - | - | - | - | - | - |
| KU-AAA637-system2 | - | - | - | - | - | - | - | - |
| KU-AAA637-system3 | - | - | - | - | - | - | - | - |
| KU-AAA637-system4 | - | - | - | - | - | - | - | - |
| KU-AAA637-system5 | - | - | - | - | - | - | - | - |
| MQ-1 | 0.5818 | 0.3613 | 0.5970 | 0.3520 | 4.09 | 4.69 | 4.00 | 4.31 |
| MQ-2 | 0.5825 | 0.3636 | 0.5958 | 0.3536 | 4.08 | 4.69 | 4.03 | 4.30 |
| UDEL-LAB1 | - | - | - | - | 0.34 | 0.34 | 0.34 | 0.34 |
| UDEL-LAB2 | - | - | - | - | 0.34 | 0.34 | 0.34 | 0.34 |
| UDEL-LAB3 | - | - | - | - | 0.34 | 0.34 | 0.34 | 0.34 |
| UDEL-LAB4 | - | - | - | - | 0.34 | 0.34 | 0.34 | 0.34 |
| NCU-1 | 0.4351 | 0.4301 | 0.4367 | 0.4213 | 4.58 | 4.44 | 4.69 | 5.00 |
| UDEL-LAB5 | - | - | - | - | 0.34 | 0.34 | 0.34 | 0.34 |
| NCU-IISR/AS-GIS-1 | 0.2842 | 0.2747 | 0.2868 | 0.2706 | 4.43 | 4.04 | 4.13 | 4.96 |
| NCU-IISR/AS-GIS-2 | 0.4375 | 0.4419 | 0.4349 | 0.4355 | 4.62 | 4.49 | 4.70 | 5.00 |
| simple truncation | - | - | - | - | - | - | - | - |
| Ir_sys1 | 0.5648 | 0.3540 | 0.5762 | 0.3449 | 4.21 | 4.70 | 4.01 | 4.34 |
| NCU-IISR-AS-GIS-4 | 0.4351 | 0.4301 | 0.4367 | 0.4213 | 4.58 | 4.44 | 4.69 | 5.00 |
| NCU-IISR-AS-GIS-5 | 0.4351 | 0.4301 | 0.4367 | 0.4213 | 4.58 | 4.44 | 4.69 | 5.00 |
| Ir_sys2 | 0.1315 | 0.1439 | 0.1406 | 0.1501 | 4.62 | 3.63 | 4.32 | 4.86 |
| Ir_sys3 | 0.2489 | 0.2124 | 0.2506 | 0.2125 | 4.22 | 3.46 | 3.77 | 4.67 |
| Ir_sys4 | 0.5459 | 0.3464 | 0.5630 | 0.3390 | 4.12 | 4.63 | 3.97 | 4.31 |
| lalala | 0.5531 | 0.3467 | 0.5658 | 0.3388 | 4.14 | 4.60 | 3.99 | 4.29 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 5
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| simple truncation | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | - | - | - |
| bio-answerfinder | 0.7857 | 0.8235 | 0.7273 | 0.7754 | 0.3448 | 0.4138 | 0.3736 | 0.3438 | 0.3290 | 0.3248 |
| bio-answerfinder-2 | 0.7857 | 0.8235 | 0.7273 | 0.7754 | 0.3448 | 0.4138 | 0.3736 | 0.3438 | 0.3290 | 0.3248 |
| AUEB-System1 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | 0.0278 | 0.0278 | 0.0278 |
| AUEB-System2 | 0.5357 | 0.6977 | - | 0.3488 | 0.3103 | 0.3448 | 0.3218 | 0.1222 | 0.1012 | 0.0891 |
| AUEB-System3 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | 0.0278 | 0.0278 | 0.0278 |
| AUEB-System4 | 0.5357 | 0.6977 | - | 0.3488 | 0.2414 | 0.3793 | 0.3000 | 0.1037 | 0.0734 | 0.0714 |
| AUEB-System5 | 0.5357 | 0.6977 | - | 0.3488 | 0.3103 | 0.4138 | 0.3534 | 0.0870 | 0.1012 | 0.0847 |
| bioinfo-0 | 0.7857 | 0.7692 | 0.8000 | 0.7846 | - | - | - | - | - | - |
| bioinfo-1 | 0.7500 | 0.7586 | 0.7407 | 0.7497 | - | - | - | - | - | - |
| bioinfo-2 | 0.8214 | 0.8148 | 0.8276 | 0.8212 | - | - | - | - | - | - |
| bioinfo-3 | 0.8214 | 0.8276 | 0.8148 | 0.8212 | - | - | - | - | - | - |
| bioinfo-4 | 0.7857 | 0.7692 | 0.8000 | 0.7846 | - | - | - | - | - | - |
| LaRSA | 0.7500 | 0.8000 | 0.6667 | 0.7333 | 0.4138 | 0.5517 | 0.4626 | 0.4486 | 0.4188 | 0.4191 |
| Fleming-4 | 0.6429 | 0.7500 | 0.3750 | 0.5625 | - | - | - | - | - | - |
| UDEL-LAB1 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3448 | 0.5172 | 0.4149 | 0.6082 | 0.5921 | 0.5794 |
| UDEL-LAB2 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3103 | 0.5172 | 0.3833 | 0.5642 | 0.6424 | 0.5860 |
| UDEL-LAB3 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.4483 | 0.5862 | 0.5000 | 0.5790 | 0.6043 | 0.5793 |
| UDEL-LAB4 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3103 | 0.5862 | 0.4190 | 0.6120 | 0.6427 | 0.6123 |
| UDEL-LAB5 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3793 | 0.5172 | 0.4190 | 0.5881 | 0.6515 | 0.6076 |
| BioASQ-2022_UNCC | 0.8929 | 0.9032 | 0.8800 | 0.8916 | 0.1034 | 0.6207 | 0.2253 | - | - | - |
| BioASQ-2022_UNCC1 | 0.8929 | 0.9032 | 0.8800 | 0.8916 | 0.4138 | 0.6207 | 0.4868 | - | - | - |
| MQ-1 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | - | - | - |
| MQ-2 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | - | - | - |
| BioASQ-2022_UNCC2 | 0.8929 | 0.9032 | 0.8800 | 0.8916 | 0.4138 | 0.6207 | 0.4868 | - | - | - |
| BioASQ-2022_UNCC3 | 0.8929 | 0.9032 | 0.8800 | 0.8916 | 0.4138 | 0.6207 | 0.4868 | - | - | - |
| KU-AAA637-system1 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3448 | 0.4828 | 0.3994 | 0.5386 | 0.5095 | 0.4885 |
| KU-AAA637-system2 | 0.9286 | 0.9375 | 0.9167 | 0.9271 | 0.3448 | 0.4828 | 0.4052 | 0.5858 | 0.5373 | 0.5285 |
| KU-AAA637-system3 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3448 | 0.5172 | 0.4080 | 0.5283 | 0.5049 | 0.5054 |
| KU-AAA637-system4 | 0.8571 | 0.8824 | 0.8182 | 0.8503 | 0.3793 | 0.5172 | 0.4264 | 0.5361 | 0.5373 | 0.4999 |
| KU-AAA637-system5 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.3448 | 0.5172 | 0.4109 | 0.5312 | 0.5595 | 0.5085 |
| NCU-IISR/AS-GIS-1 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | - | - | - |
| NCU-1 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | - | - | - |
| NCU-IISR/AS-GIS-2 | 0.5357 | 0.6977 | - | 0.3488 | - | - | - | - | - | - |
| NCU-IISR-AS-GIS-4 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.4828 | 0.5862 | 0.5259 | 0.6500 | 0.5114 | 0.5517 |
| NCU-IISR-AS-GIS-5 | 0.8929 | 0.9091 | 0.8696 | 0.8893 | 0.4828 | 0.5862 | 0.5259 | 0.6500 | 0.5114 | 0.5517 |
| Ir_sys1 | 0.9286 | 0.9333 | 0.9231 | 0.9282 | 0.3793 | 0.5517 | 0.4540 | 0.6799 | 0.4716 | 0.5214 |
| Ir_sys2 | 0.9286 | 0.9333 | 0.9231 | 0.9282 | 0.3448 | 0.5862 | 0.4494 | 0.6204 | 0.4660 | 0.5099 |
| Ir_sys3 | 0.8929 | 0.9032 | 0.8800 | 0.8916 | 0.4828 | 0.5862 | 0.5098 | 0.5852 | 0.4114 | 0.4613 |
| Ir_sys4 | 0.7857 | 0.8235 | 0.7273 | 0.7754 | 0.3793 | 0.4483 | 0.4138 | 0.6058 | 0.4354 | 0.4692 |
| lalala | 0.9286 | 0.9333 | 0.9231 | 0.9282 | 0.3793 | 0.5517 | 0.4494 | 0.5209 | 0.5840 | 0.5340 |
| BioASQ_Baseline | 0.4643 | 0.2857 | 0.5714 | 0.4286 | 0.0345 | 0.1034 | 0.0632 | 0.2125 | 0.4801 | 0.2572 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| simple truncation | 0.4725 | 0.3202 | 0.4722 | 0.3091 | 4.23 | 4.71 | 3.92 | 4.53 |
| bio-answerfinder | 0.4367 | 0.3440 | 0.4369 | 0.3377 | 4.42 | 4.62 | 4.29 | 4.69 |
| bio-answerfinder-2 | 0.3924 | 0.3537 | 0.3926 | 0.3485 | 4.47 | 4.57 | 4.38 | 4.91 |
| AUEB-System1 | - | - | - | - | - | - | - | - |
| AUEB-System2 | - | - | - | - | - | - | - | - |
| AUEB-System3 | - | - | - | - | - | - | - | - |
| AUEB-System4 | - | - | - | - | - | - | - | - |
| AUEB-System5 | - | - | - | - | - | - | - | - |
| bioinfo-0 | - | - | - | - | - | - | - | - |
| bioinfo-1 | - | - | - | - | - | - | - | - |
| bioinfo-2 | - | - | - | - | - | - | - | - |
| bioinfo-3 | - | - | - | - | - | - | - | - |
| bioinfo-4 | - | - | - | - | - | - | - | - |
| LaRSA | 0.5098 | 0.3646 | 0.5156 | 0.3572 | 4.56 | 4.78 | 4.14 | 4.60 |
| Fleming-4 | - | - | - | - | 1.46 | 0.92 | 1.16 | 1.56 |
| UDEL-LAB1 | - | - | - | - | 0.32 | 0.32 | 0.32 | 0.32 |
| UDEL-LAB2 | - | - | - | - | 0.32 | 0.32 | 0.32 | 0.32 |
| UDEL-LAB3 | - | - | - | - | 0.32 | 0.32 | 0.32 | 0.32 |
| UDEL-LAB4 | - | - | - | - | 0.32 | 0.32 | 0.32 | 0.32 |
| UDEL-LAB5 | - | - | - | - | 0.32 | 0.32 | 0.32 | 0.32 |
| BioASQ-2022_UNCC | - | - | - | - | - | - | - | - |
| BioASQ-2022_UNCC1 | - | - | - | - | - | - | - | - |
| MQ-1 | 0.5446 | 0.3558 | 0.5511 | 0.3478 | 4.34 | 4.80 | 4.07 | 4.42 |
| MQ-2 | 0.5416 | 0.3579 | 0.5460 | 0.3489 | 4.34 | 4.76 | 4.06 | 4.46 |
| BioASQ-2022_UNCC2 | - | - | - | - | - | - | - | - |
| BioASQ-2022_UNCC3 | - | - | - | - | - | - | - | - |
| KU-AAA637-system1 | - | - | - | - | - | - | - | - |
| KU-AAA637-system2 | - | - | - | - | - | - | - | - |
| KU-AAA637-system3 | - | - | - | - | - | - | - | - |
| KU-AAA637-system4 | - | - | - | - | - | - | - | - |
| KU-AAA637-system5 | - | - | - | - | - | - | - | - |
| NCU-IISR/AS-GIS-1 | 0.4154 | 0.3904 | 0.4133 | 0.3836 | 4.61 | 4.56 | 4.54 | 4.98 |
| NCU-1 | 0.4050 | 0.3838 | 0.4030 | 0.3800 | 4.64 | 4.54 | 4.56 | 4.99 |
| NCU-IISR/AS-GIS-2 | 0.4367 | 0.4118 | 0.4336 | 0.4049 | 4.64 | 4.61 | 4.60 | 4.99 |
| NCU-IISR-AS-GIS-4 | - | - | - | - | - | - | - | - |
| NCU-IISR-AS-GIS-5 | - | - | - | - | - | - | - | - |
| Ir_sys1 | 0.5523 | 0.3555 | 0.5502 | 0.3426 | 4.34 | 4.76 | 4.01 | 4.36 |
| Ir_sys2 | 0.6242 | 0.3262 | 0.6169 | 0.3101 | 4.10 | 4.68 | 3.91 | 3.91 |
| Ir_sys3 | 0.2425 | 0.2019 | 0.2499 | 0.2069 | 4.23 | 3.57 | 3.40 | 4.56 |
| Ir_sys4 | 0.4891 | 0.3198 | 0.4971 | 0.3134 | 4.40 | 4.73 | 4.08 | 4.53 |
| lalala | 0.5307 | 0.3381 | 0.5307 | 0.3260 | 4.29 | 4.71 | 3.96 | 4.38 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 6
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| Fleming-2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | - | - | - | - | - | - |
| NCU-IISR-AS-GIS-5 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.1667 | 0.3333 | 0.2222 | 0.6933 | 0.3108 | 0.3458 |
| AUEB-System1 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| AUEB-System2 | 0.5000 | 0.6667 | - | 0.3333 | 0.3333 | 0.5000 | 0.3889 | 0.3022 | 0.1953 | 0.2156 |
| AUEB-System4 | 0.5000 | 0.6667 | - | 0.3333 | 0.1667 | 0.5000 | 0.3056 | 0.3467 | 0.1491 | 0.1912 |
| AUEB-System5 | 0.5000 | 0.6667 | - | 0.3333 | 0.3333 | 0.5000 | 0.3889 | 0.2822 | 0.2285 | 0.2275 |
| AUEB-System3 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| LaRSA | 0.6667 | 0.7500 | 0.5000 | 0.6250 | 0.3333 | 0.3333 | 0.3333 | 0.6444 | 0.3985 | 0.4271 |
| bio-answerfinder | 0.6667 | 0.7500 | 0.5000 | 0.6250 | 0.1667 | 0.5000 | 0.3333 | 0.7031 | 0.3893 | 0.4405 |
| bio-answerfinder-2 | 0.6667 | 0.7500 | 0.5000 | 0.6250 | 0.1667 | 0.5000 | 0.3333 | 0.7031 | 0.3893 | 0.4405 |
| bioinfo-0 | 0.5000 | 0.5714 | 0.4000 | 0.4857 | - | - | - | - | - | - |
| bioinfo-1 | 0.5000 | 0.5714 | 0.4000 | 0.4857 | - | - | - | - | - | - |
| bioinfo-2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | - | - | - | - | - | - |
| bioinfo-3 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| bioinfo-4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | - | - | - | - | - | - |
| BioASQ-2022_UNCC | 0.6667 | 0.6667 | 0.6667 | 0.6667 | 0.3333 | 0.5000 | 0.4167 | - | - | - |
| MQ-1 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| MQ-2 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| NCU-IISR/AS-GIS-3 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.1667 | 0.3333 | 0.2222 | 0.6933 | 0.3108 | 0.3458 |
| NCU-IISR/AS-GIS-2 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.1667 | 0.3333 | 0.2222 | 0.6937 | 0.3108 | 0.3487 |
| NCU-IISR/AS-GIS-1 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| NCU-1 | 0.5000 | 0.6667 | - | 0.3333 | - | - | - | - | - | - |
| KU-AAA637-system1 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.5000 | 0.3750 | 0.5225 | 0.3361 | 0.3426 |
| KU-AAA637-system2 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.5000 | 0.4167 | 0.5744 | 0.3799 | 0.3917 |
| KU-AAA637-system3 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3333 | 0.3333 | 0.3333 | 0.5137 | 0.3572 | 0.3628 |
| KU-AAA637-system4 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.5000 | 0.3889 | 0.5507 | 0.3528 | 0.3607 |
| KU-AAA637-system5 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.5000 | 0.4167 | 0.4687 | 0.3755 | 0.3690 |
| Ir_sys1 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.6667 | 0.4306 | 0.6811 | 0.3851 | 0.4133 |
| Ir_sys3 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.1667 | 0.5000 | 0.2917 | 0.5852 | 0.3380 | 0.3740 |
| Ir_sys4 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.5000 | 0.4167 | 0.7000 | 0.2971 | 0.3458 |
| lalala | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.6667 | 0.4222 | 0.5367 | 0.5018 | 0.4306 |
| Ir_sys2 | 0.8333 | 0.8571 | 0.8000 | 0.8286 | 0.3333 | 0.6667 | 0.4722 | 0.5889 | 0.3356 | 0.3548 |
| BioASQ_Baseline | 0.5000 | - | 0.6667 | 0.3333 | 0.3333 | 0.5000 | 0.3667 | 0.2600 | 0.2992 | 0.2470 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| Fleming-2 | - | - | - | - | 0.70 | 0.38 | 0.70 | 0.70 |
| NCU-IISR-AS-GIS-5 | - | - | - | - | - | - | - | - |
| AUEB-System1 | - | - | - | - | - | - | - | - |
| AUEB-System2 | - | - | - | - | - | - | - | - |
| AUEB-System4 | - | - | - | - | - | - | - | - |
| AUEB-System5 | - | - | - | - | - | - | - | - |
| AUEB-System3 | - | - | - | - | - | - | - | - |
| LaRSA | 0.4182 | 0.3686 | 0.4369 | 0.3765 | 3.76 | 3.81 | 4.00 | 4.00 |
| bio-answerfinder | 0.3181 | 0.3103 | 0.3332 | 0.3221 | 4.16 | 3.51 | 4.14 | 4.38 |
| bio-answerfinder-2 | 0.2625 | 0.2848 | 0.2757 | 0.2972 | 4.30 | 3.30 | 4.19 | 4.68 |
| bioinfo-0 | - | - | - | - | - | - | - | - |
| bioinfo-1 | - | - | - | - | - | - | - | - |
| bioinfo-2 | - | - | - | - | - | - | - | - |
| bioinfo-3 | - | - | - | - | - | - | - | - |
| bioinfo-4 | - | - | - | - | - | - | - | - |
| BioASQ-2022_UNCC | - | - | - | - | - | - | - | - |
| MQ-1 | 0.4338 | 0.3540 | 0.4558 | 0.3630 | 3.65 | 3.84 | 3.78 | 3.89 |
| MQ-2 | 0.4382 | 0.3616 | 0.4595 | 0.3707 | 3.65 | 3.84 | 3.76 | 3.81 |
| NCU-IISR/AS-GIS-3 | 0.2322 | 0.2672 | 0.2420 | 0.2772 | 4.14 | 3.05 | 4.08 | 4.76 |
| NCU-IISR/AS-GIS-2 | 0.2322 | 0.2672 | 0.2420 | 0.2772 | 4.14 | 3.05 | 4.08 | 4.76 |
| NCU-IISR/AS-GIS-1 | 0.2296 | 0.2605 | 0.2363 | 0.2690 | 4.24 | 3.05 | 4.11 | 4.84 |
| NCU-1 | 0.2161 | 0.2472 | 0.2223 | 0.2554 | 4.16 | 2.97 | 3.97 | 4.73 |
| KU-AAA637-system1 | - | - | - | - | - | - | - | - |
| KU-AAA637-system2 | - | - | - | - | - | - | - | - |
| KU-AAA637-system3 | - | - | - | - | - | - | - | - |
| KU-AAA637-system4 | - | - | - | - | - | - | - | - |
| KU-AAA637-system5 | - | - | - | - | - | - | - | - |
| Ir_sys1 | 0.4586 | 0.3754 | 0.4877 | 0.3874 | 3.73 | 3.89 | 3.89 | 3.86 |
| Ir_sys3 | 0.1256 | 0.1286 | 0.1320 | 0.1350 | 3.27 | 1.89 | 2.46 | 3.41 |
| Ir_sys4 | 0.4163 | 0.3430 | 0.4425 | 0.3534 | 3.65 | 3.70 | 3.81 | 3.78 |
| lalala | 0.4458 | 0.3641 | 0.4695 | 0.3717 | 3.59 | 3.78 | 3.70 | 3.78 |
| Ir_sys2 | 0.4023 | 0.2834 | 0.4251 | 0.2891 | 2.86 | 3.57 | 3.38 | 3.35 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |