BioASQ Participants Area
Task Synergy - version 2023: Test Results
Test round 1
Documents
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.3113 |
0.2998 |
0.2505 |
0.2720 |
0.0097 |
bio-answerfinder-2 |
0.3787 |
0.4317 |
0.3602 |
0.3825 |
0.0484 |
Fleming-3 |
0.2922 |
0.3145 |
0.2685 |
0.2611 |
0.0113 |
Fleming-4 |
0.2643 |
0.3151 |
0.2563 |
0.2379 |
0.0121 |
Just for test |
0.4003 |
0.6804 |
0.4347 |
0.5773 |
0.4851 |
dmiip1 |
0.3959 |
0.7008 |
0.4384 |
0.5870 |
0.4955 |
Snippets
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.1709 |
0.2763 |
0.1844 |
0.3123 |
0.0043 |
bio-answerfinder-2 |
0.2239 |
0.3103 |
0.2299 |
0.2790 |
0.0066 |
Fleming-3 |
- |
- |
- |
- |
- |
Fleming-4 |
- |
- |
- |
- |
- |
Just for test |
0.3571 |
0.7758 |
0.4382 |
0.8498 |
0.5477 |
dmiip1 |
0.3571 |
0.7758 |
0.4382 |
0.8498 |
0.5477 |
Exact Answers
|
Yes/No |
Factoid |
List |
System |
Accuracy |
F1 Yes |
F1 No |
Macro F1 |
Strict Acc. |
Lenient Acc. |
MRR |
Mean Prec. |
Recall |
F-Measure |
bio-answerfinder |
- |
- |
- |
- |
- | - | - |
- | - | - |
bio-answerfinder-2 |
- |
- |
- |
- |
- | - | - |
- | - | - |
Fleming-3 |
- |
- |
- |
- |
- | - | - |
- | - | - |
Fleming-4 |
- |
- |
- |
- |
- | - | - |
- | - | - |
Just for test |
- |
- |
- |
- |
- | - | - |
- | - | - |
dmiip1 |
- |
- |
- |
- |
- | - | - |
- | - | - |
Ideal Answers
|
Automatic scores (Rouge - R) |
Manual scores |
System |
R-2 (Rec) |
R-2 (F1) |
R-SU4 (Rec) |
R-SU4 (F1) |
Readability |
Recall |
Precision |
Repetition |
bio-answerfinder |
- |
- |
- |
- |
- |
- |
- |
- |
bio-answerfinder-2 |
- |
- |
- |
- |
- |
- |
- |
- |
Fleming-3 |
- |
- |
- |
- |
- |
- |
- |
- |
Fleming-4 |
- |
- |
- |
- |
- |
- |
- |
- |
Just for test |
- |
- |
- |
- |
- |
- |
- |
- |
dmiip1 |
- |
- |
- |
- |
- |
- |
- |
- |
Test round 2
Documents
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.2846 |
0.1209 |
0.1461 |
0.1674 |
0.0052 |
bio-answerfinder-2 |
0.2965 |
0.1688 |
0.1865 |
0.2168 |
0.0102 |
dmiip1 |
0.4161 |
0.3806 |
0.3276 |
0.3724 |
0.2050 |
dmiip3 |
0.4258 |
0.3966 |
0.3369 |
0.4040 |
0.1240 |
dmiip4 |
0.3548 |
0.3280 |
0.2703 |
0.3237 |
0.1410 |
dmiip5 |
0.4161 |
0.3684 |
0.3221 |
0.3942 |
0.1471 |
Fleming-3 |
- |
- |
- |
- |
- |
dmiip2 |
0.3516 |
0.2322 |
0.2443 |
0.3063 |
0.0289 |
Snippets
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.1388 |
0.1066 |
0.0981 |
0.1302 |
0.0023 |
bio-answerfinder-2 |
0.2012 |
0.1572 |
0.1422 |
0.1752 |
0.0065 |
dmiip1 |
0.3648 |
0.3561 |
0.3026 |
0.4601 |
0.0807 |
dmiip3 |
0.3471 |
0.3690 |
0.2944 |
0.4678 |
0.0656 |
dmiip4 |
0.3009 |
0.2995 |
0.2477 |
0.3698 |
0.0481 |
dmiip5 |
0.3121 |
0.2827 |
0.2517 |
0.4146 |
0.0473 |
Fleming-3 |
- |
- |
- |
- |
- |
dmiip2 |
0.3111 |
0.2213 |
0.2297 |
0.3450 |
0.0157 |
Exact Answers
|
Yes/No |
Factoid |
List |
System |
Accuracy |
F1 Yes |
F1 No |
Macro F1 |
Strict Acc. |
Lenient Acc. |
MRR |
Mean Prec. |
Recall |
F-Measure |
bio-answerfinder |
0.2500 |
0.4000 |
- |
0.2000 |
0.3333 |
0.3333 |
0.3333 |
- | - | - |
bio-answerfinder-2 |
0.5000 |
0.6667 |
- |
0.3333 |
0.3333 |
0.3333 |
0.3333 |
- | - | - |
dmiip1 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
0.2143 |
0.5000 |
0.3000 |
dmiip3 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
0.6667 |
1.0000 |
0.7778 |
0.1667 |
0.1667 |
0.1667 |
dmiip4 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
0.3333 |
1.0000 |
0.6111 |
0.2500 |
0.1667 |
0.2000 |
dmiip5 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
0.3333 |
0.3333 |
0.3333 |
0.0500 |
0.1667 |
0.0769 |
Fleming-3 |
0.7500 |
0.8000 |
0.6667 |
0.7333 |
- | - | - |
- | - | - |
dmiip2 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
1.0000 |
0.2143 |
0.5000 |
0.3000 |
Ideal Answers
|
Automatic scores (Rouge - R) |
Manual scores |
System |
R-2 (Rec) |
R-2 (F1) |
R-SU4 (Rec) |
R-SU4 (F1) |
Readability |
Recall |
Precision |
Repetition |
bio-answerfinder |
0.0435 |
0.0455 |
0.0649 |
0.0702 |
2.57 |
1.36 |
2.21 |
3.71 |
bio-answerfinder-2 |
0.0433 |
0.0430 |
0.0699 |
0.0715 |
3.07 |
1.64 |
2.07 |
4.00 |
dmiip1 |
0.4993 |
0.4223 |
0.5139 |
0.4337 |
3.14 |
3.36 |
2.86 |
2.71 |
dmiip3 |
0.4586 |
0.3779 |
0.4732 |
0.3892 |
3.43 |
3.71 |
3.00 |
3.00 |
dmiip4 |
0.5256 |
0.4402 |
0.5359 |
0.4462 |
3.07 |
3.36 |
2.86 |
2.57 |
dmiip5 |
0.4779 |
0.4171 |
0.4838 |
0.4206 |
3.79 |
3.71 |
3.50 |
3.29 |
Fleming-3 |
- |
- |
- |
- |
1.21 |
0.57 |
0.86 |
1.29 |
dmiip2 |
0.4993 |
0.4223 |
0.5139 |
0.4337 |
3.14 |
3.36 |
2.86 |
2.71 |
Test round 3
Documents
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.1485 |
0.1464 |
0.1236 |
0.1168 |
0.0016 |
bio-answerfinder-2 |
0.1366 |
0.1345 |
0.1123 |
0.1103 |
0.0016 |
Fleming-4 |
- |
- |
- |
- |
- |
dmiip4 |
0.3639 |
0.4084 |
0.3319 |
0.3637 |
0.1008 |
dmiip3 |
0.2583 |
0.2988 |
0.2397 |
0.2387 |
0.0235 |
dmiip2 |
0.3306 |
0.4195 |
0.3090 |
0.3474 |
0.1984 |
dmiip1 |
0.2611 |
0.3279 |
0.2462 |
0.2421 |
0.0330 |
dmiip5 |
0.2139 |
0.2584 |
0.1986 |
0.1608 |
0.0140 |
CA-1 |
- |
- |
- |
- |
- |
Snippets
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.0830 |
0.0734 |
0.0679 |
0.0603 |
0.0003 |
bio-answerfinder-2 |
0.0744 |
0.0636 |
0.0575 |
0.0498 |
0.0004 |
Fleming-4 |
- |
- |
- |
- |
- |
dmiip4 |
0.2298 |
0.2454 |
0.2039 |
0.3360 |
0.0159 |
dmiip3 |
0.1921 |
0.1825 |
0.1637 |
0.2024 |
0.0048 |
dmiip2 |
0.2170 |
0.2473 |
0.2004 |
0.3388 |
0.0249 |
dmiip1 |
0.1845 |
0.1897 |
0.1563 |
0.2362 |
0.0056 |
dmiip5 |
0.1381 |
0.1695 |
0.1234 |
0.1616 |
0.0024 |
CA-1 |
0.0459 |
0.0231 |
0.0282 |
0.0187 |
0.0000 |
Exact Answers
|
Yes/No |
Factoid |
List |
System |
Accuracy |
F1 Yes |
F1 No |
Macro F1 |
Strict Acc. |
Lenient Acc. |
MRR |
Mean Prec. |
Recall |
F-Measure |
bio-answerfinder |
0.5714 |
0.7273 |
- |
0.3636 |
0.4286 |
0.5714 |
0.5000 |
0.1486 |
0.0803 |
0.1019 |
bio-answerfinder-2 |
0.5714 |
0.7273 |
- |
0.3636 |
0.2857 |
0.4286 |
0.3214 |
0.1000 |
0.0532 |
0.0644 |
Fleming-4 |
0.7143 |
0.7500 |
0.6667 |
0.7083 |
- | - | - |
- | - | - |
dmiip4 |
0.4286 |
0.3333 |
0.5000 |
0.4167 |
0.4286 |
1.0000 |
0.6548 |
0.3641 |
0.2834 |
0.2559 |
dmiip3 |
0.8571 |
0.8571 |
0.8571 |
0.8571 |
0.5714 |
1.0000 |
0.7143 |
0.3555 |
0.2929 |
0.2467 |
dmiip2 |
0.8571 |
0.8571 |
0.8571 |
0.8571 |
0.7143 |
0.8571 |
0.7500 |
0.3554 |
0.3308 |
0.2871 |
dmiip1 |
0.8571 |
0.8571 |
0.8571 |
0.8571 |
0.7143 |
0.8571 |
0.7619 |
0.3571 |
0.3490 |
0.2781 |
dmiip5 |
0.7143 |
0.7500 |
0.6667 |
0.7083 |
0.5714 |
0.7143 |
0.6190 |
0.0667 |
0.2318 |
0.0796 |
CA-1 |
0.5714 |
0.7273 |
- |
0.3636 |
0.1429 |
0.2857 |
0.1905 |
0.0083 |
0.0044 |
0.0057 |
Ideal Answers
|
Automatic scores (Rouge - R) |
Manual scores |
System |
R-2 (Rec) |
R-2 (F1) |
R-SU4 (Rec) |
R-SU4 (F1) |
Readability |
Recall |
Precision |
Repetition |
bio-answerfinder |
0.1552 |
0.1587 |
0.1627 |
0.1697 |
2.61 |
2.03 |
2.23 |
2.68 |
bio-answerfinder-2 |
0.1682 |
0.1661 |
0.1772 |
0.1788 |
2.39 |
1.84 |
1.77 |
2.39 |
Fleming-4 |
- |
- |
- |
- |
0.23 |
0.13 |
0.13 |
0.13 |
dmiip4 |
0.4954 |
0.4168 |
0.5068 |
0.4216 |
2.77 |
2.81 |
2.58 |
2.65 |
dmiip3 |
0.4643 |
0.4028 |
0.4743 |
0.4085 |
2.74 |
2.77 |
2.58 |
2.71 |
dmiip2 |
0.4829 |
0.4095 |
0.4960 |
0.4163 |
2.74 |
2.81 |
2.55 |
2.58 |
dmiip1 |
0.4829 |
0.4095 |
0.4960 |
0.4163 |
2.74 |
2.81 |
2.55 |
2.58 |
dmiip5 |
0.2773 |
0.2485 |
0.2937 |
0.2597 |
2.87 |
2.61 |
2.48 |
2.77 |
CA-1 |
- |
- |
- |
- |
- |
- |
- |
- |
Test round 4
Documents
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.1575 |
0.1390 |
0.1244 |
0.1468 |
0.0021 |
bio-answerfinder-2 |
0.1575 |
0.1247 |
0.1232 |
0.1236 |
0.0011 |
Fleming-4 |
0.0893 |
0.0180 |
0.0285 |
0.0212 |
0.0000 |
dmiip1 |
0.2256 |
0.2668 |
0.2034 |
0.2001 |
0.0151 |
dmiip2 |
0.3026 |
0.3772 |
0.2803 |
0.2791 |
0.0572 |
dmiip3 |
0.1974 |
0.2116 |
0.1741 |
0.1708 |
0.0080 |
dmiip4 |
0.3000 |
0.3714 |
0.2760 |
0.2788 |
0.0512 |
dmiip5 |
0.2667 |
0.3230 |
0.2466 |
0.2525 |
0.0578 |
Snippets
System |
Mean precision |
Recall |
F-Measure |
MAP |
GMAP |
bio-answerfinder |
0.0994 |
0.0823 |
0.0794 |
0.0911 |
0.0008 |
bio-answerfinder-2 |
0.0937 |
0.0882 |
0.0735 |
0.1177 |
0.0003 |
Fleming-4 |
- |
- |
- |
- |
- |
dmiip1 |
0.1664 |
0.1698 |
0.1365 |
0.1990 |
0.0031 |
dmiip2 |
0.1800 |
0.2273 |
0.1588 |
0.2290 |
0.0071 |
dmiip3 |
0.1312 |
0.1394 |
0.1071 |
0.1692 |
0.0015 |
dmiip4 |
0.1775 |
0.2281 |
0.1539 |
0.2296 |
0.0052 |
dmiip5 |
0.2019 |
0.2029 |
0.1677 |
0.3015 |
0.0051 |
Exact Answers
|
Yes/No |
Factoid |
List |
System |
Accuracy |
F1 Yes |
F1 No |
Macro F1 |
Strict Acc. |
Lenient Acc. |
MRR |
Mean Prec. |
Recall |
F-Measure |
bio-answerfinder |
0.6667 |
0.8000 |
- |
0.4000 |
0.2500 |
0.2500 |
0.2500 |
0.1250 |
0.0617 |
0.0787 |
bio-answerfinder-2 |
0.6667 |
0.8000 |
- |
0.4000 |
0.2500 |
0.3750 |
0.2917 |
0.1972 |
0.0894 |
0.1100 |
Fleming-4 |
0.7778 |
0.8333 |
0.6667 |
0.7500 |
0.2500 |
0.2500 |
0.2500 |
0.5000 |
0.1423 |
0.1852 |
dmiip1 |
0.5556 |
0.5000 |
0.6000 |
0.5500 |
0.8750 |
1.0000 |
0.9167 |
0.3687 |
0.3628 |
0.2917 |
dmiip2 |
0.5556 |
0.5000 |
0.6000 |
0.5500 |
0.8750 |
1.0000 |
0.9000 |
0.3988 |
0.3947 |
0.3247 |
dmiip3 |
0.6667 |
0.7273 |
0.5714 |
0.6494 |
0.6250 |
1.0000 |
0.7813 |
0.4206 |
0.3703 |
0.3032 |
dmiip4 |
0.5556 |
0.5000 |
0.6000 |
0.5500 |
0.6250 |
1.0000 |
0.7917 |
0.4076 |
0.3248 |
0.2879 |
dmiip5 |
0.6667 |
0.7273 |
0.5714 |
0.6494 |
0.2500 |
0.3750 |
0.3125 |
0.5083 |
0.2556 |
0.3288 |
Ideal Answers
|
Automatic scores (Rouge - R) |
Manual scores |
System |
R-2 (Rec) |
R-2 (F1) |
R-SU4 (Rec) |
R-SU4 (F1) |
Readability |
Recall |
Precision |
Repetition |
bio-answerfinder |
0.6202 |
0.4575 |
0.6267 |
0.4623 |
3.59 |
2.95 |
2.78 |
3.59 |
bio-answerfinder-2 |
0.5969 |
0.4370 |
0.6019 |
0.4400 |
3.32 |
2.78 |
2.62 |
3.51 |
Fleming-4 |
- |
- |
- |
- |
0.19 |
0.27 |
0.19 |
0.19 |
dmiip1 |
0.4969 |
0.4258 |
0.5098 |
0.4328 |
0.54 |
0.54 |
0.57 |
0.54 |
dmiip2 |
0.4969 |
0.4258 |
0.5098 |
0.4328 |
0.54 |
0.54 |
0.57 |
0.54 |
dmiip3 |
0.4751 |
0.4153 |
0.4855 |
0.4216 |
0.65 |
0.68 |
0.65 |
0.65 |
dmiip4 |
0.5077 |
0.4320 |
0.5192 |
0.4375 |
0.54 |
0.54 |
0.57 |
0.54 |
dmiip5 |
0.3714 |
0.4103 |
0.3710 |
0.4104 |
4.43 |
4.49 |
4.38 |
4.51 |