BioASQ Participants Area
Task 13b: Test Results of Phase B
The test results are presented in separate tables for each type of annotation. The "System Description" of each system is used.The evaluation measures that are used in Task B are presented here .
Warning: For ideal answers, good ROUGE results do not always imply good manual scores.
Test batch 1
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| IISR first submit | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3846 | 0.3846 | 0.3846 | 0.5873 | 0.4714 | 0.4967 |
| IISR 2nd submit | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3462 | 0.4231 | 0.3846 | 0.5923 | 0.5198 | 0.5302 |
| IISR 3rd submit | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4231 | 0.5000 | 0.4615 | 0.5801 | 0.4531 | 0.4903 |
| IISR 4th submit | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4231 | 0.4615 | 0.4423 | 0.6056 | 0.5297 | 0.5438 |
| IISR 5th submit | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.4231 | 0.4615 | 0.4423 | 0.4784 | 0.3164 | 0.3676 |
| UniTor_0 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.4231 | 0.4615 | 0.4423 | 0.4787 | 0.5181 | 0.4632 |
| UniTor_1 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.4231 | 0.4615 | 0.4423 | 0.5184 | 0.5330 | 0.4912 |
| UniTor_2 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.4231 | 0.4615 | 0.4423 | 0.4510 | 0.4679 | 0.4382 |
| UniTor_3 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.4231 | 0.4615 | 0.4423 | 0.4435 | 0.4874 | 0.4471 |
| DB_vector_&_LLM | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.3846 | 0.5000 | 0.4423 | 0.6220 | 0.5359 | 0.5527 |
| google_serach_&_LLM | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.3846 | 0.5000 | 0.4423 | 0.6220 | 0.5359 | 0.5527 |
| UR-IW-1 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3846 | 0.5385 | 0.4423 | 0.4075 | 0.5942 | 0.4419 |
| UR-IW-2 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.4615 | 0.5385 | 0.4936 | 0.4331 | 0.4619 | 0.4257 |
| UR-IW-3 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.4231 | 0.5769 | 0.4821 | 0.4279 | 0.5659 | 0.4610 |
| UR-IW-4 | 0.8235 | 0.8696 | 0.7273 | 0.7984 | 0.4231 | 0.5000 | 0.4615 | 0.5671 | 0.5620 | 0.5325 |
| UR-IW-5 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3846 | 0.4615 | 0.4231 | 0.3945 | 0.5808 | 0.4272 |
| Fleming-1 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5385 | 0.6154 | 0.5769 | 0.5327 | 0.5288 | 0.4962 |
| bioinfo-0 | 0.7059 | 0.8276 | - | 0.4138 | - | - | - | - | - | - |
| bioinfo-1 | 0.7059 | 0.8276 | - | 0.4138 | - | - | - | - | - | - |
| bioinfo-2 | 0.7059 | 0.8276 | - | 0.4138 | - | - | - | - | - | - |
| bioinfo-3 | 0.7059 | 0.8276 | - | 0.4138 | - | - | - | - | - | - |
| bioinfo-4 | 0.7059 | 0.8276 | - | 0.4138 | - | - | - | - | - | - |
| Fleming-2 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.5385 | 0.6154 | 0.5769 | 0.5327 | 0.5288 | 0.4962 |
| Mistral7BIns10shots | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3462 | 0.3462 | 0.3462 | 0.3004 | 0.2084 | 0.2374 |
| vllm agents | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.2692 | 0.2692 | 0.2692 | 0.5473 | 0.4482 | 0.4790 |
| dmiip2024 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.5769 | 0.5128 | 0.6084 | 0.5167 | 0.5357 |
| dmiip2024_2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.4615 | 0.4615 | 0.5116 | 0.5707 | 0.5055 |
| dmiip2024_3 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3846 | 0.4615 | 0.4231 | 0.6126 | 0.4548 | 0.5001 |
| dmiip2024_4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.5000 | 0.4808 | 0.5551 | 0.3668 | 0.4107 |
| config-1 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.5000 | 0.5000 | 0.5000 | 0.5797 | 0.4319 | 0.4744 |
| llama | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.5000 | 0.5769 | 0.5385 | 0.5208 | 0.5092 | 0.4900 |
| dense | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.6154 | 0.5385 | 0.5491 | 0.5353 | 0.5106 |
| config-2 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.4231 | 0.4615 | 0.4423 | 0.5321 | 0.4314 | 0.4504 |
| config-3 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.5000 | 0.5385 | 0.5192 | 0.5203 | 0.4312 | 0.4546 |
| config-4 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.5000 | 0.5000 | 0.5000 | 0.5353 | 0.4576 | 0.4753 |
| config-5 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.5385 | 0.5769 | 0.5577 | 0.5348 | 0.4399 | 0.4655 |
| dmiip2024_1 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4231 | 0.4231 | 0.4231 | 0.6254 | 0.4935 | 0.5285 |
| mistral | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.5385 | 0.5000 | 0.5129 | 0.4258 | 0.4407 |
| Fleming-3 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.5000 | 0.6154 | 0.5513 | 0.5421 | 0.5096 | 0.4954 |
| bious1 | 0.8824 | 0.9167 | 0.8000 | 0.8583 | 0.3846 | 0.4231 | 0.4038 | 0.5516 | 0.4305 | 0.4606 |
| bious2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3846 | 0.4231 | 0.4038 | 0.4767 | 0.4584 | 0.4459 |
| bious3 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5385 | 0.5769 | 0.5577 | 0.4772 | 0.4521 | 0.4490 |
| bious4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3846 | 0.4231 | 0.4038 | 0.4831 | 0.4256 | 0.4327 |
| bious5 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.5000 | 0.4808 | 0.4608 | 0.4441 | 0.4356 |
| kmeans | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.3077 | 0.4615 | 0.3718 | 0.5257 | 0.4758 | 0.4757 |
| simple truncation | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3462 | 0.4615 | 0.3974 | 0.6329 | 0.4936 | 0.5283 |
| similarity measures | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.3462 | 0.4615 | 0.4038 | 0.4534 | 0.5271 | 0.4431 |
| extractive | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.3462 | 0.4231 | 0.3782 | 0.4351 | 0.5477 | 0.4483 |
| deepseek32b-me | 0.2941 | - | 0.4545 | 0.2273 | 0.3462 | 0.3462 | 0.3462 | 0.6215 | 0.4891 | 0.5211 |
| EP-1 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.3077 | 0.5000 | 0.4038 | 0.5435 | 0.4533 | 0.4708 |
| abstractive | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.3462 | 0.5000 | 0.4071 | 0.3984 | 0.6329 | 0.4430 |
| EP-2 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.2308 | 0.3462 | 0.2821 | 0.4558 | 0.2349 | 0.2839 |
| deepseek32b-full | 0.2941 | - | 0.4545 | 0.2273 | 0.3462 | 0.3462 | 0.3462 | 0.5982 | 0.4926 | 0.5152 |
| deepseek32b-f | 0.2941 | - | 0.4545 | 0.2273 | 0.3462 | 0.3462 | 0.3462 | 0.5114 | 0.4002 | 0.4293 |
| GPT4O | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.3462 | 0.3462 | 0.3462 | 0.5569 | 0.4490 | 0.4794 |
| phaseB-4 | 0.2941 | - | 0.4545 | 0.2273 | 0.3462 | 0.3462 | 0.3462 | 0.6433 | 0.4892 | 0.5304 |
| EP-4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5000 | 0.5769 | 0.5385 | 0.5836 | 0.4136 | 0.4678 |
| phaseB-5 | 0.2941 | - | 0.4545 | 0.2273 | 0.3462 | 0.3462 | 0.3462 | 0.5620 | 0.4375 | 0.4687 |
| deepseek-r1:32b | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.3077 | 0.3077 | 0.3077 | 0.4947 | 0.4028 | 0.4242 |
| EP-5 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.4231 | 0.5385 | 0.4808 | 0.5865 | 0.4189 | 0.4685 |
| EP-3 | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.4231 | 0.5385 | 0.4808 | 0.5865 | 0.4189 | 0.4685 |
| 2025-DMIS-KU-1 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.5000 | 0.6154 | 0.5577 | 0.6057 | 0.4929 | 0.5117 |
| 2025-DMIS-KU-4 | 0.8235 | 0.8571 | 0.7692 | 0.8132 | 0.4231 | 0.6154 | 0.5128 | 0.6012 | 0.4657 | 0.4986 |
| 2025-DMIS-KU-5 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.5000 | 0.6154 | 0.5577 | 0.6168 | 0.4008 | 0.4632 |
| 2025-DMIS-KU-3 | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.5000 | 0.6154 | 0.5577 | 0.6025 | 0.5181 | 0.5362 |
| deepseek-r1:14b | 0.8824 | 0.9167 | 0.8000 | 0.8583 | 0.3077 | 0.3077 | 0.3077 | 0.4089 | 0.3015 | 0.3307 |
| 2025-DMIS-KU-2 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.4615 | 0.6154 | 0.5321 | 0.6342 | 0.4920 | 0.5322 |
| using free 7b LLM | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.3462 | 0.3462 | 0.3462 | 0.4343 | 0.2314 | 0.2669 |
| deepseek-r1:8b | 0.9412 | 0.9600 | 0.8889 | 0.9244 | 0.2308 | 0.2308 | 0.2308 | 0.1135 | 0.0745 | 0.0853 |
| lasigeBioTM | 0.9412 | 0.9565 | 0.9091 | 0.9328 | 0.1154 | 0.1154 | 0.1154 | - | - | - |
| gpt 01 mini | 0.8235 | 0.8696 | 0.7273 | 0.7984 | 0.3462 | 0.3462 | 0.3462 | 0.3826 | 0.3143 | 0.3307 |
| BioASQ_Baseline | 0.4706 | 0.4000 | 0.5263 | 0.4632 | 0.1923 | 0.2692 | 0.2212 | 0.2552 | 0.1751 | 0.1921 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| IISR first submit | 0.2012 | 0.2452 | 0.1878 | 0.2303 | 4.16 | 4.14 | 4.25 | 4.35 |
| IISR 2nd submit | 0.2197 | 0.2625 | 0.2036 | 0.2448 | 4.32 | 4.29 | 4.28 | 4.42 |
| IISR 3rd submit | 0.2328 | 0.2476 | 0.2216 | 0.2359 | 4.16 | 4.29 | 4.15 | 4.28 |
| IISR 4th submit | 0.2322 | 0.2469 | 0.2194 | 0.2342 | 4.26 | 4.29 | 4.15 | 4.38 |
| IISR 5th submit | 0.2362 | 0.2768 | 0.2222 | 0.2617 | 4.14 | 4.13 | 4.16 | 4.38 |
| UniTor_0 | 0.1764 | 0.2209 | 0.1643 | 0.2062 | 4.11 | 4.02 | 4.12 | 4.28 |
| UniTor_1 | 0.1817 | 0.2195 | 0.1702 | 0.2061 | 4.05 | 4.08 | 4.07 | 4.20 |
| UniTor_2 | 0.1744 | 0.2109 | 0.1641 | 0.1981 | 4.00 | 3.91 | 4.04 | 4.19 |
| UniTor_3 | 0.1807 | 0.2143 | 0.1698 | 0.2012 | 4.08 | 4.00 | 4.13 | 4.21 |
| DB_vector_&_LLM | 0.3096 | 0.2068 | 0.3061 | 0.2030 | 4.29 | 4.45 | 4.01 | 4.35 |
| google_serach_&_LLM | 0.3096 | 0.2068 | 0.3061 | 0.2030 | 4.29 | 4.45 | 4.01 | 4.35 |
| UR-IW-1 | 0.2935 | 0.2290 | 0.2925 | 0.2240 | 4.47 | 4.58 | 4.18 | 4.44 |
| UR-IW-2 | 0.2146 | 0.1872 | 0.2242 | 0.1900 | 4.39 | 4.38 | 4.13 | 4.47 |
| UR-IW-3 | 0.2933 | 0.2372 | 0.2906 | 0.2312 | 4.36 | 4.53 | 4.13 | 4.46 |
| UR-IW-4 | 0.2310 | 0.1823 | 0.2379 | 0.1845 | 4.41 | 4.48 | 4.14 | 4.49 |
| UR-IW-5 | 0.2667 | 0.2300 | 0.2665 | 0.2266 | 4.33 | 4.42 | 4.09 | 4.28 |
| Fleming-1 | 0.2641 | 0.2150 | 0.2609 | 0.2107 | 4.34 | 4.41 | 4.18 | 4.46 |
| bioinfo-0 | 0.1609 | 0.1876 | 0.1525 | 0.1774 | 4.26 | 4.19 | 4.25 | 4.40 |
| bioinfo-1 | 0.2301 | 0.1934 | 0.2311 | 0.1916 | 4.28 | 4.34 | 4.11 | 4.39 |
| bioinfo-2 | 0.2449 | 0.2048 | 0.2437 | 0.2010 | 4.21 | 4.32 | 4.07 | 4.33 |
| bioinfo-3 | 0.2612 | 0.1991 | 0.2643 | 0.1992 | 4.36 | 4.54 | 4.06 | 4.44 |
| bioinfo-4 | 0.2452 | 0.1923 | 0.2459 | 0.1912 | 4.29 | 4.41 | 4.14 | 4.41 |
| Fleming-2 | 0.3268 | 0.1973 | 0.3259 | 0.1950 | 4.35 | 4.42 | 4.04 | 4.40 |
| Mistral7BIns10shots | 0.2326 | 0.2526 | 0.2197 | 0.2379 | 4.25 | 4.35 | 4.20 | 4.35 |
| vllm agents | 0.0816 | 0.1061 | 0.0763 | 0.0985 | 3.24 | 3.29 | 3.68 | 4.08 |
| dmiip2024 | 0.1772 | 0.2255 | 0.1634 | 0.2092 | 4.24 | 4.04 | 4.22 | 4.28 |
| dmiip2024_2 | 0.1612 | 0.2028 | 0.1544 | 0.1952 | 4.07 | 3.82 | 3.98 | 4.31 |
| dmiip2024_3 | 0.1635 | 0.2183 | 0.1499 | 0.2021 | 4.25 | 4.13 | 4.24 | 4.33 |
| dmiip2024_4 | 0.1848 | 0.2359 | 0.1694 | 0.2178 | 4.16 | 4.09 | 4.21 | 4.33 |
| config-1 | 0.1745 | 0.2127 | 0.1617 | 0.1976 | 4.07 | 3.96 | 4.08 | 4.24 |
| llama | 0.2509 | 0.2459 | 0.2413 | 0.2351 | 4.33 | 4.39 | 4.18 | 4.38 |
| dense | 0.2641 | 0.2453 | 0.2550 | 0.2356 | 4.34 | 4.53 | 4.14 | 4.44 |
| config-2 | 0.2622 | 0.2506 | 0.2563 | 0.2446 | 4.21 | 4.35 | 4.18 | 4.40 |
| config-3 | 0.2821 | 0.1995 | 0.2806 | 0.1962 | 3.87 | 4.06 | 3.62 | 3.95 |
| config-4 | 0.2492 | 0.2257 | 0.2463 | 0.2221 | 4.29 | 4.26 | 4.12 | 4.34 |
| config-5 | 0.3126 | 0.2303 | 0.3105 | 0.2262 | 4.38 | 4.56 | 4.13 | 4.46 |
| dmiip2024_1 | 0.1685 | 0.2190 | 0.1552 | 0.2022 | 4.05 | 3.94 | 4.16 | 4.21 |
| mistral | 0.2468 | 0.2562 | 0.2344 | 0.2426 | 4.06 | 4.38 | 4.16 | 4.39 |
| Fleming-3 | 0.3268 | 0.1973 | 0.3259 | 0.1950 | 4.35 | 4.42 | 4.04 | 4.40 |
| bious1 | 0.2150 | 0.2432 | 0.2065 | 0.2328 | 4.27 | 4.19 | 4.31 | 4.40 |
| bious2 | 0.2358 | 0.2564 | 0.2268 | 0.2451 | 4.26 | 4.26 | 4.24 | 4.41 |
| bious3 | 0.2356 | 0.2579 | 0.2238 | 0.2440 | 4.35 | 4.27 | 4.21 | 4.39 |
| bious4 | 0.2312 | 0.2523 | 0.2204 | 0.2399 | 4.28 | 4.22 | 4.21 | 4.40 |
| bious5 | 0.2235 | 0.2440 | 0.2138 | 0.2328 | 4.36 | 4.24 | 4.24 | 4.44 |
| kmeans | 0.0487 | 0.0422 | 0.0517 | 0.0436 | 0.86 | 0.93 | 0.87 | 0.93 |
| simple truncation | 0.0583 | 0.0513 | 0.0574 | 0.0505 | 0.88 | 0.95 | 0.87 | 0.92 |
| similarity measures | 0.0536 | 0.0423 | 0.0540 | 0.0427 | 0.89 | 0.95 | 0.91 | 0.92 |
| extractive | 0.0544 | 0.0418 | 0.0561 | 0.0425 | 0.84 | 0.91 | 0.86 | 0.91 |
| deepseek32b-me | 0.1924 | 0.1790 | 0.1920 | 0.1769 | 4.05 | 4.48 | 4.20 | 4.46 |
| EP-1 | 0.2620 | 0.2226 | 0.2576 | 0.2185 | 4.39 | 4.38 | 4.14 | 4.40 |
| abstractive | 0.0520 | 0.0388 | 0.0545 | 0.0400 | 0.86 | 0.92 | 0.87 | 0.91 |
| EP-2 | 0.2405 | 0.2461 | 0.2306 | 0.2353 | 4.39 | 4.32 | 4.20 | 4.42 |
| deepseek32b-full | 0.2013 | 0.1823 | 0.2006 | 0.1793 | 4.13 | 4.46 | 4.16 | 4.52 |
| deepseek32b-f | 0.1958 | 0.1818 | 0.1934 | 0.1773 | 4.11 | 4.53 | 4.19 | 4.44 |
| GPT4O | 0.2750 | 0.2196 | 0.2753 | 0.2177 | 4.36 | 4.42 | 4.16 | 4.39 |
| phaseB-4 | 0.1949 | 0.1850 | 0.1938 | 0.1815 | 4.08 | 4.45 | 4.14 | 4.42 |
| EP-4 | 0.2572 | 0.2665 | 0.2474 | 0.2555 | 4.42 | 4.41 | 4.28 | 4.48 |
| phaseB-5 | 0.2030 | 0.1877 | 0.2019 | 0.1848 | 4.12 | 4.49 | 4.15 | 4.47 |
| deepseek-r1:32b | 0.2556 | 0.1998 | 0.2563 | 0.1979 | 4.29 | 4.25 | 4.05 | 4.40 |
| EP-5 | 0.2225 | 0.2096 | 0.2190 | 0.2057 | 4.40 | 4.38 | 4.21 | 4.51 |
| EP-3 | 0.2355 | 0.2441 | 0.2253 | 0.2333 | 4.40 | 4.28 | 4.26 | 4.45 |
| 2025-DMIS-KU-1 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-4 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-5 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-3 | - | - | - | - | - | - | - | - |
| deepseek-r1:14b | 0.2178 | 0.1845 | 0.2217 | 0.1842 | 4.25 | 4.13 | 3.99 | 4.39 |
| 2025-DMIS-KU-2 | - | - | - | - | - | - | - | - |
| using free 7b LLM | 0.1804 | 0.2183 | 0.1688 | 0.2049 | 4.02 | 4.13 | 3.98 | 4.06 |
| deepseek-r1:8b | 0.2181 | 0.1764 | 0.2208 | 0.1760 | 4.12 | 4.12 | 3.98 | 4.26 |
| lasigeBioTM | 0.1016 | 0.1376 | 0.0929 | 0.1274 | 3.45 | 3.34 | 3.89 | 4.13 |
| gpt 01 mini | 0.2218 | 0.1807 | 0.2208 | 0.1797 | 4.16 | 4.19 | 4.00 | 4.31 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 2
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| IISR first submit | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5185 | 0.5926 | 0.5556 | 0.5112 | 0.4329 | 0.4498 |
| IISR 2nd submit | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.4444 | 0.4815 | 0.4630 | 0.4777 | 0.4104 | 0.4243 |
| IISR 3rd submit | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5185 | 0.5185 | 0.5185 | 0.5338 | 0.4804 | 0.4838 |
| IISR 4th submit | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.4444 | 0.4444 | 0.4444 | 0.6036 | 0.5123 | 0.5305 |
| IISR 5th submit | 0.8235 | 0.8571 | 0.7692 | 0.8132 | 0.5556 | 0.6296 | 0.5926 | 0.5162 | 0.4393 | 0.4547 |
| bioinfo-0 | 0.5882 | 0.7407 | - | 0.3704 | - | - | - | - | - | - |
| bioinfo-1 | 0.5882 | 0.7407 | - | 0.3704 | - | - | - | - | - | - |
| bioinfo-2 | 0.5882 | 0.7407 | - | 0.3704 | - | - | - | - | - | - |
| bioinfo-3 | 0.5882 | 0.7407 | - | 0.3704 | - | - | - | - | - | - |
| bioinfo-4 | 0.5882 | 0.7407 | - | 0.3704 | - | - | - | - | - | - |
| UR-IW-1 | 0.8235 | 0.8696 | 0.7273 | 0.7984 | 0.5556 | 0.6667 | 0.6111 | 0.4304 | 0.5800 | 0.4554 |
| UR-IW-2 | 0.8235 | 0.8571 | 0.7692 | 0.8132 | 0.5185 | 0.6296 | 0.5741 | 0.3955 | 0.4834 | 0.4079 |
| UR-IW-3 | 0.8235 | 0.8421 | 0.8000 | 0.8211 | 0.5185 | 0.6667 | 0.5741 | 0.4641 | 0.5877 | 0.4799 |
| UR-IW-4 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.5556 | 0.6296 | 0.5926 | 0.4766 | 0.5495 | 0.4790 |
| UR-IW-5 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5185 | 0.5926 | 0.5556 | 0.4276 | 0.5428 | 0.4404 |
| UniTor_0 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.7037 | 0.7037 | 0.7037 | 0.4207 | 0.5137 | 0.4275 |
| UniTor_1 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.7037 | 0.7037 | 0.7037 | 0.4207 | 0.5137 | 0.4275 |
| UniTor_2 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.7407 | 0.7407 | 0.7407 | 0.3530 | 0.3977 | 0.3453 |
| UniTor_3 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.7407 | 0.7407 | 0.7407 | 0.3530 | 0.3977 | 0.3453 |
| Fleming-1 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4074 | 0.6667 | 0.5105 | 0.5263 | 0.4636 | 0.4641 |
| Fleming-2 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4074 | 0.6667 | 0.5105 | 0.4803 | 0.5518 | 0.4783 |
| Mistral7BIns10shots | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.1481 | 0.1481 | 0.1481 | 0.2370 | 0.2231 | 0.2102 |
| GPT4turbo | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5556 | 0.6667 | 0.6111 | 0.5453 | 0.4801 | 0.4808 |
| dmiip2024 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5926 | 0.6296 | 0.6111 | 0.5719 | 0.5500 | 0.5416 |
| dmiip2024_1 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5926 | 0.5926 | 0.5926 | 0.5703 | 0.5171 | 0.5236 |
| dmiip2024_3 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.6296 | 0.6667 | 0.6481 | 0.5860 | 0.4871 | 0.5116 |
| dmiip2024_4 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5556 | 0.6667 | 0.6111 | 0.6289 | 0.5014 | 0.5312 |
| dmiip2024_2 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4815 | 0.4815 | 0.4815 | 0.4772 | 0.6140 | 0.4964 |
| bious1 | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.3704 | 0.4444 | 0.4074 | 0.3630 | 0.3820 | 0.3506 |
| bious2 | 0.7647 | 0.8182 | 0.6667 | 0.7424 | 0.3704 | 0.4444 | 0.4074 | 0.4013 | 0.4226 | 0.3982 |
| bious3 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.4444 | 0.5556 | 0.5000 | 0.4655 | 0.4492 | 0.4427 |
| bious4 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.4074 | 0.4815 | 0.4444 | 0.3986 | 0.4010 | 0.3895 |
| bious5 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.4815 | 0.4815 | 0.4815 | 0.4187 | 0.4339 | 0.4062 |
| lasigeBioTM-onto-bl | 0.5882 | 0.5882 | 0.5882 | 0.5882 | 0.0741 | 0.1481 | 0.1111 | 0.0992 | 0.1090 | 0.1010 |
| lasigeBioTM-onto-sm | 0.7059 | 0.7059 | 0.7059 | 0.7059 | 0.0741 | 0.1111 | 0.0926 | 0.0526 | 0.0367 | 0.0432 |
| Fleming-3 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4074 | 0.7778 | 0.5500 | 0.5263 | 0.4636 | 0.4641 |
| GPT4O | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.3704 | 0.3704 | 0.3704 | 0.4967 | 0.3749 | 0.4061 |
| deepseek-r1:32b | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.0741 | 0.0741 | 0.0741 | 0.1948 | 0.2089 | 0.1968 |
| deepseek-r1:14b | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.0741 | 0.0741 | 0.0741 | 0.1948 | 0.2089 | 0.1968 |
| deepseek-r1:8b | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4444 | 0.4444 | 0.4444 | 0.4912 | 0.4499 | 0.4446 |
| gpt 01 mini | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.1111 | 0.1111 | 0.1111 | 0.2949 | 0.2528 | 0.2408 |
| lasigeBioTM | 0.7647 | 0.8182 | 0.6667 | 0.7424 | 0.5185 | 0.5185 | 0.5185 | 0.6842 | 0.1962 | 0.2934 |
| deepseek32b-me | 0.8235 | 0.8421 | 0.8000 | 0.8211 | 0.5556 | 0.5556 | 0.5556 | 0.5144 | 0.5111 | 0.4899 |
| deepseek32b-full | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.6667 | 0.6667 | 0.6667 | 0.4784 | 0.4386 | 0.4340 |
| deepseek32b-f | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5926 | 0.5926 | 0.5926 | 0.4815 | 0.4614 | 0.4524 |
| phaseB-4 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5185 | 0.5185 | 0.5185 | 0.5472 | 0.5394 | 0.4923 |
| phaseB-5 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4074 | 0.4074 | 0.4074 | 0.5771 | 0.4853 | 0.4958 |
| lasigeBioTM-ku-bl | 0.7059 | 0.7826 | 0.5455 | 0.6640 | - | - | - | - | - | - |
| simple truncation | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5185 | 0.6296 | 0.5679 | 0.5534 | 0.5087 | 0.4946 |
| config-2 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.6296 | 0.6667 | 0.6481 | 0.4912 | 0.4500 | 0.4501 |
| config-1 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5185 | 0.5185 | 0.5185 | 0.5140 | 0.4173 | 0.4427 |
| config-3 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.6296 | 0.6667 | 0.6481 | 0.4912 | 0.4500 | 0.4501 |
| config-4 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.6296 | 0.7037 | 0.6605 | 0.4855 | 0.4491 | 0.4383 |
| config-5 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.6296 | 0.6667 | 0.6481 | 0.5570 | 0.4939 | 0.5002 |
| mistral | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5185 | 0.7037 | 0.6111 | 0.5240 | 0.4864 | 0.4770 |
| llama | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4815 | 0.5556 | 0.5185 | 0.5075 | 0.4442 | 0.4509 |
| dense | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5556 | 0.5556 | 0.5556 | 0.5130 | 0.4836 | 0.4793 |
| 2025-DMIS-KU-1 | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5556 | 0.5926 | 0.5741 | 0.5741 | 0.4931 | 0.5075 |
| 2025-DMIS-KU-2 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5926 | 0.6296 | 0.6111 | 0.5741 | 0.4931 | 0.5075 |
| 2025-DMIS-KU-3 | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.5185 | 0.6296 | 0.5741 | 0.5594 | 0.4828 | 0.4937 |
| EP-1 | 0.7059 | 0.7826 | 0.5455 | 0.6640 | 0.5556 | 0.7778 | 0.6494 | 0.5741 | 0.5862 | 0.5460 |
| 2025-DMIS-KU-4 | 0.8235 | 0.8421 | 0.8000 | 0.8211 | 0.5926 | 0.8148 | 0.6914 | 0.5540 | 0.5267 | 0.5123 |
| EP-2 | 0.7059 | 0.7826 | 0.5455 | 0.6640 | 0.5556 | 0.6296 | 0.5926 | 0.6009 | 0.4842 | 0.5085 |
| 2025-DMIS-KU-5 | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.6296 | 0.8148 | 0.7099 | 0.5716 | 0.5018 | 0.5098 |
| kmeans | 0.8235 | 0.8421 | 0.8000 | 0.8211 | 0.5185 | 0.6667 | 0.5926 | 0.5174 | 0.4771 | 0.4633 |
| similarity measures | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.4444 | 0.7037 | 0.5525 | 0.3361 | 0.3486 | 0.3248 |
| extractive | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.4444 | 0.7778 | 0.5716 | 0.3260 | 0.3350 | 0.3089 |
| EP-3 | 0.6471 | 0.7273 | 0.5000 | 0.6136 | 0.4444 | 0.5926 | 0.5123 | 0.5233 | 0.4627 | 0.4748 |
| abstractive | 0.8824 | 0.9091 | 0.8333 | 0.8712 | 0.4074 | 0.6667 | 0.5216 | 0.2978 | 0.3423 | 0.2983 |
| EP-4 | 0.9412 | 0.9524 | 0.9231 | 0.9377 | 0.5185 | 0.5556 | 0.5370 | 0.5583 | 0.4980 | 0.4974 |
| EP-5 | 0.8824 | 0.9000 | 0.8571 | 0.8786 | 0.5556 | 0.5556 | 0.5556 | 0.5912 | 0.4379 | 0.4849 |
| BioASQ_Baseline | 0.5294 | 0.4286 | 0.6000 | 0.5143 | 0.1852 | 0.4444 | 0.2772 | 0.2724 | 0.2898 | 0.2182 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| IISR first submit | 0.2428 | 0.2494 | 0.2293 | 0.2340 | 4.41 | 4.33 | 4.51 | 4.61 |
| IISR 2nd submit | 0.3093 | 0.3251 | 0.2945 | 0.3089 | 4.62 | 4.06 | 4.51 | 4.67 |
| IISR 3rd submit | 0.2631 | 0.2715 | 0.2512 | 0.2563 | 4.28 | 4.28 | 4.51 | 4.55 |
| IISR 4th submit | 0.2646 | 0.3137 | 0.2434 | 0.2904 | 4.52 | 4.07 | 4.51 | 4.60 |
| IISR 5th submit | 0.2926 | 0.3189 | 0.2844 | 0.3079 | 4.62 | 4.09 | 4.41 | 4.65 |
| bioinfo-0 | 0.1920 | 0.2144 | 0.1800 | 0.2004 | 4.49 | 4.15 | 4.52 | 4.64 |
| bioinfo-1 | 0.2748 | 0.2253 | 0.2674 | 0.2166 | 4.41 | 4.38 | 4.31 | 4.56 |
| bioinfo-2 | 0.2681 | 0.2093 | 0.2620 | 0.2009 | 4.46 | 4.44 | 4.36 | 4.53 |
| bioinfo-3 | 0.2566 | 0.2053 | 0.2541 | 0.2000 | 4.38 | 4.39 | 4.33 | 4.56 |
| bioinfo-4 | 0.2491 | 0.1992 | 0.2480 | 0.1957 | 4.47 | 4.33 | 4.36 | 4.55 |
| UR-IW-1 | 0.3209 | 0.2249 | 0.3198 | 0.2194 | 4.38 | 4.41 | 4.25 | 4.45 |
| UR-IW-2 | 0.2431 | 0.2099 | 0.2437 | 0.2064 | 4.58 | 4.39 | 4.42 | 4.68 |
| UR-IW-3 | 0.3329 | 0.2650 | 0.3271 | 0.2560 | 4.53 | 4.44 | 4.36 | 4.61 |
| UR-IW-4 | 0.2678 | 0.2059 | 0.2709 | 0.2036 | 4.51 | 4.53 | 4.44 | 4.60 |
| UR-IW-5 | 0.3082 | 0.2684 | 0.3030 | 0.2602 | 4.53 | 4.39 | 4.35 | 4.52 |
| UniTor_0 | 0.2640 | 0.2883 | 0.2483 | 0.2717 | 4.40 | 4.14 | 4.35 | 4.51 |
| UniTor_1 | 0.2640 | 0.2883 | 0.2483 | 0.2717 | 4.40 | 4.14 | 4.35 | 4.51 |
| UniTor_2 | 0.2557 | 0.2814 | 0.2368 | 0.2619 | 4.36 | 4.11 | 4.33 | 4.51 |
| UniTor_3 | 0.2557 | 0.2814 | 0.2368 | 0.2619 | 4.36 | 4.11 | 4.33 | 4.51 |
| Fleming-1 | 0.3055 | 0.2368 | 0.3019 | 0.2275 | 4.40 | 4.42 | 4.41 | 4.49 |
| Fleming-2 | 0.3906 | 0.2137 | 0.3790 | 0.2045 | 4.35 | 4.51 | 4.15 | 4.35 |
| Mistral7BIns10shots | 0.3054 | 0.3122 | 0.2875 | 0.2941 | 4.47 | 4.38 | 4.51 | 4.59 |
| GPT4turbo | 0.2794 | 0.3019 | 0.2646 | 0.2845 | 4.52 | 4.29 | 4.55 | 4.59 |
| dmiip2024 | 0.2420 | 0.2742 | 0.2253 | 0.2575 | 4.39 | 4.14 | 4.52 | 4.56 |
| dmiip2024_1 | 0.2492 | 0.2841 | 0.2358 | 0.2690 | 4.31 | 4.06 | 4.42 | 4.47 |
| dmiip2024_3 | 0.2023 | 0.2458 | 0.1848 | 0.2267 | 4.52 | 4.11 | 4.54 | 4.59 |
| dmiip2024_4 | 0.2277 | 0.2641 | 0.2140 | 0.2482 | 4.42 | 4.11 | 4.46 | 4.61 |
| dmiip2024_2 | 0.2421 | 0.2759 | 0.2295 | 0.2599 | 4.41 | 4.12 | 4.46 | 4.55 |
| bious1 | 0.2598 | 0.2662 | 0.2492 | 0.2522 | 4.58 | 4.38 | 4.52 | 4.66 |
| bious2 | 0.2666 | 0.2670 | 0.2541 | 0.2522 | 4.55 | 4.27 | 4.45 | 4.62 |
| bious3 | 0.2635 | 0.2668 | 0.2547 | 0.2544 | 4.60 | 4.28 | 4.54 | 4.64 |
| bious4 | 0.2562 | 0.2643 | 0.2466 | 0.2520 | 4.64 | 4.31 | 4.59 | 4.68 |
| bious5 | 0.2593 | 0.2692 | 0.2488 | 0.2557 | 4.61 | 4.26 | 4.51 | 4.68 |
| lasigeBioTM-onto-bl | 0.1771 | 0.1039 | 0.1923 | 0.1114 | 4.05 | 2.86 | 3.15 | 4.38 |
| lasigeBioTM-onto-sm | 0.1733 | 0.1106 | 0.1840 | 0.1167 | 3.98 | 2.75 | 3.07 | 4.33 |
| Fleming-3 | 0.3171 | 0.2307 | 0.3116 | 0.2229 | 4.45 | 4.42 | 4.38 | 4.52 |
| GPT4O | 0.2901 | 0.1741 | 0.2940 | 0.1737 | 4.13 | 4.33 | 4.15 | 4.33 |
| deepseek-r1:32b | 0.0993 | 0.1058 | 0.1072 | 0.1134 | 3.89 | 2.65 | 3.31 | 4.36 |
| deepseek-r1:14b | 0.1082 | 0.1088 | 0.1148 | 0.1154 | 4.11 | 2.78 | 3.48 | 4.40 |
| deepseek-r1:8b | 0.2874 | 0.1994 | 0.2837 | 0.1946 | 4.31 | 4.52 | 4.32 | 4.39 |
| gpt 01 mini | 0.1812 | 0.1228 | 0.1947 | 0.1290 | 4.28 | 3.49 | 3.74 | 4.42 |
| lasigeBioTM | 0.2448 | 0.2521 | 0.2308 | 0.2361 | 4.27 | 4.12 | 4.38 | 4.64 |
| deepseek32b-me | 0.2032 | 0.1745 | 0.2034 | 0.1708 | 3.88 | 4.25 | 4.19 | 4.33 |
| deepseek32b-full | 0.2228 | 0.1789 | 0.2194 | 0.1743 | 4.11 | 4.58 | 4.35 | 4.49 |
| deepseek32b-f | 0.2288 | 0.1836 | 0.2246 | 0.1792 | 4.16 | 4.59 | 4.41 | 4.59 |
| phaseB-4 | 0.3009 | 0.2511 | 0.2956 | 0.2422 | 4.15 | 4.55 | 4.32 | 4.48 |
| phaseB-5 | 0.2934 | 0.2480 | 0.2864 | 0.2380 | 4.34 | 4.59 | 4.40 | 4.59 |
| lasigeBioTM-ku-bl | 0.2315 | 0.2778 | 0.2158 | 0.2588 | 4.54 | 4.01 | 4.51 | 4.64 |
| simple truncation | 0.0871 | 0.0776 | 0.0846 | 0.0752 | 1.14 | 1.21 | 1.15 | 1.16 |
| config-2 | 0.3594 | 0.2394 | 0.3556 | 0.2320 | 4.04 | 4.26 | 3.98 | 4.19 |
| config-1 | 0.2870 | 0.3001 | 0.2760 | 0.2877 | 4.15 | 4.12 | 4.36 | 4.48 |
| config-3 | 0.3594 | 0.2394 | 0.3556 | 0.2320 | 4.04 | 4.26 | 3.98 | 4.19 |
| config-4 | 0.2955 | 0.2367 | 0.2859 | 0.2265 | 4.51 | 4.44 | 4.48 | 4.64 |
| config-5 | 0.3914 | 0.2557 | 0.3885 | 0.2481 | 4.33 | 4.54 | 4.28 | 4.52 |
| mistral | 0.3064 | 0.2819 | 0.2926 | 0.2657 | 4.42 | 4.41 | 4.47 | 4.56 |
| llama | 0.2614 | 0.2160 | 0.2528 | 0.2065 | 4.21 | 4.52 | 4.40 | 4.55 |
| dense | 0.2814 | 0.2422 | 0.2715 | 0.2303 | 4.26 | 4.31 | 4.28 | 4.52 |
| 2025-DMIS-KU-1 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-2 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-3 | - | - | - | - | - | - | - | - |
| EP-1 | 0.3112 | 0.2601 | 0.2990 | 0.2461 | 4.38 | 4.35 | 4.39 | 4.49 |
| 2025-DMIS-KU-4 | - | - | - | - | - | - | - | - |
| EP-2 | 0.3000 | 0.2441 | 0.2855 | 0.2299 | 4.42 | 4.51 | 4.41 | 4.56 |
| 2025-DMIS-KU-5 | - | - | - | - | - | - | - | - |
| kmeans | 0.0899 | 0.0769 | 0.0860 | 0.0737 | 1.12 | 1.21 | 1.15 | 1.14 |
| similarity measures | 0.0809 | 0.0519 | 0.0805 | 0.0512 | 1.13 | 1.24 | 1.07 | 1.19 |
| extractive | 0.0784 | 0.0516 | 0.0793 | 0.0518 | 1.06 | 1.24 | 1.01 | 1.13 |
| EP-3 | 0.3373 | 0.2545 | 0.3256 | 0.2436 | 4.38 | 4.54 | 4.41 | 4.48 |
| abstractive | 0.0889 | 0.0520 | 0.0881 | 0.0508 | 1.11 | 1.24 | 1.05 | 1.13 |
| EP-4 | 0.3634 | 0.2588 | 0.3575 | 0.2496 | 4.41 | 4.45 | 4.29 | 4.48 |
| EP-5 | 0.3136 | 0.2851 | 0.2924 | 0.2652 | 4.47 | 4.33 | 4.54 | 4.61 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 3
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| UR-IW-1 | 0.8636 | 0.9091 | 0.7273 | 0.8182 | 0.4000 | 0.5000 | 0.4225 | 0.4582 | 0.5951 | 0.4755 |
| UR-IW-2 | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.3500 | 0.4000 | 0.3750 | 0.4314 | 0.5212 | 0.4470 |
| UR-IW-3 | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.3000 | 0.4000 | 0.3500 | 0.4533 | 0.5569 | 0.4765 |
| UR-IW-4 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.3000 | 0.3500 | 0.3250 | 0.4676 | 0.4922 | 0.4666 |
| UR-IW-5 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.2500 | 0.4500 | 0.3500 | 0.4817 | 0.5368 | 0.4877 |
| UniTor_0 | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.4000 | 0.4000 | 0.4000 | 0.5394 | 0.4990 | 0.5109 |
| UniTor_1 | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.4000 | 0.4500 | 0.4250 | 0.5924 | 0.5272 | 0.5472 |
| UniTor_2 | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.3500 | 0.3500 | 0.3500 | 0.5247 | 0.4743 | 0.4885 |
| UniTor_3 | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.4000 | 0.4000 | 0.4000 | 0.5625 | 0.5035 | 0.5204 |
| bioinfo-0 | 0.6818 | 0.8108 | - | 0.4054 | - | - | - | - | - | - |
| bioinfo-1 | 0.6818 | 0.8108 | - | 0.4054 | - | - | - | - | - | - |
| bioinfo-2 | 0.6818 | 0.8108 | - | 0.4054 | - | - | - | - | - | - |
| bioinfo-3 | 0.6818 | 0.8108 | - | 0.4054 | - | - | - | - | - | - |
| bioinfo-4 | 0.6818 | 0.8108 | - | 0.4054 | - | - | - | - | - | - |
| Synthia with first | 0.8636 | 0.8966 | 0.8000 | 0.8483 | 0.0500 | 0.0500 | 0.0500 | 0.3716 | 0.3731 | 0.3546 |
| RMC_append_snippets | 0.9545 | 0.9677 | 0.9231 | 0.9454 | - | - | - | 0.3832 | 0.4019 | 0.3669 |
| IISR first submit | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4000 | 0.4500 | 0.4250 | 0.6048 | 0.4980 | 0.5292 |
| IISR 2nd submit | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.3500 | 0.4000 | 0.3750 | 0.6433 | 0.5403 | 0.5746 |
| IISR 3rd submit | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4000 | 0.4500 | 0.4250 | 0.6522 | 0.5197 | 0.5619 |
| IISR 4th submit | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.2000 | 0.2500 | 0.2250 | 0.6375 | 0.5136 | 0.5491 |
| IISR 5th submit | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.2500 | 0.3000 | 0.2750 | 0.6407 | 0.5494 | 0.5758 |
| lasigeBioTM | 0.7727 | 0.8276 | 0.6667 | 0.7471 | 0.3500 | 0.3500 | 0.3500 | 0.5343 | 0.4429 | 0.4668 |
| AQAMS2 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.3000 | 0.3500 | 0.3250 | 0.6390 | 0.5539 | 0.5831 |
| mistral | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.3500 | 0.5500 | 0.4500 | 0.5909 | 0.5302 | 0.5411 |
| llama | 0.8636 | 0.9091 | 0.7273 | 0.8182 | 0.4000 | 0.4500 | 0.4250 | 0.5911 | 0.5127 | 0.5406 |
| dense | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.3500 | 0.5500 | 0.4500 | 0.5473 | 0.4929 | 0.5065 |
| GPT4O | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.3500 | 0.3500 | 0.3500 | 0.5256 | 0.4600 | 0.4822 |
| deepseek-r1:32b | 0.8182 | 0.8667 | 0.7143 | 0.7905 | 0.1500 | 0.1500 | 0.1500 | 0.4924 | 0.4231 | 0.4456 |
| deepseek-r1:14b | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.2500 | 0.2500 | 0.2500 | 0.4317 | 0.4156 | 0.4152 |
| deepseek-r1:8b | 0.8636 | 0.9032 | 0.7692 | 0.8362 | 0.1000 | 0.1000 | 0.1000 | 0.4886 | 0.4368 | 0.4474 |
| Fleming-4 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.2500 | 0.6000 | 0.3725 | 0.4062 | 0.5710 | 0.4483 |
| Fleming-1 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.2000 | 0.6000 | 0.3467 | 0.5314 | 0.5796 | 0.5311 |
| 2025-DMIS-KU-1 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.3500 | 0.6000 | 0.4475 | 0.6021 | 0.5045 | 0.5379 |
| simple truncation | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4500 | 0.6000 | 0.5042 | 0.4400 | 0.3752 | 0.3980 |
| kmeans | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4000 | 0.6000 | 0.4917 | 0.4242 | 0.3700 | 0.3793 |
| Fleming-2 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.2500 | 0.5000 | 0.3500 | 0.4370 | 0.5710 | 0.4709 |
| 2025-DMIS-KU-2 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.3000 | 0.6000 | 0.4225 | 0.6354 | 0.5117 | 0.5503 |
| bious1 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.3000 | 0.3500 | 0.3250 | 0.4853 | 0.4561 | 0.4595 |
| bious2 | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.2000 | 0.3000 | 0.2417 | 0.4896 | 0.4530 | 0.4647 |
| 2025-DMIS-KU-3 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.3500 | 0.6000 | 0.4542 | 0.6125 | 0.5367 | 0.5594 |
| Fleming-3 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.2500 | 0.5000 | 0.3500 | 0.4062 | 0.5710 | 0.4483 |
| bious3 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.2000 | 0.3000 | 0.2417 | 0.4510 | 0.4071 | 0.4233 |
| 2025-DMIS-KU-4 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4500 | 0.6000 | 0.5125 | 0.6317 | 0.5277 | 0.5611 |
| 2025-DMIS-KU-5 | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.3500 | 0.6000 | 0.4417 | 0.6439 | 0.5483 | 0.5721 |
| bious4 | 0.8182 | 0.8571 | 0.7500 | 0.8036 | 0.3000 | 0.4000 | 0.3417 | 0.4716 | 0.4541 | 0.4565 |
| bious5 | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.2000 | 0.3000 | 0.2417 | 0.4552 | 0.4206 | 0.4322 |
| EP-1 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4000 | 0.5500 | 0.4625 | 0.6716 | 0.5667 | 0.5908 |
| EP-2 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.4000 | 0.6000 | 0.4792 | 0.6421 | 0.5201 | 0.5572 |
| lasigeBioTM-onto-bl | 0.8182 | 0.8750 | 0.6667 | 0.7708 | 0.1000 | 0.1000 | 0.1000 | 0.5314 | 0.4180 | 0.4538 |
| lasigeBioTM-onto-sm | 0.5000 | 0.5217 | 0.4762 | 0.4990 | - | - | - | - | - | - |
| similarity measures | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.3500 | 0.6000 | 0.4600 | 0.4698 | 0.4165 | 0.4324 |
| sp_lasigebiotm | 0.7727 | 0.8387 | 0.6154 | 0.7270 | 0.2000 | 0.2000 | 0.2000 | 0.5576 | 0.4371 | 0.4662 |
| extractive | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.1000 | 0.1000 | 0.1000 | - | - | - |
| dmiip2024 | 0.9091 | 0.9286 | 0.8750 | 0.9018 | 0.3500 | 0.4500 | 0.4000 | 0.5945 | 0.4803 | 0.5198 |
| dmiip2024_1 | 0.8182 | 0.8571 | 0.7500 | 0.8036 | 0.4000 | 0.4000 | 0.4000 | 0.6496 | 0.5075 | 0.5469 |
| dmiip2024_3 | 0.8636 | 0.9091 | 0.7273 | 0.8182 | 0.3500 | 0.4500 | 0.3917 | 0.5722 | 0.4832 | 0.5153 |
| dmiip2024_4 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4000 | 0.4500 | 0.4250 | 0.6071 | 0.4516 | 0.5004 |
| dmiip2024_2 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.2500 | 0.3000 | 0.2750 | 0.5133 | 0.5394 | 0.5037 |
| deepseek32b-me | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.3500 | 0.3500 | 0.3500 | 0.5433 | 0.5011 | 0.5105 |
| deepseek32b-full | 0.9091 | 0.9333 | 0.8571 | 0.8952 | 0.3500 | 0.3500 | 0.3500 | 0.5433 | 0.5011 | 0.5105 |
| deepseek32b-f | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.4500 | 0.4500 | 0.4500 | 0.6247 | 0.5096 | 0.5419 |
| EP-3 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.4000 | 0.6500 | 0.5100 | 0.6026 | 0.5827 | 0.5737 |
| phaseB-4 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4500 | 0.4500 | 0.4500 | 0.6417 | 0.5045 | 0.5522 |
| phaseB-5 | 0.9545 | 0.9677 | 0.9231 | 0.9454 | 0.4000 | 0.4000 | 0.4000 | 0.5770 | 0.4722 | 0.5039 |
| EP-4 | 0.9091 | 0.9375 | 0.8333 | 0.8854 | 0.4500 | 0.5000 | 0.4750 | 0.6013 | 0.5235 | 0.5485 |
| BioASQ_Baseline | 0.3636 | 0.2222 | 0.4615 | 0.3419 | 0.0000 | 0.1500 | 0.0542 | 0.1821 | 0.2528 | 0.1672 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| UR-IW-1 | 0.2794 | 0.2015 | 0.2911 | 0.2010 | 4.44 | 4.52 | 4.27 | 4.47 |
| UR-IW-2 | 0.1599 | 0.1433 | 0.1787 | 0.1522 | 4.48 | 4.38 | 4.33 | 4.56 |
| UR-IW-3 | 0.2892 | 0.2158 | 0.2925 | 0.2096 | 4.44 | 4.48 | 4.31 | 4.45 |
| UR-IW-4 | 0.1573 | 0.1356 | 0.1705 | 0.1427 | 4.40 | 4.40 | 4.34 | 4.55 |
| UR-IW-5 | 0.2350 | 0.2093 | 0.2425 | 0.2057 | 4.36 | 4.28 | 4.21 | 4.35 |
| UniTor_0 | 0.2002 | 0.2432 | 0.1931 | 0.2335 | 4.27 | 3.91 | 4.29 | 4.41 |
| UniTor_1 | 0.2155 | 0.2576 | 0.2113 | 0.2510 | 4.32 | 3.98 | 4.35 | 4.49 |
| UniTor_2 | 0.2125 | 0.2573 | 0.2084 | 0.2511 | 4.29 | 3.81 | 4.22 | 4.46 |
| UniTor_3 | 0.2265 | 0.2658 | 0.2214 | 0.2585 | 4.33 | 3.91 | 4.25 | 4.45 |
| bioinfo-0 | 0.1647 | 0.1836 | 0.1632 | 0.1774 | 4.20 | 4.07 | 4.52 | 4.53 |
| bioinfo-1 | 0.2517 | 0.2020 | 0.2478 | 0.1953 | 4.45 | 4.46 | 4.33 | 4.61 |
| bioinfo-2 | 0.2357 | 0.1872 | 0.2420 | 0.1868 | 4.44 | 4.42 | 4.19 | 4.52 |
| bioinfo-3 | 0.2248 | 0.1810 | 0.2317 | 0.1800 | 4.32 | 4.36 | 4.19 | 4.53 |
| bioinfo-4 | 0.1912 | 0.1674 | 0.1987 | 0.1674 | 4.44 | 4.34 | 4.41 | 4.56 |
| Synthia with first | 0.1974 | 0.2216 | 0.1906 | 0.2128 | 3.02 | 3.88 | 3.61 | 3.91 |
| RMC_append_snippets | 0.2415 | 0.2526 | 0.2323 | 0.2419 | 2.89 | 4.19 | 3.73 | 3.87 |
| IISR first submit | 0.1835 | 0.1960 | 0.1819 | 0.1930 | 4.27 | 4.14 | 4.45 | 4.49 |
| IISR 2nd submit | 0.2818 | 0.2886 | 0.2765 | 0.2801 | 4.45 | 4.15 | 4.41 | 4.48 |
| IISR 3rd submit | 0.2132 | 0.2230 | 0.2091 | 0.2127 | 4.29 | 4.14 | 4.38 | 4.49 |
| IISR 4th submit | 0.2292 | 0.2749 | 0.2231 | 0.2674 | 4.40 | 3.99 | 4.35 | 4.49 |
| IISR 5th submit | 0.2732 | 0.2865 | 0.2659 | 0.2765 | 4.55 | 4.20 | 4.52 | 4.61 |
| lasigeBioTM | 0.3136 | 0.2012 | 0.3169 | 0.1990 | 4.14 | 4.29 | 4.01 | 4.20 |
| AQAMS2 | 0.3009 | 0.1996 | 0.3059 | 0.1950 | 3.92 | 4.48 | 4.02 | 4.34 |
| mistral | 0.2614 | 0.2272 | 0.2614 | 0.2189 | 4.41 | 4.38 | 4.36 | 4.54 |
| llama | 0.2117 | 0.1829 | 0.2100 | 0.1791 | 4.38 | 4.51 | 4.40 | 4.52 |
| dense | 0.2685 | 0.2542 | 0.2640 | 0.2437 | 4.46 | 4.29 | 4.47 | 4.55 |
| GPT4O | 0.2786 | 0.2046 | 0.2806 | 0.2027 | 4.42 | 4.26 | 4.32 | 4.52 |
| deepseek-r1:32b | 0.2219 | 0.1622 | 0.2301 | 0.1644 | 4.29 | 4.16 | 4.14 | 4.40 |
| deepseek-r1:14b | 0.1520 | 0.1719 | 0.1568 | 0.1749 | 4.15 | 3.69 | 4.01 | 4.36 |
| deepseek-r1:8b | 0.1592 | 0.1668 | 0.1587 | 0.1659 | 4.20 | 3.72 | 4.09 | 4.40 |
| Fleming-4 | 0.2602 | 0.1482 | 0.2791 | 0.1520 | 4.21 | 4.52 | 3.99 | 4.38 |
| Fleming-1 | 0.2821 | 0.1822 | 0.2873 | 0.1828 | 4.28 | 4.45 | 4.00 | 4.40 |
| 2025-DMIS-KU-1 | - | - | - | - | - | - | - | - |
| simple truncation | 0.0843 | 0.0736 | 0.0844 | 0.0734 | 1.11 | 1.11 | 1.07 | 1.11 |
| kmeans | 0.0864 | 0.0690 | 0.0891 | 0.0704 | 1.12 | 1.15 | 1.07 | 1.13 |
| Fleming-2 | 0.3243 | 0.1525 | 0.3366 | 0.1533 | 4.19 | 4.49 | 3.84 | 4.24 |
| 2025-DMIS-KU-2 | - | - | - | - | - | - | - | - |
| bious1 | 0.2351 | 0.2342 | 0.2261 | 0.2225 | 4.45 | 4.13 | 4.34 | 4.51 |
| bious2 | 0.2455 | 0.2316 | 0.2451 | 0.2254 | 4.42 | 4.21 | 4.40 | 4.51 |
| 2025-DMIS-KU-3 | - | - | - | - | - | - | - | - |
| Fleming-3 | 0.3122 | 0.1463 | 0.3218 | 0.1476 | 4.12 | 4.53 | 3.95 | 4.19 |
| bious3 | 0.2399 | 0.2353 | 0.2415 | 0.2315 | 4.44 | 4.18 | 4.41 | 4.54 |
| 2025-DMIS-KU-4 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-5 | - | - | - | - | - | - | - | - |
| bious4 | 0.2474 | 0.2350 | 0.2421 | 0.2258 | 4.39 | 4.19 | 4.32 | 4.49 |
| bious5 | 0.2411 | 0.2336 | 0.2399 | 0.2269 | 4.45 | 4.21 | 4.38 | 4.55 |
| EP-1 | 0.2533 | 0.2218 | 0.2578 | 0.2165 | 4.33 | 4.39 | 4.26 | 4.42 |
| EP-2 | 0.2802 | 0.2340 | 0.2811 | 0.2282 | 4.38 | 4.47 | 4.27 | 4.48 |
| lasigeBioTM-onto-bl | 0.2857 | 0.1945 | 0.2841 | 0.1917 | 4.27 | 4.44 | 4.22 | 4.44 |
| lasigeBioTM-onto-sm | 0.0871 | 0.0919 | 0.0918 | 0.0960 | 3.31 | 2.21 | 2.65 | 3.96 |
| similarity measures | 0.0856 | 0.0629 | 0.0858 | 0.0620 | 1.08 | 1.15 | 1.06 | 1.11 |
| sp_lasigebiotm | 0.2606 | 0.2094 | 0.2566 | 0.2039 | 4.26 | 4.12 | 4.14 | 4.47 |
| extractive | 0.0935 | 0.0538 | 0.0966 | 0.0545 | 1.08 | 1.16 | 1.01 | 1.04 |
| dmiip2024 | 0.1935 | 0.2309 | 0.1933 | 0.2262 | 4.39 | 4.04 | 4.38 | 4.44 |
| dmiip2024_1 | 0.1888 | 0.2276 | 0.1860 | 0.2219 | 4.41 | 3.99 | 4.39 | 4.47 |
| dmiip2024_3 | 0.1694 | 0.2148 | 0.1634 | 0.2056 | 4.40 | 3.91 | 4.35 | 4.41 |
| dmiip2024_4 | 0.1852 | 0.2295 | 0.1813 | 0.2229 | 4.31 | 3.93 | 4.34 | 4.45 |
| dmiip2024_2 | 0.2039 | 0.2464 | 0.2010 | 0.2406 | 4.26 | 4.00 | 4.32 | 4.34 |
| deepseek32b-me | 0.1965 | 0.2369 | 0.1893 | 0.2286 | 4.25 | 3.92 | 4.38 | 4.45 |
| deepseek32b-full | 0.1965 | 0.2369 | 0.1893 | 0.2286 | 4.25 | 3.92 | 4.38 | 4.45 |
| deepseek32b-f | 0.2226 | 0.1783 | 0.2259 | 0.1760 | 4.28 | 4.53 | 4.25 | 4.49 |
| EP-3 | 0.2784 | 0.2315 | 0.2780 | 0.2234 | 4.34 | 4.45 | 4.29 | 4.42 |
| phaseB-4 | 0.2022 | 0.1711 | 0.2081 | 0.1700 | 4.24 | 4.54 | 4.24 | 4.44 |
| phaseB-5 | 0.2380 | 0.2049 | 0.2486 | 0.2030 | 4.19 | 4.52 | 4.33 | 4.48 |
| EP-4 | 0.3170 | 0.2384 | 0.3112 | 0.2292 | 4.36 | 4.51 | 4.20 | 4.44 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |
Test batch 4
Exact Answers
| Yes/No | Factoid | List | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| System | Accuracy | F1 Yes | F1 No | Macro F1 | Strict Acc. | Lenient Acc. | MRR | Mean Prec. | Recall | F-Measure |
| UniTor_0 | 0.8462 | 0.8889 | 0.7500 | 0.8194 | 0.5455 | 0.5909 | 0.5682 | 0.4480 | 0.3686 | 0.3736 |
| UniTor_1 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5455 | 0.6364 | 0.5909 | 0.5051 | 0.3880 | 0.4205 |
| UniTor_2 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5455 | 0.6364 | 0.5909 | 0.3621 | 0.2737 | 0.2961 |
| UniTor_3 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5455 | 0.6364 | 0.5909 | 0.4205 | 0.3749 | 0.3678 |
| UR-IW-1 | 0.8462 | 0.8947 | 0.7143 | 0.8045 | 0.5455 | 0.5909 | 0.5606 | 0.3794 | 0.5100 | 0.4019 |
| UR-IW-2 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5455 | 0.5455 | 0.5455 | 0.3711 | 0.4807 | 0.3844 |
| UR-IW-3 | 0.7692 | 0.8125 | 0.7000 | 0.7563 | 0.4545 | 0.4545 | 0.4545 | 0.4544 | 0.5584 | 0.4638 |
| UR-IW-4 | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 0.5455 | 0.6364 | 0.5909 | 0.4660 | 0.5116 | 0.4576 |
| UR-IW-5 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5000 | 0.5455 | 0.5227 | 0.4116 | 0.5366 | 0.4401 |
| Synthia with first | 0.8846 | 0.9091 | 0.8421 | 0.8756 | 0.1818 | 0.1818 | 0.1818 | 0.3193 | 0.1963 | 0.2258 |
| RMC_append_snippets | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.3636 | 0.3636 | 0.3636 | 0.4281 | 0.3305 | 0.3508 |
| bioinfo-0 | 0.6538 | 0.7907 | - | 0.3953 | - | - | - | - | - | - |
| bioinfo-1 | 0.6538 | 0.7907 | - | 0.3953 | - | - | - | - | - | - |
| bioinfo-2 | 0.6538 | 0.7907 | - | 0.3953 | - | - | - | - | - | - |
| bioinfo-3 | 0.6538 | 0.7907 | - | 0.3953 | - | - | - | - | - | - |
| bioinfo-4 | 0.6538 | 0.7907 | - | 0.3953 | - | - | - | - | - | - |
| My system 1 | 0.8077 | 0.8718 | 0.6154 | 0.7436 | - | - | - | - | - | - |
| 3.PhaseB_System | 0.6538 | 0.7907 | - | 0.3953 | 0.1818 | 0.1818 | 0.1818 | 0.0531 | 0.0526 | 0.0512 |
| edo | 0.3462 | - | 0.5143 | 0.2571 | 0.1364 | 0.2727 | 0.1818 | 0.0895 | 0.0856 | 0.0839 |
| DB_vector_&_LLM | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5455 | 0.6364 | 0.5795 | 0.4875 | 0.5013 | 0.4746 |
| Machinen Results | 0.7308 | 0.8293 | 0.3636 | 0.5965 | 0.2727 | 0.4091 | 0.3182 | 0.1007 | 0.1919 | 0.1231 |
| Fleming-1 | 0.9231 | 0.9412 | 0.8889 | 0.9150 | 0.3636 | 0.5909 | 0.4697 | 0.5088 | 0.3994 | 0.4214 |
| AQAMS2 | 0.8462 | 0.8947 | 0.7143 | 0.8045 | 0.5909 | 0.5909 | 0.5909 | 0.6035 | 0.4131 | 0.4703 |
| IISR first submit | 0.8462 | 0.8889 | 0.7500 | 0.8194 | 0.5000 | 0.5909 | 0.5455 | 0.6335 | 0.5035 | 0.5472 |
| IISR 2nd submit | 0.8077 | 0.8485 | 0.7368 | 0.7927 | 0.4545 | 0.5000 | 0.4773 | 0.6575 | 0.4908 | 0.5400 |
| IISR 3rd submit | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.5455 | 0.5909 | 0.5682 | 0.5818 | 0.4582 | 0.4990 |
| IISR 4th submit | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.4091 | 0.5000 | 0.4545 | 0.4812 | 0.3102 | 0.3628 |
| dmiip2024 | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.5455 | 0.6364 | 0.5795 | 0.6752 | 0.5207 | 0.5718 |
| dmiip2024_1 | 0.9231 | 0.9412 | 0.8889 | 0.9150 | 0.5455 | 0.5455 | 0.5455 | 0.6565 | 0.5086 | 0.5585 |
| dmiip2024_2 | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.5909 | 0.5909 | 0.5909 | 0.5482 | 0.5478 | 0.5189 |
| dmiip2024_4 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5000 | 0.5909 | 0.5455 | 0.7596 | 0.4876 | 0.5657 |
| dmiip2024_3 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5909 | 0.5909 | 0.5909 | 0.6534 | 0.4331 | 0.4996 |
| IISR 5th submit | 0.8846 | 0.9091 | 0.8421 | 0.8756 | 0.4545 | 0.5455 | 0.5000 | 0.5890 | 0.4358 | 0.4839 |
| deepseek32b-me | 0.8462 | 0.8889 | 0.7500 | 0.8194 | 0.4545 | 0.4545 | 0.4545 | 0.4288 | 0.3474 | 0.3588 |
| deepseek32b-full | 0.8462 | 0.8889 | 0.7500 | 0.8194 | 0.4545 | 0.4545 | 0.4545 | 0.4288 | 0.3474 | 0.3588 |
| deepseek32b-f | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5000 | 0.5000 | 0.5000 | 0.5335 | 0.4208 | 0.4531 |
| phaseB-4 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5000 | 0.5000 | 0.5000 | 0.5158 | 0.4054 | 0.4365 |
| phaseB-5 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5000 | 0.5000 | 0.5000 | 0.5716 | 0.4959 | 0.5030 |
| Mistral7BIns10shots | 0.8462 | 0.8824 | 0.7778 | 0.8301 | 0.4545 | 0.5000 | 0.4773 | 0.5341 | 0.4257 | 0.4610 |
| GPT4turbo | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5455 | 0.6364 | 0.5909 | 0.6123 | 0.4912 | 0.5294 |
| GPTPrompt1sStyle2 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.6364 | 0.6818 | 0.6591 | 0.6052 | 0.4888 | 0.5243 |
| bious1 | 0.8462 | 0.8824 | 0.7778 | 0.8301 | 0.4545 | 0.5455 | 0.4924 | 0.5417 | 0.4482 | 0.4702 |
| bious2 | 0.8077 | 0.8387 | 0.7619 | 0.8003 | 0.4091 | 0.5000 | 0.4545 | 0.4813 | 0.4456 | 0.4405 |
| bious3 | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.3636 | 0.4545 | 0.4091 | 0.4860 | 0.4343 | 0.4363 |
| GPTPrompt1sStyle3 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5909 | 0.6818 | 0.6364 | 0.6464 | 0.5144 | 0.5538 |
| bious4 | 0.8077 | 0.8485 | 0.7368 | 0.7927 | 0.4545 | 0.5455 | 0.4924 | 0.4750 | 0.4358 | 0.4363 |
| bious5 | 0.8462 | 0.8824 | 0.7778 | 0.8301 | 0.4545 | 0.5455 | 0.5000 | 0.4735 | 0.4799 | 0.4548 |
| NLP-UTB4 | 0.6538 | 0.7907 | - | 0.3953 | 0.0455 | 0.0455 | 0.0455 | 0.1053 | 0.0263 | 0.0421 |
| sp_lasigebiotm | 0.9231 | 0.9375 | 0.9000 | 0.9188 | 0.5000 | 0.5000 | 0.5000 | 0.4756 | 0.2269 | 0.2834 |
| lasigeBioTM | 0.8077 | 0.8387 | 0.7619 | 0.8003 | 0.4091 | 0.4091 | 0.4091 | 0.4380 | 0.3442 | 0.3612 |
| lasigeBioTM-onto-bl | 0.8846 | 0.9091 | 0.8421 | 0.8756 | 0.3636 | 0.4091 | 0.3864 | 0.6400 | 0.4221 | 0.4932 |
| lasigeBioTM-onto-sm | 0.7308 | 0.7586 | 0.6957 | 0.7271 | 0.0455 | 0.1364 | 0.0833 | 0.3113 | 0.1488 | 0.1933 |
| Fleming-4 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.3182 | 0.4545 | 0.3591 | 0.3261 | 0.4434 | 0.3560 |
| Fleming-5 | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.3182 | 0.4545 | 0.3652 | 0.3261 | 0.4434 | 0.3560 |
| mistral | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.5455 | 0.5909 | 0.5682 | 0.5231 | 0.4719 | 0.4791 |
| llama | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5455 | 0.6364 | 0.5909 | 0.5553 | 0.5250 | 0.5220 |
| dense | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5000 | 0.5909 | 0.5455 | 0.5344 | 0.4533 | 0.4800 |
| GPT4O | 0.9231 | 0.9412 | 0.8889 | 0.9150 | 0.3182 | 0.3182 | 0.3182 | 0.4618 | 0.3259 | 0.3642 |
| deepseek-r1:32b | 0.8462 | 0.8824 | 0.7778 | 0.8301 | 0.4091 | 0.4091 | 0.4091 | 0.4736 | 0.3966 | 0.4160 |
| deepseek-r1:8b | 0.8846 | 0.9091 | 0.8421 | 0.8756 | 0.2727 | 0.3182 | 0.2955 | 0.4997 | 0.4097 | 0.4296 |
| gpt 01 mini | 0.8846 | 0.9091 | 0.8421 | 0.8756 | 0.4545 | 0.4545 | 0.4545 | 0.3486 | 0.2315 | 0.2581 |
| 2025-DMIS-KU-1 | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.5455 | 0.5909 | 0.5682 | 0.6833 | 0.4855 | 0.5482 |
| Fleming-2 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.3182 | 0.4545 | 0.3591 | 0.3474 | 0.2758 | 0.2969 |
| Fleming-3 | 0.8846 | 0.9143 | 0.8235 | 0.8689 | 0.3182 | 0.4545 | 0.3591 | 0.3474 | 0.2758 | 0.2969 |
| 2025-DMIS-KU-2 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5909 | 0.7273 | 0.6364 | 0.6833 | 0.4855 | 0.5484 |
| 2025-DMIS-KU-3 | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.5909 | 0.7273 | 0.6364 | 0.6774 | 0.5031 | 0.5573 |
| 2025-DMIS-KU-4 | 0.8846 | 0.9189 | 0.8000 | 0.8595 | 0.5909 | 0.7273 | 0.6364 | 0.6723 | 0.5316 | 0.5783 |
| 2025-DMIS-KU-5 | 0.9231 | 0.9412 | 0.8889 | 0.9150 | 0.5455 | 0.6818 | 0.5909 | 0.6939 | 0.4855 | 0.5503 |
| EP-1 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5455 | 0.5909 | 0.5606 | 0.6026 | 0.4484 | 0.4925 |
| EP-2 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5455 | 0.5455 | 0.5455 | 0.5823 | 0.5184 | 0.5319 |
| EP-3 | 0.9231 | 0.9444 | 0.8750 | 0.9097 | 0.5909 | 0.5909 | 0.5909 | 0.5443 | 0.5452 | 0.5193 |
| EP-4 | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.5455 | 0.5909 | 0.5682 | 0.5786 | 0.5265 | 0.5335 |
| EP-5 | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.5909 | 0.6364 | 0.6136 | 0.5754 | 0.3981 | 0.4519 |
| simple truncation | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.4545 | 0.5455 | 0.5000 | 0.5209 | 0.4591 | 0.4746 |
| kmeans | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.5000 | 0.5909 | 0.5455 | 0.5337 | 0.4887 | 0.4936 |
| similarity measures | 0.9231 | 0.9412 | 0.8889 | 0.9150 | 0.4545 | 0.6364 | 0.5455 | 0.2562 | 0.4706 | 0.3003 |
| extractive | 0.9615 | 0.9714 | 0.9412 | 0.9563 | 0.5000 | 0.5909 | 0.5455 | 0.3068 | 0.3995 | 0.3235 |
| abstractive | 0.9231 | 0.9412 | 0.8889 | 0.9150 | 0.5000 | 0.6364 | 0.5682 | 0.2598 | 0.4772 | 0.3048 |
| BioASQ_Baseline | 0.3462 | 0.3200 | 0.3704 | 0.3452 | 0.1818 | 0.2727 | 0.2197 | 0.2243 | 0.2643 | 0.2226 |
Ideal Answers
| Automatic scores (Rouge - R) | Manual scores | |||||||
|---|---|---|---|---|---|---|---|---|
| System | R-2 (Rec) | R-2 (F1) | R-SU4 (Rec) | R-SU4 (F1) | Readability | Recall | Precision | Repetition |
| UniTor_0 | 0.1690 | 0.2057 | 0.1604 | 0.1933 | 4.22 | 3.78 | 4.14 | 4.27 |
| UniTor_1 | 0.1718 | 0.2045 | 0.1654 | 0.1940 | 4.24 | 3.80 | 4.19 | 4.31 |
| UniTor_2 | 0.1686 | 0.2050 | 0.1628 | 0.1961 | 4.18 | 3.72 | 4.11 | 4.25 |
| UniTor_3 | 0.1687 | 0.2035 | 0.1610 | 0.1925 | 4.26 | 3.80 | 4.16 | 4.32 |
| UR-IW-1 | 0.2479 | 0.1767 | 0.2583 | 0.1816 | 4.28 | 4.34 | 3.98 | 4.28 |
| UR-IW-2 | 0.1536 | 0.1301 | 0.1724 | 0.1416 | 4.47 | 4.28 | 4.19 | 4.53 |
| UR-IW-3 | 0.2486 | 0.1774 | 0.2531 | 0.1801 | 4.26 | 4.29 | 4.04 | 4.36 |
| UR-IW-4 | 0.1370 | 0.1189 | 0.1577 | 0.1327 | 4.28 | 4.31 | 4.02 | 4.48 |
| UR-IW-5 | 0.2201 | 0.1861 | 0.2211 | 0.1874 | 4.13 | 4.04 | 3.95 | 4.24 |
| Synthia with first | 0.1702 | 0.1979 | 0.1653 | 0.1906 | 4.13 | 3.64 | 4.07 | 4.16 |
| RMC_append_snippets | 0.2201 | 0.2314 | 0.2126 | 0.2214 | 4.27 | 3.91 | 4.18 | 4.34 |
| bioinfo-0 | 0.1451 | 0.1731 | 0.1423 | 0.1692 | 4.21 | 3.73 | 4.22 | 4.29 |
| bioinfo-1 | 0.2064 | 0.1765 | 0.2124 | 0.1805 | 4.25 | 4.27 | 4.22 | 4.39 |
| bioinfo-2 | 0.1974 | 0.2141 | 0.1928 | 0.2068 | 4.31 | 4.07 | 4.20 | 4.36 |
| bioinfo-3 | 0.1869 | 0.2050 | 0.1791 | 0.1958 | 4.26 | 4.04 | 4.24 | 4.35 |
| bioinfo-4 | 0.1800 | 0.1569 | 0.1859 | 0.1619 | 4.31 | 4.09 | 4.20 | 4.41 |
| My system 1 | 0.0290 | 0.0401 | 0.0293 | 0.0410 | 1.16 | 0.94 | 1.11 | 1.21 |
| 3.PhaseB_System | 0.1448 | 0.1734 | 0.1408 | 0.1662 | 3.88 | 3.19 | 3.75 | 4.05 |
| edo | 0.1102 | 0.1041 | 0.1178 | 0.1104 | 2.59 | 2.75 | 2.91 | 3.55 |
| DB_vector_&_LLM | 0.2664 | 0.1489 | 0.2834 | 0.1573 | 4.28 | 4.48 | 3.94 | 4.28 |
| Machinen Results | 0.1649 | 0.1584 | 0.1695 | 0.1629 | 3.76 | 3.64 | 3.74 | 4.08 |
| Fleming-1 | 0.2186 | 0.1212 | 0.2329 | 0.1278 | 4.21 | 4.45 | 3.89 | 4.34 |
| AQAMS2 | 0.2198 | 0.1720 | 0.2292 | 0.1782 | 4.26 | 4.08 | 3.98 | 4.33 |
| IISR first submit | 0.1659 | 0.1797 | 0.1639 | 0.1782 | 4.27 | 3.94 | 4.27 | 4.34 |
| IISR 2nd submit | 0.2278 | 0.2538 | 0.2201 | 0.2460 | 4.22 | 3.87 | 4.20 | 4.26 |
| IISR 3rd submit | 0.1859 | 0.1979 | 0.1812 | 0.1930 | 4.24 | 3.96 | 4.27 | 4.39 |
| IISR 4th submit | 0.1789 | 0.2198 | 0.1710 | 0.2112 | 4.22 | 3.66 | 4.25 | 4.25 |
| dmiip2024 | 0.1724 | 0.2086 | 0.1645 | 0.2007 | 4.18 | 3.93 | 4.21 | 4.32 |
| dmiip2024_1 | 0.1789 | 0.2128 | 0.1716 | 0.2053 | 4.29 | 3.88 | 4.22 | 4.31 |
| dmiip2024_2 | 0.1744 | 0.2104 | 0.1635 | 0.1969 | 4.16 | 3.72 | 4.21 | 4.21 |
| dmiip2024_4 | 0.1606 | 0.2053 | 0.1496 | 0.1926 | 4.16 | 3.60 | 4.18 | 4.25 |
| dmiip2024_3 | 0.1487 | 0.1954 | 0.1356 | 0.1803 | 4.26 | 3.68 | 4.19 | 4.32 |
| IISR 5th submit | 0.2309 | 0.2466 | 0.2225 | 0.2383 | 4.31 | 3.89 | 4.27 | 4.33 |
| deepseek32b-me | 0.1237 | 0.1609 | 0.1194 | 0.1559 | 3.92 | 3.48 | 3.96 | 4.12 |
| deepseek32b-full | 0.1237 | 0.1609 | 0.1194 | 0.1559 | 3.92 | 3.48 | 3.96 | 4.12 |
| deepseek32b-f | 0.1878 | 0.1675 | 0.1874 | 0.1646 | 4.34 | 4.47 | 4.12 | 4.44 |
| phaseB-4 | 0.1745 | 0.1596 | 0.1736 | 0.1557 | 4.36 | 4.47 | 4.09 | 4.45 |
| phaseB-5 | 0.2152 | 0.1922 | 0.2167 | 0.1912 | 4.34 | 4.35 | 4.13 | 4.42 |
| Mistral7BIns10shots | 0.2425 | 0.2420 | 0.2365 | 0.2350 | 4.01 | 3.98 | 4.00 | 4.26 |
| GPT4turbo | 0.2158 | 0.2318 | 0.2077 | 0.2229 | 4.15 | 3.94 | 4.15 | 4.40 |
| GPTPrompt1sStyle2 | 0.1911 | 0.2229 | 0.1834 | 0.2128 | 4.08 | 3.85 | 3.99 | 4.27 |
| bious1 | 0.1914 | 0.2042 | 0.1881 | 0.1976 | 4.25 | 3.92 | 4.14 | 4.31 |
| bious2 | 0.1986 | 0.2071 | 0.1986 | 0.2060 | 4.32 | 3.89 | 4.20 | 4.34 |
| bious3 | 0.1954 | 0.2102 | 0.1919 | 0.2051 | 4.28 | 4.02 | 4.24 | 4.29 |
| GPTPrompt1sStyle3 | 0.1705 | 0.2116 | 0.1628 | 0.2028 | 4.22 | 3.67 | 4.20 | 4.24 |
| bious4 | 0.1983 | 0.2111 | 0.1959 | 0.2058 | 4.24 | 3.92 | 4.19 | 4.27 |
| bious5 | 0.1984 | 0.2084 | 0.1969 | 0.2046 | 4.22 | 3.93 | 4.21 | 4.31 |
| NLP-UTB4 | 0.0131 | 0.0164 | 0.0148 | 0.0186 | 0.55 | 0.64 | 0.69 | 0.79 |
| sp_lasigebiotm | 0.2382 | 0.1901 | 0.2407 | 0.1900 | 4.20 | 4.01 | 4.00 | 4.29 |
| lasigeBioTM | 0.2558 | 0.1891 | 0.2608 | 0.1908 | 4.16 | 4.21 | 3.96 | 4.29 |
| lasigeBioTM-onto-bl | 0.2538 | 0.1857 | 0.2620 | 0.1902 | 4.19 | 4.14 | 3.98 | 4.33 |
| lasigeBioTM-onto-sm | 0.0924 | 0.1048 | 0.0908 | 0.1029 | 3.47 | 2.95 | 3.48 | 3.89 |
| Fleming-4 | 0.2119 | 0.1026 | 0.2335 | 0.1121 | 4.14 | 4.39 | 3.86 | 4.35 |
| Fleming-5 | 0.2041 | 0.1174 | 0.2197 | 0.1245 | 4.20 | 4.36 | 3.79 | 4.41 |
| mistral | 0.2012 | 0.1638 | 0.2041 | 0.1640 | 4.36 | 4.26 | 4.15 | 4.47 |
| llama | 0.2519 | 0.2189 | 0.2473 | 0.2129 | 4.35 | 4.20 | 4.16 | 4.41 |
| dense | 0.2106 | 0.1814 | 0.2163 | 0.1833 | 4.25 | 4.12 | 4.06 | 4.33 |
| GPT4O | 0.2095 | 0.1574 | 0.2170 | 0.1604 | 4.14 | 4.04 | 3.87 | 4.25 |
| deepseek-r1:32b | 0.1961 | 0.1545 | 0.2049 | 0.1591 | 4.12 | 3.98 | 3.87 | 4.25 |
| deepseek-r1:8b | 0.1046 | 0.1162 | 0.1107 | 0.1217 | 3.78 | 3.34 | 3.71 | 4.00 |
| gpt 01 mini | 0.1468 | 0.1080 | 0.1636 | 0.1191 | 3.99 | 3.88 | 3.84 | 4.20 |
| 2025-DMIS-KU-1 | - | - | - | - | - | - | - | - |
| Fleming-2 | 0.2291 | 0.1312 | 0.2507 | 0.1428 | 4.20 | 4.32 | 3.76 | 4.31 |
| Fleming-3 | 0.2442 | 0.1184 | 0.2606 | 0.1250 | 4.21 | 4.45 | 3.94 | 4.39 |
| 2025-DMIS-KU-2 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-3 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-4 | - | - | - | - | - | - | - | - |
| 2025-DMIS-KU-5 | - | - | - | - | - | - | - | - |
| EP-1 | 0.2033 | 0.1947 | 0.2057 | 0.1965 | 4.19 | 4.12 | 4.11 | 4.32 |
| EP-2 | 0.1883 | 0.1814 | 0.1917 | 0.1840 | 4.18 | 4.05 | 4.09 | 4.27 |
| EP-3 | 0.2035 | 0.1929 | 0.2046 | 0.1921 | 4.22 | 4.11 | 4.19 | 4.33 |
| EP-4 | 0.2105 | 0.1955 | 0.2122 | 0.1962 | 4.27 | 4.07 | 4.16 | 4.36 |
| EP-5 | 0.2139 | 0.2028 | 0.2140 | 0.2027 | 4.28 | 4.09 | 4.21 | 4.35 |
| simple truncation | 0.0562 | 0.0469 | 0.0578 | 0.0477 | 0.89 | 0.88 | 0.87 | 0.89 |
| kmeans | 0.0557 | 0.0424 | 0.0586 | 0.0437 | 0.91 | 0.91 | 0.87 | 0.92 |
| similarity measures | 0.0564 | 0.0380 | 0.0600 | 0.0394 | 0.84 | 0.93 | 0.85 | 0.91 |
| extractive | 0.0521 | 0.0435 | 0.0538 | 0.0443 | 0.89 | 0.91 | 0.87 | 0.89 |
| abstractive | 0.0563 | 0.0381 | 0.0599 | 0.0397 | 0.87 | 0.93 | 0.85 | 0.91 |
| BioASQ_Baseline | - | - | - | - | - | - | - | - |