🔬 LLM Consistency Dashboard

Generated: 2025-12-09 14:04:46

Files: 5 | Checks/File: 50 | Iterations: 3 | Models: 3

📊 Overall Model Performance

Model Name Decision Consistency (%) Reasoning Consistency (%) Overall Consistency (%) Avg Tokens API Failure Rate (%) UNCLEAR Count ERROR Count
Llama-3.1-8B-Instruct 93.6% 98.7% 93.6% 2235 1.1% 36 8
DeepSeek-R1-Distill-Llama-8B 84.4% 99.1% 84.4% 2514 0.7% 108 5
Trinka GPT-OSS 20B 88.8% 68.8% 8.0% 2445 0.8% 39 6

🏆 Winner: Llama-3.1-8B-Instruct

Llama-3.1-8B-Instruct achieved the highest overall consistency score with minimal failures:

Performance Metrics:

Reliability Metrics:

Why Llama-3.1-8B-Instruct Wins:

Comparison with Runner-up (DeepSeek-R1-Distill-Llama-8B):

Metric Winner Runner-up Difference
Overall Consistency 93.6% 84.4% +9.2%
API Failure Rate 1.1% 0.7% 0.4%
Avg Tokens 2235 2514 -279
UNCLEAR Results 36 108 -72

📋 Recommendations:

📖 Understanding the Metrics

1. Decision Consistency (%)

Percentage of checks where all 3 runs produced the same PASS or FAIL result.

2. Reasoning Consistency (%)

Average text similarity of explanations across 3 runs (using sequence matching).

3. Overall Consistency (%)

Percentage of checks passing BOTH criteria:

This is the gold standard - both decisions AND reasoning are reproducible.

4. Avg Tokens per Check

Average tokens consumed per check. Directly impacts API costs.

Note: Llama-3.1-8B has a 4096 token limit (input + output combined).

5. API Failure Rate (%)

Percentage of API calls that failed (HTTP errors, timeouts, etc.).

6. UNCLEAR Results

Number of times the model's output couldn't be clearly classified as PASS or FAIL.

Review these cases to improve prompt clarity.

7. ERROR Results

Number of times the model produced no usable output (empty response, API error, etc.).

These indicate technical issues that need investigation.

📁 Per-File Performance

File Name Model Name Decision Consistency (%) Reasoning Consistency (%) Overall Consistency (%) Avg Tokens
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx DeepSeek-R1-Distill-Llama-8B 82.0% 100.0% 82.0% 2557
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx Llama-3.1-8B-Instruct 94.0% 98.0% 94.0% 2358
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx Trinka GPT-OSS 20B 84.0% 70.5% 8.0% 2529
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx DeepSeek-R1-Distill-Llama-8B 88.0% 100.0% 88.0% 2634
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx Llama-3.1-8B-Instruct 94.0% 100.0% 94.0% 2348
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx Trinka GPT-OSS 20B 90.0% 69.0% 4.0% 2534
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx DeepSeek-R1-Distill-Llama-8B 78.0% 97.3% 78.0% 2448
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx Llama-3.1-8B-Instruct 94.0% 96.0% 94.0% 2116
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx Trinka GPT-OSS 20B 90.0% 69.4% 12.0% 2358
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx DeepSeek-R1-Distill-Llama-8B 86.0% 98.0% 86.0% 2857
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx Llama-3.1-8B-Instruct 94.0% 99.3% 94.0% 2559
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx Trinka GPT-OSS 20B 86.0% 61.1% 2.0% 2806
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx DeepSeek-R1-Distill-Llama-8B 88.0% 100.0% 88.0% 2075
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx Llama-3.1-8B-Instruct 92.0% 100.0% 92.0% 1795
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx Trinka GPT-OSS 20B 94.0% 73.8% 14.0% 1998

📋 Detailed Results

Click any row to view request/response details

File Check ID Question Model Runs #PASS #FAIL #UNCLEAR #ERROR Consistent? Labels Avg Sim Min Sim Total Tokens Avg Tokens Details
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4105 085.7 No Mentions of Predefined Phrases in Title/Abstract ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,793 2931
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4105 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,953 2651
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4105 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,046 2682
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4104 084 III.10. No Non-English Text in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,316 2772
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4104 084 III.10. No Non-English Text in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,779 2593
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4104 084 III.10. No Non-English Text in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 86.65 79.97 8,155 2718
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4103 085.1 No Utstein, Delphi, Consensus Phrases... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,475 2825
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4103 085.1 No Utstein, Delphi, Consensus Phrases... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,782 2594
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4103 085.1 No Utstein, Delphi, Consensus Phrases... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,366 2789
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4102 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,754 2918
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4102 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,941 2647
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4102 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 62.2 9.75 8,300 2767
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4101 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,787 2929
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4101 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,899 2633
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4101 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Trinka GPT-OSS 20B 3 2 0 1 0 False PASS, PASS, UNCLEAR 62.89 36.81 8,369 2790
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4100 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,120 3040
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4100 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,013 2671
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4100 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 38.59 7.89 8,368 2789
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4099 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,519 3173
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4099 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,037 2679
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4099 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 51.62 15.66 8,612 2871
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4098 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,318 3106
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4098 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,932 2644
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4098 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 65.81 32.73 8,587 2862
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4097 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,219 3073
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4097 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,070 2690
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4097 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 77.6 65.31 8,295 2765
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4096 074.1 III.7. Not Content That Could Threaten Public Health... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,480 3160
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4096 074.1 III.7. Not Content That Could Threaten Public Health... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,493 2831
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4096 074.1 III.7. Not Content That Could Threaten Public Health... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 74.18 44.84 8,445 2815
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4095 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,469 2823
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4095 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,283 2761
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4095 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 81.86 48.93 8,204 2735
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4094 075 III.8. Not Study With Underlying Agenda... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,153 3051
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4094 075 III.8. Not Study With Underlying Agenda... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,016 2672
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4094 075 III.8. Not Study With Underlying Agenda... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 66.37 44.41 8,674 2891
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4093 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,901 2967
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4093 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,890 2630
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4093 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 92.4 77.2 8,320 2773
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4092 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,330 3110
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4092 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,956 2652
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4092 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 71.94 44.42 8,503 2834
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4091 067 III.5. Not Based on Pseudoscience... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,871 2957
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4091 067 III.5. Not Based on Pseudoscience... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,025 2675
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4091 067 III.5. Not Based on Pseudoscience... Trinka GPT-OSS 20B 3 2 0 1 0 False PASS, PASS, UNCLEAR 49.45 18.5 9,009 3003
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4090 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,072 3024
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4090 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,190 2730
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4090 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 81.75 71.54 8,427 2809
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4089 061.3 II.6.vi.C. References Are Formatted Consistently... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,957 3319
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4089 061.3 II.6.vi.C. References Are Formatted Consistently... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,781 2927
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4089 061.3 II.6.vi.C. References Are Formatted Consistently... Trinka GPT-OSS 20B 3 2 1 0 0 False FAIL, PASS, PASS 43.25 12.8 22,813 7604
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4088 068 III.6. Behavioral Genetics with Tenuous Health Connections... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,868 2956
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4088 068 III.6. Behavioral Genetics with Tenuous Health Connections... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,085 2695
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4088 068 III.6. Behavioral Genetics with Tenuous Health Connections... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 61.39 42.09 8,292 2764
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4087 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,084 3028
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4087 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,031 2677
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4087 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 37.88 4.45 8,141 2714
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4086 070 III.7.i. Not Study Challenging Vaccine Safety... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,910 2970
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4086 070 III.7.i. Not Study Challenging Vaccine Safety... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,091 2697
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4086 070 III.7.i. Not Study Challenging Vaccine Safety... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 54.76 31.89 8,127 2709
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4085 065 III.3. Not Self-Serving Research... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,810 3270
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4085 065 III.3. Not Self-Serving Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,145 2715
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4085 065 III.3. Not Self-Serving Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 47.68 18.87 8,990 2997
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4084 066 III.4. Not A Political Paper... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,691 2897
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4084 066 III.4. Not A Political Paper... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,001 2667
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4084 066 III.4. Not A Political Paper... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 60.44 40.56 8,251 2750
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4083 064 III.2. Not Study of Smoking or Vaping... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,901 2967
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4083 064 III.2. Not Study of Smoking or Vaping... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,280 2760
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4083 064 III.2. Not Study of Smoking or Vaping... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 93.32 80.06 8,345 2782
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4082 062.1 II.6.vi References Are Present and Proper... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,900 3300
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4082 062.1 II.6.vi References Are Present and Proper... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 8,571 2857
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4082 062.1 II.6.vi References Are Present and Proper... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 65.38 17.38 9,099 3033
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4081 063 III.1. No Identifiable Information (Photographs and Names)... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,702 3234
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4081 063 III.1. No Identifiable Information (Photographs and Names)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,799 2933
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4081 063 III.1. No Identifiable Information (Photographs and Names)... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 40.74 10.2 9,588 3196
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4080 062 II.6.vi.D. References Are Not Present in Main Text... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,616 2872
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4080 062 II.6.vi.D. References Are Not Present in Main Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,109 2703
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4080 062 II.6.vi.D. References Are Not Present in Main Text... Trinka GPT-OSS 20B 3 1 0 2 0 False PASS, UNCLEAR, UNCLEAR 54.1 28.74 9,178 3059
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4079 061.2 II.6.vi.B. References Are in the Main Manuscript... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,477 3159
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4079 061.2 II.6.vi.B. References Are in the Main Manuscript... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,310 2770
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4079 061.2 II.6.vi.B. References Are in the Main Manuscript... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 64.3 44.86 8,942 2981
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4078 060 II.6.v.B. Results Are Presented in Figures or Tables... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,819 3273
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4078 060 II.6.v.B. Results Are Presented in Figures or Tables... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,066 3022
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4078 060 II.6.v.B. Results Are Presented in Figures or Tables... Trinka GPT-OSS 20B 3 1 2 0 0 False FAIL, PASS, FAIL 56.21 30.15 9,962 3321
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4077 057 II.6.iv.B. Methodology is Clearly Described... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,080 3360
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4077 057 II.6.iv.B. Methodology is Clearly Described... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,150 3050
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4077 057 II.6.iv.B. Methodology is Clearly Described... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.68 45.34 9,982 3327
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4076 061.1 II.6.vi.A. References Are in a Separate Section... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,973 2991
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4076 061.1 II.6.vi.A. References Are in a Separate Section... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,436 2812
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4076 061.1 II.6.vi.A. References Are in a Separate Section... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 54.58 31.57 8,988 2996
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4075 059 II.6.v.A. Results Are Presented In Text... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,669 3223
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4075 059 II.6.v.A. Results Are Presented In Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,207 3069
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4075 059 II.6.v.A. Results Are Presented In Text... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 67.7 51.24 10,022 3341
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4074 056 II.6.iv.A. Methodology Is A Separate Section... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,171 3057
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4074 056 II.6.iv.A. Methodology Is A Separate Section... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,937 2979
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4074 056 II.6.iv.A. Methodology Is A Separate Section... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 85.9 65.72 9,371 3124
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4073 045 II.3.i. Authors Are Not High-School, Undergraduate, or Master's Students... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,871 957
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4073 045 II.3.i. Authors Are Not High-School, Undergraduate, or Master's Students... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 16,347 5449
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4073 045 II.3.i. Authors Are Not High-School, Undergraduate, or Master's Students... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 76.5 29.51 2,337 779
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4072 054.1 II.6 Main Text and Supplemental Files Are Proper... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 8,292 2764
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4072 054.1 II.6 Main Text and Supplemental Files Are Proper... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,995 2665
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4072 054.1 II.6 Main Text and Supplemental Files Are Proper... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 57.69 22.2 8,693 2898
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4071 054 II.6.iii. Submission Does Not Have Tracked Changes or Comment... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,911 637
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4071 054 II.6.iii. Submission Does Not Have Tracked Changes or Comment... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,482 494
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4071 054 II.6.iii. Submission Does Not Have Tracked Changes or Comment... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 67.73 37.78 2,232 744
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4070 051 II.5.ii. Manuscript Abstract Is Clearly Separated... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 7,059 2353
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4070 051 II.5.ii. Manuscript Abstract Is Clearly Separated... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,570 2190
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4070 051 II.5.ii. Manuscript Abstract Is Clearly Separated... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 55.33 32.99 6,892 2297
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4069 052 II.6.i. Submission Does Not Include Cover Letter... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,736 2912
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4069 052 II.6.i. Submission Does Not Include Cover Letter... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,926 2642
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4069 052 II.6.i. Submission Does Not Include Cover Letter... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 65.58 23.38 8,293 2764
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4068 053 II.6.ii. Submission Does Not Have Editor-Addressed Content... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,649 2883
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4068 053 II.6.ii. Submission Does Not Have Editor-Addressed Content... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,857 2619
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4068 053 II.6.ii. Submission Does Not Have Editor-Addressed Content... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.81 54.09 8,141 2714
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4067 039 II.2.iii. No Anonymous Authors... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,908 636
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4067 039 II.2.iii. No Anonymous Authors... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,365 455
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4067 039 II.2.iii. No Anonymous Authors... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 93.57 85.33 1,808 603
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4066 040 II.2.iv. No Pseudonyms Among Authors... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,286 762
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4066 040 II.2.iv. No Pseudonyms Among Authors... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,518 506
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4066 040 II.2.iv. No Pseudonyms Among Authors... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 88.07 68.48 2,099 700
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4065 041 II.2.v. No Provisional Authorship... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,884 628
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4065 041 II.2.v. No Provisional Authorship... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,752 584
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4065 041 II.2.v. No Provisional Authorship... Trinka GPT-OSS 20B 3 1 0 2 0 False UNCLEAR, PASS, UNCLEAR 87.36 74.69 2,076 692
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4064 037 II.2.i. At Least One Author Present... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 2,256 752
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4064 037 II.2.i. At Least One Author Present... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,314 438
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4064 037 II.2.i. At Least One Author Present... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 83.6 50.79 1,781 594
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4063 033 II.1.ii. Title Has No Full References to Other Papers... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,563 521
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4063 033 II.1.ii. Title Has No Full References to Other Papers... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,203 401
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4063 033 II.1.ii. Title Has No Full References to Other Papers... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 93.22 88.2 1,522 507
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4062 038 II.2.ii. AI Tools Not Listed as Authors... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,091 697
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4062 038 II.2.ii. AI Tools Not Listed as Authors... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,479 493
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4062 038 II.2.ii. AI Tools Not Listed as Authors... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 97.74 95.57 1,982 661
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4061 032 II.1.i. Title Has No Full URLs... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,878 626
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4061 032 II.1.i. Title Has No Full URLs... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,098 366
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4061 032 II.1.i. Title Has No Full URLs... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,359 453
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4060 034 II.1.iii. Title Does Not Have Clickbait Language... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,175 725
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4060 034 II.1.iii. Title Does Not Have Clickbait Language... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,380 460
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4060 034 II.1.iii. Title Does Not Have Clickbait Language... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 92.06 76.17 1,749 583
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4059 030 I.2.ii.E. Not Clinical Research Design Protocols (bioRxiv only)... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,925 2975
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4059 030 I.2.ii.E. Not Clinical Research Design Protocols (bioRxiv only)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,010 2670
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4059 030 I.2.ii.E. Not Clinical Research Design Protocols (bioRxiv only)... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 77.61 61.98 8,323 2774
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4058 029 I.2.ii.D. Not Studies Linking Human gene(s) variant(s) with disease(s) (incl... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,201 3067
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4058 029 I.2.ii.D. Not Studies Linking Human gene(s) variant(s) with disease(s) (incl... Llama-3.1-8B-Instruct 3 0 2 0 1 False ERROR, FAIL, FAIL 0.0 0.0 5,708 1903
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4058 029 I.2.ii.D. Not Studies Linking Human gene(s) variant(s) with disease(s) (incl... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.93 59.32 9,070 3023
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4057 025 I.2.ii.A. Not Studies Diagnostic Tools or Medical Equipment (bioRxiv only)... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,744 3248
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4057 025 I.2.ii.A. Not Studies Diagnostic Tools or Medical Equipment (bioRxiv only)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,535 2845
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4057 025 I.2.ii.A. Not Studies Diagnostic Tools or Medical Equipment (bioRxiv only)... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 68.78 49.3 9,784 3261
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4056 028 I.2.ii.C. Not Clinical Treatment Claims / Clinical Trials (bioRxiv only)... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,952 2984
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4056 028 I.2.ii.C. Not Clinical Treatment Claims / Clinical Trials (bioRxiv only)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,905 2635
logs-a7wyezuhrufpzef0jfrgcdlh-BIORXIV_2025_686274-1-2025-12-05T14-34-04-645Z.xlsx 4056 028 I.2.ii.C. Not Clinical Treatment Claims / Clinical Trials (bioRxiv only)... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 51.83 27.75 8,390 2797
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3011 107 V.3.iii. Not An Educational Intervention Trial... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,934 1978
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3011 107 V.3.iii. Not An Educational Intervention Trial... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,656 1552
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3011 107 V.3.iii. Not An Educational Intervention Trial... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 80.59 43.43 5,429 1810
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3010 110 V.6. Competing Interest Statement Present And Proper... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 2,007 669
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3010 110 V.6. Competing Interest Statement Present And Proper... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,251 417
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3010 110 V.6. Competing Interest Statement Present And Proper... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 85.71 57.14 1,680 560
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3009 108 V.4. Funding Statement Present And Proper... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,419 473
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3009 108 V.4. Funding Statement Present And Proper... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,095 365
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3009 108 V.4. Funding Statement Present And Proper... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 47.32 18.31 1,777 592
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3008 109 V.5. Data Availability Statement Present and Proper... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,914 638
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3008 109 V.5. Data Availability Statement Present and Proper... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,194 398
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3008 109 V.5. Data Availability Statement Present and Proper... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 54.2 30.78 1,848 616
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3007 101 V.1.i. Ethics Statement Requirement... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 5,475 1825
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3007 101 V.1.i. Ethics Statement Requirement... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 5,673 1891
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3007 101 V.1.i. Ethics Statement Requirement... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 67.25 27.95 5,494 1831
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3006 105 V.3.i. Clinical Trial ID Requirement... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 5,487 1829
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3006 105 V.3.i. Clinical Trial ID Requirement... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 4,911 1637
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3006 105 V.3.i. Clinical Trial ID Requirement... Trinka GPT-OSS 20B 3 2 0 1 0 False PASS, UNCLEAR, PASS 73.08 55.27 5,670 1890
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3005 103 V.1.iii. Cohort-Specific Details Provided In Submission... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 2,355 785
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3005 103 V.1.iii. Cohort-Specific Details Provided In Submission... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,260 420
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3005 103 V.1.iii. Cohort-Specific Details Provided In Submission... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 48.82 10.94 2,637 879
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3004 100.1 IV.5. Section IV Report Output - medRxiv-Specific Content... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 5,454 1818
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3004 100.1 IV.5. Section IV Report Output - medRxiv-Specific Content... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,620 1540
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3004 100.1 IV.5. Section IV Report Output - medRxiv-Specific Content... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.03 21.39 6,677 2226
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3003 106 V.3.ii. Clinical Trial ID Is Present And Is From An Acceptable Source... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,276 2092
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3003 106 V.3.ii. Clinical Trial ID Is Present And Is From An Acceptable Source... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,607 1869
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3003 106 V.3.ii. Clinical Trial ID Is Present And Is From An Acceptable Source... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 80.95 69.32 6,202 2067
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3002 104 V.2. Vulnerable Groups Not Mentioned in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,472 1824
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3002 104 V.2. Vulnerable Groups Not Mentioned in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,716 1572
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3002 104 V.2. Vulnerable Groups Not Mentioned in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 54.84 23.44 5,276 1759
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3001 100.2 (IV.6.RW) IV.6. Reasoning (Why’s):... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 6,684 2228
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3001 100.2 (IV.6.RW) IV.6. Reasoning (Why’s):... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,941 1647
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3001 100.2 (IV.6.RW) IV.6. Reasoning (Why’s):... Trinka GPT-OSS 20B 3 0 1 2 0 False FAIL, UNCLEAR, UNCLEAR 38.21 3.81 5,688 1896
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3000 097 IV.1.xii. No Identity-Revealing Patient Identifiers In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,033 3011
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3000 097 IV.1.xii. No Identity-Revealing Patient Identifiers In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,184 2728
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 3000 097 IV.1.xii. No Identity-Revealing Patient Identifiers In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.0 48.12 8,449 2816
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2999 095 IV.1.x. No Mentions Of Hospital Names and Locations That Pose Privacy Risk... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,874 2958
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2999 095 IV.1.x. No Mentions Of Hospital Names and Locations That Pose Privacy Risk... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,205 2735
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2999 095 IV.1.x. No Mentions Of Hospital Names and Locations That Pose Privacy Risk... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 75.04 54.83 8,761 2920
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2998 099 IV.2. Is Not Research Related to Stem Cell Therapies... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,306 3102
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2998 099 IV.2. Is Not Research Related to Stem Cell Therapies... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,181 2727
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2998 099 IV.2. Is Not Research Related to Stem Cell Therapies... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 79.03 68.26 8,565 2855
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2997 100 IV.3. Not Research Related to Challenge Trials... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,387 3129
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2997 100 IV.3. Not Research Related to Challenge Trials... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,163 2721
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2997 100 IV.3. Not Research Related to Challenge Trials... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 63.91 37.61 8,476 2825
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2996 098 IV.1.xiii. No Mention of Privacy-Compromising Travel Histories In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,090 3030
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2996 098 IV.1.xiii. No Mention of Privacy-Compromising Travel Histories In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,235 2745
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2996 098 IV.1.xiii. No Mention of Privacy-Compromising Travel Histories In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 74.39 42.21 8,572 2857
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2995 096 IV.1.xi. No Plans of Apartments or Buildings That Pose Identification Risks... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,366 3122
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2995 096 IV.1.xi. No Plans of Apartments or Buildings That Pose Identification Risks... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,457 2819
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2995 096 IV.1.xi. No Plans of Apartments or Buildings That Pose Identification Risks... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 60.07 38.17 8,404 2801
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2994 094 IV.1.ix. No Identity-Revealing Mentions of Professional Occupation and Relat... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,147 3049
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2994 094 IV.1.ix. No Identity-Revealing Mentions of Professional Occupation and Relat... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,295 2765
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2994 094 IV.1.ix. No Identity-Revealing Mentions of Professional Occupation and Relat... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 42.13 12.22 8,791 2930
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2993 093 IV.1.viii. No Identity-Revealing Details Regarding Ancestry, Country of Orig... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,847 2949
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2993 093 IV.1.viii. No Identity-Revealing Details Regarding Ancestry, Country of Orig... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,133 2711
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2993 093 IV.1.viii. No Identity-Revealing Details Regarding Ancestry, Country of Orig... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 58.06 31.24 8,693 2898
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2992 092 IV.1.vii. No Identity-Revealing Dates and Details In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,961 2987
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2992 092 IV.1.vii. No Identity-Revealing Dates and Details In Submission... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,154 2718
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2992 092 IV.1.vii. No Identity-Revealing Dates and Details In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 41.89 7.94 8,451 2817
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2991 084 III.10. No Non-English Text in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,511 2837
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2991 084 III.10. No Non-English Text in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,974 2658
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2991 084 III.10. No Non-English Text in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 56.97 27.46 13,454 4485
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2990 090 IV.1.v. No Pedigrees Or Specific Family Relationships In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,961 2987
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2990 090 IV.1.v. No Pedigrees Or Specific Family Relationships In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,043 2681
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2990 090 IV.1.v. No Pedigrees Or Specific Family Relationships In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 87.0 80.44 8,345 2782
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2989 087 IV.1.ii. No Precise Ages In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,973 2991
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2989 087 IV.1.ii. No Precise Ages In Submission... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,127 2709
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2989 087 IV.1.ii. No Precise Ages In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 81.57 50.67 8,425 2808
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2988 089 IV.1.iv. No Detailed Clinical Histories In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,237 3079
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2988 089 IV.1.iv. No Detailed Clinical Histories In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,247 2749
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2988 089 IV.1.iv. No Detailed Clinical Histories In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 67.27 44.7 8,306 2769
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2987 088 IV.1.iii. No Sample or Patient IDs in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,162 3054
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2987 088 IV.1.iii. No Sample or Patient IDs in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,058 2686
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2987 088 IV.1.iii. No Sample or Patient IDs in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 57.33 31.65 8,920 2973
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2986 085.1 No Utstein, Delphi, Consensus Phrases... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,082 2694
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2986 085.1 No Utstein, Delphi, Consensus Phrases... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,404 2468
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2986 085.1 No Utstein, Delphi, Consensus Phrases... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 90.97 72.92 7,645 2548
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2985 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,018 3006
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2985 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,202 2734
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2985 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.35 41.17 8,601 2867
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2984 085.7 No Mentions of Predefined Phrases in Title/Abstract ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,064 2688
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2984 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 7,476 2492
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2984 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 93.42 89.34 7,573 2524
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2983 086 IV.1.i. No Identity-Revealing Photographs in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,234 3078
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2983 086 IV.1.i. No Identity-Revealing Photographs in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,472 2824
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2983 086 IV.1.i. No Identity-Revealing Photographs in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 56.76 29.71 8,562 2854
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2982 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,438 3146
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2982 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,217 2739
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2982 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 82.55 71.35 8,430 2810
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2981 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,051 3017
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2981 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,271 2757
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2981 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 60.43 34.18 8,837 2946
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2980 075 III.8. Not Study With Underlying Agenda... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,438 3146
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2980 075 III.8. Not Study With Underlying Agenda... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,205 2735
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2980 075 III.8. Not Study With Underlying Agenda... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 53.16 8.28 8,558 2853
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2979 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,423 3141
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2979 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,142 2714
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2979 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 88.18 81.91 8,430 2810
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2978 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,204 3068
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2978 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,133 2711
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2978 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 95.53 93.04 8,349 2783
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2977 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,492 3164
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2977 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,211 2737
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2977 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 95.39 93.09 8,506 2835
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2976 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,967 2989
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2976 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,184 2728
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2976 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 76.18 44.0 8,664 2888
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2975 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,949 2983
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2975 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,316 2772
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2975 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 82.46 47.38 8,599 2866
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2974 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,171 3057
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2974 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,235 2745
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2974 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 78.56 52.06 8,543 2848
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2973 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,132 3044
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2973 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,409 2803
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2973 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 83.47 66.95 9,150 3050
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2972 074.1 III.7. Not Content That Could Threaten Public Health... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,084 3028
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2972 074.1 III.7. Not Content That Could Threaten Public Health... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,547 2849
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2972 074.1 III.7. Not Content That Could Threaten Public Health... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 67.43 20.72 9,231 3077
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2971 070 III.7.i. Not Study Challenging Vaccine Safety... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,366 3122
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2971 070 III.7.i. Not Study Challenging Vaccine Safety... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,256 2752
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2971 070 III.7.i. Not Study Challenging Vaccine Safety... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.28 22.65 8,385 2795
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2970 062 II.6.vi.D. References Are Not Present in Main Text... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,224 1408
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2970 062 II.6.vi.D. References Are Not Present in Main Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,663 1221
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2970 062 II.6.vi.D. References Are Not Present in Main Text... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.29 41.21 4,184 1395
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2969 068 III.6. Behavioral Genetics with Tenuous Health Connections... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,654 3218
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2969 068 III.6. Behavioral Genetics with Tenuous Health Connections... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,178 2726
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2969 068 III.6. Behavioral Genetics with Tenuous Health Connections... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.48 51.53 8,622 2874
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2968 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,865 2955
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2968 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,301 2767
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2968 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.23 50.09 8,543 2848
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2967 065 III.3. Not Self-Serving Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,555 3185
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2967 065 III.3. Not Self-Serving Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,370 2790
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2967 065 III.3. Not Self-Serving Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 55.22 23.66 9,080 3027
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2966 064 III.2. Not Study of Smoking or Vaping... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,066 3022
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2966 064 III.2. Not Study of Smoking or Vaping... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,139 2713
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2966 064 III.2. Not Study of Smoking or Vaping... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 38.99 3.96 8,633 2878
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2965 062.1 II.6.vi References Are Present and Proper... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 6,033 2011
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2965 062.1 II.6.vi References Are Present and Proper... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 5,133 1711
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2965 062.1 II.6.vi References Are Present and Proper... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 5,514 1838
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2964 067 III.5. Not Based on Pseudoscience... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,102 3034
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2964 067 III.5. Not Based on Pseudoscience... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,382 2794
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2964 067 III.5. Not Based on Pseudoscience... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 50.35 9.65 8,913 2971
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2963 066 III.4. Not A Political Paper... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,952 2984
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2963 066 III.4. Not A Political Paper... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,115 2705
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2963 066 III.4. Not A Political Paper... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 53.05 23.23 8,361 2787
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2962 059 II.6.v.A. Results Are Presented In Text... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,258 3086
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2962 059 II.6.v.A. Results Are Presented In Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,790 2930
logs-b24u64sjh9blyhvxn375xdf9-MEDRXIV_2025_329782-2-2025-12-05T14-32-18-483Z.xlsx 2962 059 II.6.v.A. Results Are Presented In Text... Trinka GPT-OSS 20B 3 0 2 1 0 False FAIL, FAIL, UNCLEAR 69.81 50.62 9,271 3090
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3653 084 III.10. No Non-English Text in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,432 3144
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3653 084 III.10. No Non-English Text in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,772 2924
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3653 084 III.10. No Non-English Text in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 55.53 18.95 9,078 3026
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3652 085.1 No Utstein, Delphi, Consensus Phrases... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,417 3139
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3652 085.1 No Utstein, Delphi, Consensus Phrases... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,787 2929
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3652 085.1 No Utstein, Delphi, Consensus Phrases... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 96.36 91.17 9,260 3087
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3651 085.7 No Mentions of Predefined Phrases in Title/Abstract ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,375 3125
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3651 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,715 2905
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3651 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,039 3013
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3650 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,215 3405
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3650 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,865 2955
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3650 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 85.12 56.09 9,330 3110
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3649 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,531 3177
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3649 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,006 3002
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3649 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 69.84 50.04 9,483 3161
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3648 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,771 3257
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3648 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,895 2965
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3648 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 49.47 23.16 9,668 3223
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3647 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,056 3352
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3647 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,829 2943
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3647 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 85.53 72.27 9,697 3232
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3646 075 III.8. Not Study With Underlying Agenda... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,365 3455
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3646 075 III.8. Not Study With Underlying Agenda... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,793 2931
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3646 075 III.8. Not Study With Underlying Agenda... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 58.51 28.39 9,449 3150
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3645 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,149 3383
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3645 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,832 2944
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3645 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 87.34 80.39 9,500 3167
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3644 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,395 3465
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3644 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,961 2987
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3644 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.26 58.66 9,310 3103
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3643 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,224 3408
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3643 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,841 2947
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3643 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 55.32 32.96 9,505 3168
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3642 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,269 3423
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3642 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,907 2969
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3642 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 75.72 48.16 9,523 3174
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3641 074.1 III.7. Not Content That Could Threaten Public Health... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,582 3194
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3641 074.1 III.7. Not Content That Could Threaten Public Health... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,102 3034
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3641 074.1 III.7. Not Content That Could Threaten Public Health... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,363 3121
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3640 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,278 3426
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3640 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,958 2986
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3640 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 42.7 13.46 9,365 3122
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3639 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,927 3309
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3639 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,877 2959
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3639 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 71.33 44.34 9,482 3161
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3638 070 III.7.i. Not Study Challenging Vaccine Safety... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,846 3282
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3638 070 III.7.i. Not Study Challenging Vaccine Safety... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,787 2929
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3638 070 III.7.i. Not Study Challenging Vaccine Safety... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 76.29 64.04 9,066 3022
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3637 061.3 II.6.vi.C. References Are Formatted Consistently... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 5,919 1973
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3637 061.3 II.6.vi.C. References Are Formatted Consistently... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,467 1489
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3637 061.3 II.6.vi.C. References Are Formatted Consistently... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 57.18 22.45 7,431 2477
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3636 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... DeepSeek-R1-Distill-Llama-8B 3 0 0 0 3 False ERROR, ERROR, ERROR 0.0 0.0 0 0
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3636 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Llama-3.1-8B-Instruct 3 0 0 0 3 False ERROR, ERROR, ERROR 0.0 0.0 0 0
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3636 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 68.33 13.09 9,281 3094
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3635 068 III.6. Behavioral Genetics with Tenuous Health Connections... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,843 3281
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3635 068 III.6. Behavioral Genetics with Tenuous Health Connections... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,973 2991
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3635 068 III.6. Behavioral Genetics with Tenuous Health Connections... Trinka GPT-OSS 20B 3 2 0 0 1 False ERROR, PASS, PASS 0.0 0.0 6,321 2107
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3634 067 III.5. Not Based on Pseudoscience... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,981 3327
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3634 067 III.5. Not Based on Pseudoscience... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,042 3014
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3634 067 III.5. Not Based on Pseudoscience... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.32 47.47 9,914 3305
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3633 065 III.3. Not Self-Serving Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,620 3540
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3633 065 III.3. Not Self-Serving Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,769 2923
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3633 065 III.3. Not Self-Serving Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 50.79 24.35 9,677 3226
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3632 064 III.2. Not Study of Smoking or Vaping... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,987 3329
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3632 064 III.2. Not Study of Smoking or Vaping... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,880 2960
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3632 064 III.2. Not Study of Smoking or Vaping... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 43.58 8.88 9,296 3099
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3631 066 III.4. Not A Political Paper... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,864 3288
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3631 066 III.4. Not A Political Paper... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,829 2943
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3631 066 III.4. Not A Political Paper... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 94.69 86.39 9,232 3077
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3630 063 III.1. No Identifiable Information (Photographs and Names)... DeepSeek-R1-Distill-Llama-8B 3 2 0 0 1 False PASS, PASS, ERROR 66.67 0.0 6,512 2171
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3630 063 III.1. No Identifiable Information (Photographs and Names)... Llama-3.1-8B-Instruct 3 0 0 0 3 False ERROR, ERROR, ERROR 0.0 0.0 0 0
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3630 063 III.1. No Identifiable Information (Photographs and Names)... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 40.04 9.03 9,212 3071
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3629 062.1 II.6.vi References Are Present and Proper... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 10,602 3534
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3629 062.1 II.6.vi References Are Present and Proper... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,189 3063
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3629 062.1 II.6.vi References Are Present and Proper... Trinka GPT-OSS 20B 3 0 0 1 2 False ERROR, ERROR, UNCLEAR 0.0 0.0 3,193 1064
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3628 062 II.6.vi.D. References Are Not Present in Main Text... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,213 3071
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3628 062 II.6.vi.D. References Are Not Present in Main Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,964 2988
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3628 062 II.6.vi.D. References Are Not Present in Main Text... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 56.74 31.56 9,891 3297
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3627 061.1 II.6.vi.A. References Are in a Separate Section... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 4,779 1593
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3627 061.1 II.6.vi.A. References Are in a Separate Section... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,305 1435
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3627 061.1 II.6.vi.A. References Are in a Separate Section... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 65.79 39.12 5,093 1698
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3626 059 II.6.v.A. Results Are Presented In Text... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,279 1093
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3626 059 II.6.v.A. Results Are Presented In Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,766 922
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3626 059 II.6.v.A. Results Are Presented In Text... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 86.22 58.66 3,626 1209
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3625 057 II.6.iv.B. Methodology is Clearly Described... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,597 2199
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3625 057 II.6.iv.B. Methodology is Clearly Described... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,208 1736
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3625 057 II.6.iv.B. Methodology is Clearly Described... Trinka GPT-OSS 20B 3 2 0 0 1 False PASS, ERROR, PASS 46.93 0.0 4,690 1563
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3624 060 II.6.v.B. Results Are Presented in Figures or Tables... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 3,102 1034
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3624 060 II.6.v.B. Results Are Presented in Figures or Tables... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,595 865
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3624 060 II.6.v.B. Results Are Presented in Figures or Tables... Trinka GPT-OSS 20B 3 1 2 0 0 False FAIL, PASS, FAIL 71.82 39.34 3,668 1223
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3623 061.2 II.6.vi.B. References Are in the Main Manuscript... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 5,019 1673
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3623 061.2 II.6.vi.B. References Are in the Main Manuscript... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,158 1386
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3623 061.2 II.6.vi.B. References Are in the Main Manuscript... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 56.19 30.54 4,981 1660
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3622 054.1 II.6 Main Text and Supplemental Files Are Proper... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,588 3196
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3622 054.1 II.6 Main Text and Supplemental Files Are Proper... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,904 2968
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3622 054.1 II.6 Main Text and Supplemental Files Are Proper... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 51.05 24.9 9,584 3195
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3621 056 II.6.iv.A. Methodology Is A Separate Section... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,598 1866
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3621 056 II.6.iv.A. Methodology Is A Separate Section... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,040 1680
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3621 056 II.6.iv.A. Methodology Is A Separate Section... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 74.5 37.64 5,418 1806
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3620 053 II.6.ii. Submission Does Not Have Editor-Addressed Content... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,861 3287
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3620 053 II.6.ii. Submission Does Not Have Editor-Addressed Content... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,853 2951
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3620 053 II.6.ii. Submission Does Not Have Editor-Addressed Content... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 89.97 70.73 9,064 3021
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3619 052 II.6.i. Submission Does Not Include Cover Letter... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,648 3216
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3619 052 II.6.i. Submission Does Not Include Cover Letter... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,883 2961
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3619 052 II.6.i. Submission Does Not Include Cover Letter... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 85.47 66.91 9,155 3052
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3618 054 II.6.iii. Submission Does Not Have Tracked Changes or Comment... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,274 1758
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3618 054 II.6.iii. Submission Does Not Have Tracked Changes or Comment... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,797 1599
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3618 054 II.6.iii. Submission Does Not Have Tracked Changes or Comment... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 65.49 48.24 5,070 1690
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3617 045 II.3.i. Authors Are Not High-School, Undergraduate, or Master's Students... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 2,469 823
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3617 045 II.3.i. Authors Are Not High-School, Undergraduate, or Master's Students... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,917 639
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3617 045 II.3.i. Authors Are Not High-School, Undergraduate, or Master's Students... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 92.76 89.14 2,208 736
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3616 041 II.2.v. No Provisional Authorship... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,998 666
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3616 041 II.2.v. No Provisional Authorship... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,785 595
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3616 041 II.2.v. No Provisional Authorship... Trinka GPT-OSS 20B 3 2 0 1 0 False UNCLEAR, PASS, PASS 52.15 28.15 2,084 695
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3615 040 II.2.iv. No Pseudonyms Among Authors... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,709 903
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3615 040 II.2.iv. No Pseudonyms Among Authors... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,647 549
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3615 040 II.2.iv. No Pseudonyms Among Authors... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.2 57.75 2,343 781
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3614 051 II.5.ii. Manuscript Abstract Is Clearly Separated... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,698 1566
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3614 051 II.5.ii. Manuscript Abstract Is Clearly Separated... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,966 1322
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3614 051 II.5.ii. Manuscript Abstract Is Clearly Separated... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 81.16 49.59 4,336 1445
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3613 029 I.2.ii.D. Not Studies Linking Human gene(s) variant(s) with disease(s) (incl... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,621 3207
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3613 029 I.2.ii.D. Not Studies Linking Human gene(s) variant(s) with disease(s) (incl... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,925 2975
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3613 029 I.2.ii.D. Not Studies Linking Human gene(s) variant(s) with disease(s) (incl... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 63.16 21.26 9,528 3176
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3612 030 I.2.ii.E. Not Clinical Research Design Protocols (bioRxiv only)... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,921 3307
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3612 030 I.2.ii.E. Not Clinical Research Design Protocols (bioRxiv only)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,661 2887
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3612 030 I.2.ii.E. Not Clinical Research Design Protocols (bioRxiv only)... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 57.94 2.18 9,015 3005
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3611 038 II.2.ii. AI Tools Not Listed as Authors... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 2,451 817
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3611 038 II.2.ii. AI Tools Not Listed as Authors... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,641 547
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3611 038 II.2.ii. AI Tools Not Listed as Authors... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 99.5 98.51 2,120 707
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3610 034 II.1.iii. Title Does Not Have Clickbait Language... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,851 617
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3610 034 II.1.iii. Title Does Not Have Clickbait Language... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,389 463
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3610 034 II.1.iii. Title Does Not Have Clickbait Language... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 90.63 72.5 1,673 558
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3609 039 II.2.iii. No Anonymous Authors... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,923 641
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3609 039 II.2.iii. No Anonymous Authors... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,512 504
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3609 039 II.2.iii. No Anonymous Authors... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 86.57 59.72 1,977 659
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3608 037 II.2.i. At Least One Author Present... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,887 629
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3608 037 II.2.i. At Least One Author Present... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,503 501
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3608 037 II.2.i. At Least One Author Present... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 98.89 96.89 1,890 630
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3607 032 II.1.i. Title Has No Full URLs... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,503 501
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3607 032 II.1.i. Title Has No Full URLs... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,062 354
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3607 032 II.1.i. Title Has No Full URLs... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 74.92 58.01 1,352 451
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3606 033 II.1.ii. Title Has No Full References to Other Papers... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,764 588
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3606 033 II.1.ii. Title Has No Full References to Other Papers... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,179 393
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3606 033 II.1.ii. Title Has No Full References to Other Papers... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,449 483
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3605 026.51 I.2.ii.A.01.e. Antiviral Compounds and Materials With Clinical Applicatio... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 10,122 3374
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3605 026.51 I.2.ii.A.01.e. Antiviral Compounds and Materials With Clinical Applicatio... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,748 2916
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3605 026.51 I.2.ii.A.01.e. Antiviral Compounds and Materials With Clinical Applicatio... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 81.92 45.76 9,614 3205
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3604 025 I.2.ii.A. Not Studies Diagnostic Tools or Medical Equipment (bioRxiv only)... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,173 3391
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3604 025 I.2.ii.A. Not Studies Diagnostic Tools or Medical Equipment (bioRxiv only)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,889 2963
logs-gn2aa2i5f8ay60z4awnkpt6o-BIORXIV_2023_538397-1-2025-12-05T14-39-52-284Z.xlsx 3604 025 I.2.ii.A. Not Studies Diagnostic Tools or Medical Equipment (bioRxiv only)... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 57.33 26.64 10,267 3422
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3487 109 V.5. Data Availability Statement Present and Proper... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,914 638
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3487 109 V.5. Data Availability Statement Present and Proper... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,194 398
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3487 109 V.5. Data Availability Statement Present and Proper... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 51.82 27.34 1,855 618
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3485 110 V.6. Competing Interest Statement Present And Proper... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 2,007 669
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3485 110 V.6. Competing Interest Statement Present And Proper... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,251 417
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3485 110 V.6. Competing Interest Statement Present And Proper... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 72.6 46.25 1,760 587
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3484 108 V.4. Funding Statement Present And Proper... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,419 473
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3484 108 V.4. Funding Statement Present And Proper... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,095 365
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3484 108 V.4. Funding Statement Present And Proper... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 69.47 48.85 1,671 557
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3483 104 V.2. Vulnerable Groups Not Mentioned in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,571 2857
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3483 104 V.2. Vulnerable Groups Not Mentioned in Submission... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,223 2741
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3483 104 V.2. Vulnerable Groups Not Mentioned in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 80.17 69.31 9,129 3043
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3482 101 V.1.i. Ethics Statement Requirement... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,586 2862
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3482 101 V.1.i. Ethics Statement Requirement... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,267 3089
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3482 101 V.1.i. Ethics Statement Requirement... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 60.44 30.3 9,458 3153
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3481 103 V.1.iii. Cohort-Specific Details Provided In Submission... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 2,355 785
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3481 103 V.1.iii. Cohort-Specific Details Provided In Submission... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,260 420
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3481 103 V.1.iii. Cohort-Specific Details Provided In Submission... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 53.15 20.72 2,542 847
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3480 107 V.3.iii. Not An Educational Intervention Trial... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,532 2844
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3480 107 V.3.iii. Not An Educational Intervention Trial... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,142 2714
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3480 107 V.3.iii. Not An Educational Intervention Trial... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 86.62 68.56 8,522 2841
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3479 100.2 (IV.6.RW) IV.6. Reasoning (Why’s):... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 10,356 3452
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3479 100.2 (IV.6.RW) IV.6. Reasoning (Why’s):... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 8,739 2913
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3479 100.2 (IV.6.RW) IV.6. Reasoning (Why’s):... Trinka GPT-OSS 20B 3 1 1 1 0 False PASS, UNCLEAR, FAIL 35.53 2.32 9,469 3156
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3478 105 V.3.i. Clinical Trial ID Requirement... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,021 3007
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3478 105 V.3.i. Clinical Trial ID Requirement... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,199 2733
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3478 105 V.3.i. Clinical Trial ID Requirement... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 73.7 57.56 9,187 3062
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3477 106 V.3.ii. Clinical Trial ID Is Present And Is From An Acceptable Source... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,678 3226
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3477 106 V.3.ii. Clinical Trial ID Is Present And Is From An Acceptable Source... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,265 2755
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3477 106 V.3.ii. Clinical Trial ID Is Present And Is From An Acceptable Source... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 51.23 26.33 9,040 3013
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3476 097 IV.1.xii. No Identity-Revealing Patient Identifiers In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,931 2977
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3476 097 IV.1.xii. No Identity-Revealing Patient Identifiers In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,154 2718
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3476 097 IV.1.xii. No Identity-Revealing Patient Identifiers In Submission... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 55.71 30.61 9,431 3144
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3475 100.1 IV.5. Section IV Report Output - medRxiv-Specific Content... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 8,793 2931
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3475 100.1 IV.5. Section IV Report Output - medRxiv-Specific Content... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,103 2701
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3475 100.1 IV.5. Section IV Report Output - medRxiv-Specific Content... Trinka GPT-OSS 20B 3 1 0 2 0 False PASS, UNCLEAR, UNCLEAR 60.63 36.73 9,541 3180
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3474 089 IV.1.iv. No Detailed Clinical Histories In Submission... DeepSeek-R1-Distill-Llama-8B 3 2 0 0 1 False ERROR, PASS, PASS 0.0 0.0 6,154 2051
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3474 089 IV.1.iv. No Detailed Clinical Histories In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,655 2885
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3474 089 IV.1.iv. No Detailed Clinical Histories In Submission... Trinka GPT-OSS 20B 3 1 2 0 0 False PASS, FAIL, FAIL 39.96 7.91 11,139 3713
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3473 094 IV.1.ix. No Identity-Revealing Mentions of Professional Occupation and Relat... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,087 3029
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3473 094 IV.1.ix. No Identity-Revealing Mentions of Professional Occupation and Relat... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,271 2757
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3473 094 IV.1.ix. No Identity-Revealing Mentions of Professional Occupation and Relat... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.43 51.89 9,176 3059
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3472 098 IV.1.xiii. No Mention of Privacy-Compromising Travel Histories In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,279 3093
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3472 098 IV.1.xiii. No Mention of Privacy-Compromising Travel Histories In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,487 2829
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3472 098 IV.1.xiii. No Mention of Privacy-Compromising Travel Histories In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 63.21 43.05 9,280 3093
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3471 099 IV.2. Is Not Research Related to Stem Cell Therapies... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,195 3065
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3471 099 IV.2. Is Not Research Related to Stem Cell Therapies... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,115 2705
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3471 099 IV.2. Is Not Research Related to Stem Cell Therapies... Trinka GPT-OSS 20B 3 2 0 0 1 False PASS, ERROR, PASS 45.05 0.0 5,855 1952
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3470 100 IV.3. Not Research Related to Challenge Trials... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,961 2987
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3470 100 IV.3. Not Research Related to Challenge Trials... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,193 2731
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3470 100 IV.3. Not Research Related to Challenge Trials... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.6 19.23 8,869 2956
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3469 093 IV.1.viii. No Identity-Revealing Details Regarding Ancestry, Country of Orig... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,381 3127
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3469 093 IV.1.viii. No Identity-Revealing Details Regarding Ancestry, Country of Orig... Llama-3.1-8B-Instruct 3 2 0 0 1 False PASS, ERROR, PASS 66.67 0.0 5,626 1875
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3469 093 IV.1.viii. No Identity-Revealing Details Regarding Ancestry, Country of Orig... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 62.76 21.27 9,158 3053
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3468 092 IV.1.vii. No Identity-Revealing Dates and Details In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,994 2998
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3468 092 IV.1.vii. No Identity-Revealing Dates and Details In Submission... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,388 2796
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3468 092 IV.1.vii. No Identity-Revealing Dates and Details In Submission... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 50.98 21.08 9,363 3121
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3467 096 IV.1.xi. No Plans of Apartments or Buildings That Pose Identification Risks... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,522 3174
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3467 096 IV.1.xi. No Plans of Apartments or Buildings That Pose Identification Risks... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,529 2843
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3467 096 IV.1.xi. No Plans of Apartments or Buildings That Pose Identification Risks... Trinka GPT-OSS 20B 3 2 0 0 1 False ERROR, PASS, PASS 0.0 0.0 5,881 1960
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3466 095 IV.1.x. No Mentions Of Hospital Names and Locations That Pose Privacy Risk... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,549 3183
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3466 095 IV.1.x. No Mentions Of Hospital Names and Locations That Pose Privacy Risk... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,280 2760
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3466 095 IV.1.x. No Mentions Of Hospital Names and Locations That Pose Privacy Risk... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 75.25 55.88 9,023 3008
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3465 090 IV.1.v. No Pedigrees Or Specific Family Relationships In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,300 3100
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3465 090 IV.1.v. No Pedigrees Or Specific Family Relationships In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,232 2744
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3465 090 IV.1.v. No Pedigrees Or Specific Family Relationships In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 65.06 31.33 8,783 2928
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3464 087 IV.1.ii. No Precise Ages In Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,462 3154
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3464 087 IV.1.ii. No Precise Ages In Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,412 2804
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3464 087 IV.1.ii. No Precise Ages In Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 50.45 25.04 12,631 4210
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3463 088 IV.1.iii. No Sample or Patient IDs in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,447 3149
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3463 088 IV.1.iii. No Sample or Patient IDs in Submission... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,382 2794
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3463 088 IV.1.iii. No Sample or Patient IDs in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 56.73 34.47 9,259 3086
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3462 086 IV.1.i. No Identity-Revealing Photographs in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,345 3115
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3462 086 IV.1.i. No Identity-Revealing Photographs in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,481 2827
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3462 086 IV.1.i. No Identity-Revealing Photographs in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 80.01 69.84 9,023 3008
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3461 085.7 No Mentions of Predefined Phrases in Title/Abstract ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,925 2975
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3461 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,118 2706
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3461 085.7 No Mentions of Predefined Phrases in Title/Abstract ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,463 2821
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3460 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,973 2991
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3460 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,106 2702
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3460 081 III.9.v. Not Study Reporting Altered Pathogen Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 81.76 72.5 8,648 2883
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3459 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,414 3138
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3459 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,073 2691
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3459 082 III.9.vi. Not Study Reporting Product That Interferes with Diagnosis... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 43.47 15.06 8,898 2966
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3458 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,435 3145
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3458 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,094 2698
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3458 077 III.9.i. Not Study Reporting Reduced Effectiveness of Vaccines... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 59.27 28.63 9,275 3092
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3457 085.1 No Utstein, Delphi, Consensus Phrases... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,952 2984
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3457 085.1 No Utstein, Delphi, Consensus Phrases... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,055 2685
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3457 085.1 No Utstein, Delphi, Consensus Phrases... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 94.96 87.94 8,347 2782
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3456 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,123 3041
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3456 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,142 2714
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3456 083 III.9.vii. Not Study Suggesting Weaponization of an Agent... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 55.91 32.7 8,561 2854
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3455 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,009 3003
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3455 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,187 2729
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3455 080 III.9.iv. Not Study Reporting Enhanced Transmissibility or Host Range... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 44.25 13.59 8,558 2853
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3454 084 III.10. No Non-English Text in Submission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,535 2845
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3454 084 III.10. No Non-English Text in Submission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,920 2640
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3454 084 III.10. No Non-English Text in Submission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 68.45 51.28 8,335 2778
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3453 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,519 3173
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3453 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,277 2759
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3453 079 III.9.iii. Not Study Reporting Enhanced Pathogen Virulence or Stability... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 79.96 41.85 8,754 2918
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3452 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,573 3191
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3452 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,010 2670
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3452 078 III.9.ii. Not Study Reporting New Antibiotic/Antiviral Resistance... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.42 35.21 8,673 2891
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3451 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 10,290 3430
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3451 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,622 2874
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3451 073 III.7.iv. Not Study Advocating Early Cessation of Drug Regimens... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 66.89 13.11 9,038 3013
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3450 075 III.8. Not Study With Underlying Agenda... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,771 3257
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3450 075 III.8. Not Study With Underlying Agenda... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,568 2856
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3450 075 III.8. Not Study With Underlying Agenda... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 77.35 32.06 9,061 3020
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3449 074.1 III.7. Not Content That Could Threaten Public Health... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 9,681 3227
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3449 074.1 III.7. Not Content That Could Threaten Public Health... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,330 3110
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3449 074.1 III.7. Not Content That Could Threaten Public Health... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 56.68 33.83 9,637 3212
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3448 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,357 3119
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3448 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,574 2858
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3448 074 III.7.v. Not Study Reporting Biohazards or Dual-Use Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 62.96 15.35 9,149 3050
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3447 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,372 3124
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3447 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,514 2838
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3447 072 III.7.iii. Not Study Challenging Known Toxicity/Carcinogenicity... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 85.69 57.65 8,849 2950
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3446 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,423 3141
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3446 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,283 2761
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3446 071 III.7.ii. Not Study Challenging Infectious Disease Transmission... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 48.83 19.35 8,931 2977
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3445 070 III.7.i. Not Study Challenging Vaccine Safety... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,381 3127
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3445 070 III.7.i. Not Study Challenging Vaccine Safety... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,154 2718
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3445 070 III.7.i. Not Study Challenging Vaccine Safety... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 45.69 15.68 8,672 2891
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3444 068 III.6. Behavioral Genetics with Tenuous Health Connections... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 9,033 3011
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3444 068 III.6. Behavioral Genetics with Tenuous Health Connections... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,130 2710
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3444 068 III.6. Behavioral Genetics with Tenuous Health Connections... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 69.95 9.84 8,588 2863
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3443 067 III.5. Not Based on Pseudoscience... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,964 2988
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3443 067 III.5. Not Based on Pseudoscience... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,286 2762
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3443 067 III.5. Not Based on Pseudoscience... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 46.24 17.62 9,751 3250
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3442 065 III.3. Not Self-Serving Research... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,888 3296
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3442 065 III.3. Not Self-Serving Research... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,214 2738
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3442 065 III.3. Not Self-Serving Research... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 50.07 17.81 9,094 3031
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3441 064 III.2. Not Study of Smoking or Vaping... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,018 3006
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3441 064 III.2. Not Study of Smoking or Vaping... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,995 2665
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3441 064 III.2. Not Study of Smoking or Vaping... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 55.92 31.47 8,507 2836
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3440 061.3 II.6.vi.C. References Are Formatted Consistently... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,397 2799
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3440 061.3 II.6.vi.C. References Are Formatted Consistently... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,641 2547
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3440 061.3 II.6.vi.C. References Are Formatted Consistently... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 47.73 19.22 9,235 3078
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3439 062 II.6.vi.D. References Are Not Present in Main Text... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 8,730 2910
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3439 062 II.6.vi.D. References Are Not Present in Main Text... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,190 2730
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3439 062 II.6.vi.D. References Are Not Present in Main Text... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 50.93 16.18 9,553 3184
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3438 066 III.4. Not A Political Paper... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,745 2915
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3438 066 III.4. Not A Political Paper... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,974 2658
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3438 066 III.4. Not A Political Paper... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 50.6 21.82 8,340 2780
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3437 063 III.1. No Identifiable Information (Photographs and Names)... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 9,222 3074
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3437 063 III.1. No Identifiable Information (Photographs and Names)... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,514 2838
logs-iy0k2xs44l4l32nmmiyy8p0a-MEDRXIV_2025_339481-2-2025-12-05T14-38-56-115Z.xlsx 3437 063 III.1. No Identifiable Information (Photographs and Names)... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 57.2 33.76 8,985 2995
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4304 ... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,158 1386
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4304 ... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,476 1492
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4304 ... Trinka GPT-OSS 20B 3 1 1 1 0 False PASS, UNCLEAR, FAIL 37.18 0.2 3,908 1303
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4303 ... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 4,707 1569
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4303 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 4,122 1374
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4303 ... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 65.36 29.01 4,783 1594
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4302 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,641 547
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4302 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,155 385
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4302 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,476 492
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4301 ... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,767 589
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4301 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,173 391
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4301 ... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 82.36 60.51 1,932 644
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4300 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,758 1586
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4300 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,843 1281
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4300 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 71.49 14.47 4,574 1525
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4299 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 1,662 554
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4299 ... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 1,266 422
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4299 ... Trinka GPT-OSS 20B 3 1 0 2 0 False PASS, UNCLEAR, UNCLEAR 66.14 48.03 2,240 747
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4298 ... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 5,367 1789
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4298 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,710 1570
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4298 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.12 56.7 5,375 1792
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4297 ... DeepSeek-R1-Distill-Llama-8B 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,506 502
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4297 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,041 347
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4297 ... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 54.87 31.68 1,628 543
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4296 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,005 1335
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4296 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,720 1240
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4296 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 82.45 72.45 4,402 1467
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4295 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,875 1625
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4295 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,340 1780
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4295 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 86.31 69.36 4,943 1648
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4294 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,077 2359
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4294 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,273 2091
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4294 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 82.71 54.01 6,719 2240
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4293 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,035 2345
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4293 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,330 2110
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4293 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 98.83 96.48 6,992 2331
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4292 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,902 1634
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4292 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,119 1373
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4292 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 92.36 83.29 4,471 1490
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4291 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,614 2538
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4291 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,231 2077
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4291 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 95.07 85.21 6,899 2300
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4290 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,446 2482
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4290 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,405 2135
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4290 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 56.73 33.62 7,214 2405
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4289 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 8,562 2854
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4289 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,260 2420
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4289 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 73.71 57.96 8,262 2754
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4288 ... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,845 1615
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4288 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,894 1298
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4288 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 54.2 26.57 5,320 1773
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4287 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,752 2584
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4287 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,050 2350
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4287 ... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 60.95 18.43 8,090 2697
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4286 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,870 2290
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4286 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 5,901 1967
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4286 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 62.24 43.31 6,817 2272
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4285 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,644 2548
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4285 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 6,138 2046
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4285 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 61.4 41.7 6,851 2284
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4284 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,936 2312
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4284 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,940 1980
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4284 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 91.11 77.6 6,378 2126
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4283 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,594 2198
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4283 ... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 6,093 2031
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4283 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 57.51 10.8 6,964 2321
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4282 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,014 2338
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4282 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 6,000 2000
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4282 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 65.21 42.97 6,933 2311
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4281 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 4,455 1485
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4281 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 3,786 1262
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4281 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 59.85 35.21 4,333 1444
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4280 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,849 2283
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4280 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 5,946 1982
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4280 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.63 42.18 6,782 2261
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4279 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,728 2576
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4279 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,014 2338
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4279 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 98.06 94.17 7,460 2487
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4278 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,599 2533
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4278 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,930 2310
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4278 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 87.49 76.54 7,451 2484
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4277 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,921 2307
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4277 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,066 2022
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4277 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 51.64 26.1 6,577 2192
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4276 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,425 2475
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4276 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,168 2056
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4276 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 63.82 45.69 6,751 2250
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4275 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,495 2165
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4275 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,018 2006
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4275 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 95.87 89.98 6,393 2131
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4274 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,434 2478
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4274 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,288 2096
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4274 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.23 38.77 6,899 2300
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4273 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,140 2380
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4273 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,180 2060
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4273 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 76.48 53.54 6,561 2187
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4272 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,227 2409
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4272 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,060 2020
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4272 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 91.35 75.2 6,522 2174
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4271 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,224 2408
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4271 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,279 2093
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4271 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 77.86 62.14 6,698 2233
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4270 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,335 2445
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4270 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,306 2102
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4270 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 46.0 15.1 7,273 2424
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4269 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,440 2480
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4269 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,111 2037
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4269 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 97.14 93.52 6,704 2235
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4268 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,017 2339
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4268 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,789 2263
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4268 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 68.96 10.9 7,169 2390
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4267 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,386 2462
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4267 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,237 2079
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4267 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 57.8 33.81 7,204 2401
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4266 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,263 2421
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4266 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,123 2041
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4266 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 60.4 22.38 6,866 2289
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4265 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,191 2397
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4265 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,312 2104
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4265 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 93.83 88.85 6,560 2187
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4264 ... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 7,095 2365
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4264 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,348 2116
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4264 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 79.56 65.86 6,626 2209
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4263 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,656 2552
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4263 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,297 2099
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4263 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 92.07 88.11 6,525 2175
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4262 ... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 5,091 1697
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4262 ... Llama-3.1-8B-Instruct 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,560 1520
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4262 ... Trinka GPT-OSS 20B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 4,848 1616
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4261 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,569 2523
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4261 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,375 2125
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4261 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 53.29 25.65 7,030 2343
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4260 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,449 2483
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4260 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 6,075 2025
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4260 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 67.45 19.34 6,740 2247
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4259 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,978 2326
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4259 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,102 2034
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4259 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 72.4 56.01 7,162 2387
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4258 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,827 2609
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4258 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,228 2076
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4258 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 70.67 40.65 7,026 2342
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4257 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,924 2308
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4257 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 6,171 2057
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4257 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 95.45 92.48 6,699 2233
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4256 ... DeepSeek-R1-Distill-Llama-8B 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,794 2598
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4256 ... Llama-3.1-8B-Instruct 3 3 0 0 0 True PASS, PASS, PASS 100.0 100.0 7,005 2335
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4256 ... Trinka GPT-OSS 20B 3 3 0 0 0 True PASS, PASS, PASS 66.67 39.07 7,640 2547
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4255 ... DeepSeek-R1-Distill-Llama-8B 3 0 0 3 0 False UNCLEAR, UNCLEAR, UNCLEAR 100.0 100.0 2,010 670
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4255 ... Llama-3.1-8B-Instruct 3 0 3 0 0 True FAIL, FAIL, FAIL 100.0 100.0 1,023 341
logs-j5mck6frm5avgqwrgac0ejrn-medRxiv_1979_000026-2-2025-12-04T10_24_14.870Z.xlsx 4255 ... Trinka GPT-OSS 20B 3 0 3 0 0 True FAIL, FAIL, FAIL 54.31 29.65 2,008 669

Data Files: