Insights AI News How AI preprint quality score spots great science early
post

AI News

01 Jul 2026

Read 8 min

How AI preprint quality score spots great science early

AI preprint quality score helps labs and committees spot rigorous, original preprints for fast review.

An AI preprint quality score can flag strong studies before journals do. QED, a review assistant from Oded Rechavi’s team, rates originality and validity in preprints while hiding author and institution. In tests with thousands of papers, expert judges agreed with the scores about 75 percent of the time. Scientists are drowning in new papers. Peer review can take more than a year, and many teams need early signals to decide what to read or fund. QED offers one such signal. It reads a preprint, checks the claims, and gives a simple score for novelty and soundness. The goal is to highlight solid work faster and with less bias.

Why an AI preprint quality score matters

The bottleneck in science

Peer review is slow. Some biomedical papers take up to 18 months to publish. Reviewers spend late nights checking data. With so much output, most people default to journal rank as a shortcut, even though it can reflect prestige more than quality.

Reducing noise and bias

QED was built to help at the preprint stage, when journal labels and citation counts do not exist. It reviews text and figures, then scores validity and originality. It hides author names, institutions, and countries, which can lower location and gender bias that creep into human review.

How the AI preprint quality score works (QED)

Blind to prestige

The system ignores who wrote the paper and where they work. It looks only at what the manuscript claims and how well the evidence supports those claims.

What it measures

– Validity: Are the methods and controls appropriate? – Originality: Is the idea new or a small step? – Gaps: What is missing or unclear?

Scale and findings

– QED scored 57,455 bioRxiv preprints from May 2025 to April 2026. – It highlighted the top one percent across life science fields. – Country patterns were mixed: the US posted the most preprints, while Austria stood out for a high share of top-scoring work.

Does it match expert judgment?

Head-to-head with journals and reviewers

Rechavi’s team compared QED Scores for about 5,000 preprints, nearly 3,000 of which later appeared in journals. Scores showed a positive link with the eventual journal rank, and in pairwise tests where score and journal status disagreed, independent experts chose the higher-scored paper about 75 percent of the time. In other words, when journals and the score sent mixed signals, experts tended to side with the score.

Where it can miss

Sometimes a low-scored paper lands in a high-ranking journal, or the reverse. Reviewers also noted that QED can suggest extra experiments that are not realistic on tight timelines. These cases show why human judgment still matters.

Benefits, limits, and good use

What it helps with

– Fast triage of crowded reading lists – Early feedback for authors before submission – More level playing field by hiding prestige signals – A common, transparent starting point for discussion

What to watch for

– One number cannot capture all value in science – Do not use a single score to judge careers or funding alone – Keep disclosure rules for AI use in peer review – Apply practical judgment to any suggested experiments

Putting QED to work in your lab

Simple workflows

– Scan your weekly preprint feed. Sort by score, then read the top few in full. – Before submission, run your preprint to spot weak claims or missing controls. – In journal clubs, compare the score with your team’s ratings and discuss gaps. – For grant panels, use scores as a first pass, not as a final decision.

What this could mean for global science

Speed, access, and fairness

A fast, blinded review layer can help more people find strong work early, not just those close to elite labs. It may also surface high-quality studies from smaller countries or lesser-known institutions. But it should sit next to, not on top of, careful human reading.

The bottom line on early signals

An AI preprint quality score is a practical proxy when time is short and signal is scarce. QED shows that AI can align with experts much of the time and can curb bias by ignoring prestige cues. Use it to guide attention and improve drafts, then let human judgment finish the job.

(Source: https://www.the-scientist.com/can-ai-tools-spot-great-science-before-reviewers-do-74677)

For more news: Click Here

FAQ

Q: What is an AI preprint quality score and how does QED produce it? A: The AI preprint quality score evaluates the originality and validity of preprint manuscripts by checking claims and evidence while hiding author and institutional information. QED, developed by Oded Rechavi’s team, reads preprints, assesses methods and novelty, and derives a blinded score for each manuscript. Q: How many preprints did QED score and what did it identify? A: Rechavi’s team scored 57,455 bioRxiv preprints submitted between May 2025 and April 2026 and highlighted the top one percent of work across life science fields. Country-level patterns were mixed, with the US submitting the most preprints while Austria had a notably high share of top-scoring papers. Q: How well does the AI preprint quality score align with journal rankings and expert reviewers? A: QED Scores showed a positive correlation with eventual journal rank and in pairwise tests where score and journal disagreed, independent experts sided with higher-scored papers about 75 percent of the time. This suggests the AI preprint quality score often agrees with formal peer review but is not perfect. Q: Can the AI preprint quality score replace traditional peer review? A: No, QED and its AI preprint quality score are intended to complement, not replace, peer review and to provide faster, blinded signals about a manuscript’s novelty and soundness. Human reviewers remain necessary because the score can miss context, suggest impractical experiments, and cannot capture all dimensions of scientific value. Q: What are the main limitations of relying on an AI preprint quality score? A: One-number metrics cannot capture all aspects of scientific value, and the QED tool can sometimes recommend experiments that are unrealistic on tight timelines. Stakeholders should avoid using the score alone for hiring, promotion, or funding decisions and must retain human oversight. Q: How can researchers and labs use the AI preprint quality score in everyday workflows? A: Labs can use the AI preprint quality score to triage reading lists, run preprints for feedback before submission, and focus journal clubs on high-scoring papers as a first pass. The score is useful for quickly highlighting promising work but should be followed by full human read-throughs and discussion. Q: Does the AI preprint quality score help reduce bias in manuscript evaluation? A: Yes, the system hides author names, institutions, and countries, which helps lower location and gender bias that can affect human reviewers. This blinded approach can create a more level playing field for early signals about quality, though it does not eliminate the need for diverse human assessment. Q: What future improvements are planned for QED and the AI preprint quality score? A: Rechavi and his team plan to improve the AI evaluations and extend scoring and reviews to fields beyond the life sciences to make the system more comprehensive. They view AI scores as easier to iterate on and hope to refine the tool while maintaining it as a complement to human peer review.

Contents