AI Generally
Monday, 18 May 2026

arXiv Will Issue One-Year Bans for Papers That Show Clear Evidence of Unchecked AI Output

arXiv announced a one-year ban for researchers who submit papers containing incontrovertible evidence of unreviewed AI-generated content, targeting hallucinated citations, leftover chatbot prompts, and unfilled placeholder data rather than AI use itself.

Image: screenshot of the arXiv preprint repository website interface. Source: thenextweb.com

arXiv, the preprint server that distributes roughly 15,000 new research papers each month across physics, mathematics, computer science, and related fields, announced on May 15, 2026, that authors who submit papers bearing incontrovertible evidence of unchecked AI generation will receive a one-year suspension from the platform. Once the ban expires, they will also have to publish at a peer-reviewed venue before they can submit again.

What Triggers the Penalty

Thomas Dietterich, who chairs arXiv's computer science section and announced the policy, described three categories of evidence that would meet the bar for action. The first is hallucinated references — citations to papers that do not exist, which LLMs manufacture with confident bibliographic detail when asked to fill out a reference list without being given real sources. The second is meta-commentary left in the submitted text: chatbot-style asides such as "here is a 200-word summary; would you like me to make any changes?" that indicate the author did not read the output before uploading it. The third is placeholder instructions that were never resolved, such as "fill in with the real numbers from your experiments."
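To make the second and third categories concrete, here is a minimal sketch of how leftover chatbot asides and unresolved placeholders could be spotted with plain pattern matching. The function name `flag_unchecked_output` and every pattern in it are hypothetical, chosen only for illustration; nothing here reflects tooling arXiv has described. Hallucinated references are out of scope for the sketch, since checking them would mean looking each citation up in a bibliographic database.

```python
import re

# Hypothetical patterns for illustration only; these are not arXiv's rules.
META_COMMENTARY = [
    r"would you like me to",
    r"here is (?:a|an|the) [\w\s-]{0,40}?(?:summary|draft|rewrite|version)",
    r"as an ai language model",
]
PLACEHOLDERS = [
    r"fill in with the real",
    r"\[(?:insert|todo|citation needed)[^\]]*\]",
]


def flag_unchecked_output(text: str) -> list[tuple[str, str]]:
    """Return (category, matched_text) pairs for suspicious spans in text."""
    findings = []
    groups = (("meta-commentary", META_COMMENTARY), ("placeholder", PLACEHOLDERS))
    for category, patterns in groups:
        for pattern in patterns:
            # Case-insensitive search, since the leftovers can start a sentence.
            for match in re.finditer(pattern, text, flags=re.IGNORECASE):
                findings.append((category, match.group(0)))
    return findings


if __name__ == "__main__":
    sample = (
        "Here is a 200-word summary; would you like me to make any changes? "
        "Fill in with the real numbers from your experiments."
    )
    for category, span in flag_unchecked_output(sample):
        print(f"{category}: {span!r}")
```

A scan like this can only surface candidates. Under the policy as described, a human moderator and a section chair still decide whether the evidence is incontrovertible.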

The exact wording of Dietterich's standard matters. The policy does not prohibit using AI tools for drafting, literature synthesis, editing, or analysis. It targets a specific failure mode: "if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can't trust anything in the paper." The violation is not AI use but the abandonment of authorial responsibility.

Enforcement follows a two-step process: a moderator must flag the submission, and a section chair must confirm the evidence before action is taken. Authors retain a right of appeal.

The Scale of the Problem

The policy arrives as empirical data on AI-driven citation fraud has become hard to ignore. A Columbia University audit of approximately 2.5 million biomedical preprints found that fabricated citations rose twelvefold between 2023 and early 2026. In 2025, roughly one paper in every 458 contained fake references; in the first seven weeks of 2026, the rate climbed to one in 277.
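The two "one in N" figures are easier to compare as percentages. The short calculation below is a sketch that uses only the 458 and 277 figures from the audit as reported here; the variable names are illustrative, and it does not try to reconstruct the 2023 baseline behind the twelvefold claim, because the article does not say which endpoint that multiple is measured against.

```python
from fractions import Fraction

# Rates as reported in the article: one paper in N contains fake references.
rate_2025 = Fraction(1, 458)
rate_early_2026 = Fraction(1, 277)

# Express each rate as affected papers per 10,000 submissions.
per_10k_2025 = float(rate_2025 * 10_000)
per_10k_early_2026 = float(rate_early_2026 * 10_000)

# Relative change from 2025 to the first seven weeks of 2026.
ratio = float(rate_early_2026 / rate_2025)

if __name__ == "__main__":
    print(f"2025:       {per_10k_2025:.1f} per 10,000 papers")
    print(f"Early 2026: {per_10k_early_2026:.1f} per 10,000 papers")
    print(f"Increase:   {ratio:.2f}x")
```

On these figures the share of affected papers grows from about 0.22% to about 0.36%, an increase of roughly 65% over the 2025 rate.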

These numbers matter because arXiv is not a peer-reviewed journal. It operates on trust that authors will not upload work they have not read. The endorsement system that governs first-time submissions provides some friction, but it was designed for a different era of academic misconduct. A researcher who can get an endorser to vouch for their institutional affiliation faces no further gate if they then submit AI-generated text they never checked.

The volume problem is compounding the quality problem. arXiv became an independent nonprofit in 2025 after more than two decades hosted by Cornell, and it now handles submission loads that make manual review of every paper impossible. Automated plagiarism and citation-verification tools can flag statistical anomalies, but distinguishing AI-assisted good-faith work from AI-generated negligence is a judgment call that still requires human moderators.

What Changes, and What Does Not

The practical effect of the policy will depend heavily on enforcement consistency across arXiv's different subject sections, which have historically applied rules with varying degrees of rigour. The computer science section, where AI-generated submissions are presumably most prevalent, is chaired by the person who announced the policy, so enforcement may initially be stricter there.

For the broader research community, the policy articulates a standard that many institutions and journals have been reluctant to state clearly: AI use in research is not per se disqualifying, but failing to verify what you submit is. That framing sidesteps the polarised debate about whether AI should be allowed in academic writing at all, and instead treats the issue as one of professional responsibility — the same accountability that applies to researchers who use statistical software or laboratory automation.

Several preprint communities and journals have been watching arXiv's approach as a potential template. The server's scale — it receives more submissions in a month than many journals receive in a year — means its enforcement choices effectively set a norm for rapid scientific communication, regardless of whether those choices are formally adopted elsewhere.

The policy does not address the harder question of how to handle papers where AI assistance produced subtler errors — plausible-sounding but incorrect experimental descriptions, or overstated claims generated by summarisation models — that do not leave obvious fingerprints. That problem is likely to grow more acute as AI writing tools become less detectable, and it will require a different kind of institutional response.

