New arXiv policy: 1-year ban for hallucinated references
Scope of the New arXiv Policy
- Policy (as reported in the thread):
- One-year ban for submissions with AI-hallucinated or obviously fake references.
- Afterward, future submissions must first be accepted at a “reputable peer‑reviewed venue.”
- Authors are held fully responsible for all content, regardless of tools used.
- Some note this is not yet clearly documented on arXiv’s public policies; may be planned or evolving.
Arguments Strongly Supporting the Policy
- Fake or non-existent references are framed as:
- Equivalent to fraud or at least gross negligence, not a minor error.
- A basic failure of scholarly standards: verifying that every cited work exists and is relevant is “table stakes.”
- A signal that the rest of the paper (data, analysis, conclusions) may also be unreliable.
- References are seen as core to scientific work, not cosmetic; sloppiness wastes readers’ and reviewers’ time.
- A ban is viewed as:
- A necessary deterrent in an era where LLMs make slop easy to produce.
- A way to protect the scientific record and increase arXiv’s value.
- Many argue: if AI is used correctly and outputs are checked, this policy imposes no cost on honest researchers.
Criticism and Concerns
- Some see the penalty as excessive, especially the ongoing requirement for prior peer‑reviewed acceptance, which is interpreted by some as a de facto lifetime constraint.
- Concern that:
- arXiv is meant for preprints and early dissemination; tying it to traditional peer review undercuts its purpose.
- Peer review is much harder to clear than arXiv’s bar, potentially locking out less-established researchers.
- Debate over whether one hallucinated citation proves “fraud” vs. mere carelessness, especially in multi‑author or last‑minute-edit scenarios.
Enforcement and Practical Issues
- arXiv cannot comprehensively check references; enforcement likely relies on:
- Automated tools (DOI checks, citation matchers).
- Reader reports and spot checks.
- Some propose LLMs and specialized tools for citation verification, though others insist database/HTTP checks are needed due to high stakes.
Broader Themes
- Strong pushback against “AI slop” in academia; many want higher standards, not AI bans.
- Divides appear between those enthusiastic about automated research and those worried about erosion of rigor and reviewer burden.