ArXiva widely used open repository for preprint research, is doing more to end the careless use of large language models in scientific papers.
Although articles are published on the site before being peer-reviewed, arXiv (pronounced “archive”) has become one of the main ways that research in fields such as computer science and mathematics circulates, and the site itself has become a source of data on trends in scientific research.
ArXiv has already taken steps to combat a growing number of low-quality AI-generated papers, for example by requiring first-time publishers to get endorsement from an established author. And after being housed at Cornell for more than 20 years, the organization is becoming an independent nonprofit, which should allow it raise more money to address problems like the decline of AI.
In his latest move, Thomas Dietterich, president of the arXiv computer science section, aware Thursday that “if a submission contains incontrovertible evidence that the authors did not verify the results of the LLM generation, this means that we cannot trust anything in the article.”
That incontrovertible evidence could include things like “delusional references” and comments to or from the LLM, Dietterich said. If such evidence is found, the paper’s authors will face “a one-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted by an accredited, peer-reviewed venue.”
Note that this is not an outright ban on the use of LLM, but rather an insistence that, as Dietterich put it, authors take “full responsibility” for the content, “regardless of how the contents are generated.” So if researchers copy and paste “inappropriate language, plagiarized content, biased content, errors, incorrect references or misleading content” directly from an LLM, then they are still responsible for it.
Dietrich he told 404 Media This will be a “one-hit” rule, but moderators must point out the problem and section presidents must confirm the evidence before imposing the sanction. The authors may also appeal the decision.
Recent peer-reviewed research has found that Made-up quotes are on the rise. in biomedical research, probably due to LLMs; Although to be fair, scientists aren’t the only ones caught using quotes created by AI.
When you purchase through links in our articles, we may earn a small commission. This does not affect our editorial independence.





