
Thousands of preprints have been withdrawn from the arXiv preprint server because of factual or methodological errors. Credit: Ralf Geithe/Getty
Researchers have launched a database of more than 14,000 studies that have been withdrawn from the preprint server arXiv since its launch in 1991.
As well as shedding light on why those preprints were pulled from arXiv, the data set, called WithdrarXiv, aims to spur the creation of automated tools that flag potential errors to researchers hoping to submit manuscripts, says Delip Rao, a computer scientist at the University of Pennsylvania in Philadelphia and a co-author of a study describing the tool. Most preprints have not been through formal peer review or quality-assurance processes.

Figure: ‘Withdrawn preprints’. Source: ref. 1
In the study, itself posted to arXiv on 4 December 2024 (ref. 1), Rao and colleagues categorized withdrawn preprints using the comments that authors provided about their reasons for removing a study, which ranged from crucial errors to violations of policy. They found that factual, methodological or other important errors were the most commonly cited reason for withdrawal, accounting for more than 6,000 of the preprints pulled from the platform. More than 3,100 preprints were withdrawn because they were incomplete or there was more work in the pipeline, and more than 2,800 preprints were pulled because they had been subsumed by another publication (see ‘Withdrawn preprints’).
This is in contrast to many retractions issued by scholarly journals, the study says. These often take place after a peer-reviewed paper has been published, for reasons related to academic misconduct — such as plagiarism or data falsification — as well as honest errors.