Clinical Research

The Ethics of Junk Science: It’s Time for a Reboot

Science is supposed to be self-correcting. That mechanism is broken.

Science is supposed to be self-correcting. That mechanism is broken.

The Retraction Watch database, now maintained in partnership with Crossref, contains more than 63,000 retracted scientific publications. Upwards of 10,000 papers were retracted in 2023 alone, a single-year record. Springer Nature retracted nearly 3,000 articles in 2024 from its portfolio of 482,000 published papers (more than 30 percent of which belong to the health sciences). And experts widely agree this represents only a fraction of what should be retracted, because the system that flags and removes flawed research is slow, opaque, and often compromised by the same institutional incentives that produced the bad work.

The celebrated epidemiologist John Ioannidis argued in his landmark 2005 essay that most published research findings are false, a consequence of small samples, flexible analyses, and a publishing culture that rewards positive results. A 2016 Nature survey found that more than 70 percent of researchers had tried and failed to reproduce another scientist’s experiments. An Amgen analysis of 53 landmark preclinical cancer studies found that only 11 percent could be replicated. The Open Science Collaboration’s 2015 replication project reproduced fewer than half of 100 published psychology studies. This is not a rounding error. It is a crisis of epistemology, and it carries a price: Irreproducible research wastes an estimated $28 billion in U.S. research funding every year.

Scientific misconduct is not rare, and it is not confined to a few bad actors. It is a systemic failure — one built into the architecture of how research is funded, published, and rewarded. Fabricated data enters the literature. Journals sit on retraction notifications for years. Institutions protect high-revenue fraudsters. Clinicians make decisions based on findings that were never real. And the machinery grinds on, largely unchecked, because the incentives that produce bad science are the same ones that make it so hard to correct.

The names most people know are the spectacular ones. 

  • Andrew Wakefield published a 1998 Lancet paper claiming a link between the MMR vaccine and autism, based on 12 children, undisclosed payments from personal injury lawyers, and data he manipulated to fit his conclusion. The paper was fully retracted in 2010, and Wakefield was struck from the UK medical register. Measles outbreaks followed in his wake for decades. 
  • Yoshitaka Fujii accumulated more than 180 retractions after statistical analysis revealed his anesthesia trial data was simply impossible. 
  • Paolo Macchiarini published favorable outcomes from synthetic trachea transplants in The Lancet while his patients died, eventually bringing down the vice-chancellor of the Karolinska Institute and causing several resignations from the Nobel Committee for Physiology or Medicine. 
  • Eric Poehlman became the first American academic sentenced to federal prison for research fraud, earning a lifetime ban from federal research funding along with his 12-month sentence.

These are the cases dramatic enough to make the news. They are not the problem. They are symptoms of it.

An Ethical Failure, Not Just a Methodological One

Let us be clear about what this is. When a researcher fabricates data, they are not making a statistical error; they are deceiving colleagues, misleading clinicians, misdirecting public funding, and, in medicine, potentially harming patients. When an institution protects a high-profile fraudster because he generates grant revenue, it is complicit. When a journal declines to retract a paper for years after being notified of problems, it is choosing its own reputational interests over the integrity of the scientific record.

The ethical framework that governs human subjects research, built on the Belmont principles of respect for persons, beneficence, and justice, has no meaningful parallel governing what gets published. There is no binding international standard for what constitutes research misconduct, no universal retraction threshold, no protected whistleblower pathway comparable to other regulated industries. The Office of Research Integrity can investigate federally funded misconduct, but its resources are thin and its findings typically arrive years after the damage is done.

The deeper structural driver is the publish-or-perish incentive: Publication volume drives hiring, tenure, promotion, and grant access. This rewards quantity and novelty while punishing null results, replications, and corrections. Researchers respond rationally with what are politely called Questionable Research Practices, including p-hacking (manipulating data analysis — such as trying multiple methods, selective reporting, or adjusting sample sizes), HARKing (Hypothesizing After Results are Known), and selective outcome reporting. These are not always conscious fraud. They are the predictable output of a badly designed system. Predatory journals and paper mills have made it worse, offering paid authorship in fabricated studies that have flooded indexed databases with noise. Content that, now indexed and freely searchable, reaches an audience the print era never imagined, and invites scrutiny at a scale it cannot escape.

The Reboot

The framework for a better system already exists. Pre-registration, promoted by the Center for Open Science, requires researchers to publicly specify their hypothesis and analysis plan before collecting data, eliminating most p-hacking and HARKing structurally. Registered Reports commit journals to publish based on methodological rigor rather than results, directly attacking publication bias at the source. NIH data sharing mandates allow independent verification. The EQUATOR Network and CONSORT guidelines give journals the tools to reject work that fails basic transparency standards.

But the cultural change is the harder problem. As a 2025 analysis of academic publishing incentives argues, rather than measuring impact by citation count, we should count independent replications. That is not a proxy for quality. It is quality, directly measured. A career built on fifty papers that replicate is worth more to medicine than one built on three hundred that do not. A researcher who corrects the record, publishes a rigorous null finding, or invests in replication is doing work the current system treats as professionally suicidal. That has to change.

Volume is the wrong metric. Novelty is the wrong metric. The right test is whether the work stands up, over time, under independent scrutiny, with the same result. That is what science is supposed to be. Until we build an incentive architecture that rewards durability over productivity, the machine will keep producing exactly what it was designed to produce: too many papers, not enough truth.

About the author

  • Stephen P. Wood 

    Stephen P. Wood, DMSc, ACNP-BC, FAWM, FNAP is a Clinical Associate Professor and Program Director of the Adult-Gerontology Acute Care Nurse Practitioner Program at Northeastern University’s Bouvé College of Health Sciences. He is a former Visiting Researcher at Harvard Medical School’s Center for Bioethics and Harvard Law School’s Petrie-Flom Center.