The Biomass Problem: How AI Is Eating Itself
Site Owner
Published on 2026-05-27
As AI generates over 500 billion images and trillions of words annually, a quiet crisis is emerging: model collapse. Trained on their own outputs across multiple generations, AI systems risk becoming progressively narrower and less capable at representing the diversity of real-world data. This article explores the biomass problem — why it happens, why benchmarks miss it, and why the math is relentless.

The Biomass Problem: How AI Is Eating Itself
In 2026, AI systems generated over 500 billion images and tens of trillions of words. By 2027, estimates suggest humans will produce less than 5% of the data that AI models train on. These numbers sound like industry trivia. They're actually a slow-motion catastrophe buried in plain sight.
Welcome to model collapse — the feedback loop that could make future AI systems progressively worse even as they appear to improve.
A Brief History of Eating Your Own Tail
The concept isn't new. In 2024, researchers from Oxford and Cambridge published a seminal paper showing what happens when a model repeatedly trains on data it or similar models generated. The result wasn't dramatic — no error messages, no obvious failures. Instead, the model slowly lost the ability to represent the tails of real distributions. Rare events — the unusual bird species, the unconventional sentence structure, the edge case in medical imaging — gradually disappeared from the model's world.
The model didn't get dumber in any way that standard benchmarks would catch. It got narrower. More homogeneous. Less capable of handling anything that didn't look like the average of what it had already seen.
This is the biomass problem. AI is digesting its own outputs and calling it nutrition.
Why Nobody Noticed
Model collapse is hard to see from inside the lab. Here's why:
Benchmarks reward average performance. MMLU, HumanEval, MATH — these tests measure how well models handle common, well-represented tasks. A model suffering from mild collapse can score higher on these benchmarks than a healthier competitor simply because it's more "confident" about mainstream patterns. Confidence and correctness are not the same thing.
Synthetic data works — until it doesn't. For many practical applications, training on AI-generated data is genuinely fine. Code completion, email drafting, standard documentation — these domains are saturated with examples. Adding more of the same quality doesn't hurt much. The problem emerges at the edges: rare diseases in medical data, unusual syntax in low-resource languages, edge cases in legal reasoning. These are exactly the areas where data scarcity already exists, and where synthetic data is most likely to be used as a band-aid.