AI is making journalistic language more repetitive and predictable – and it’s a problem for all of us

AI is making journalistic language more repetitive and predictable – and it’s a problem for all of us

AI is making journalistic language more repetitive and predictable – and it’s a problem for all of us

https://theconversation.com/ai-is-making-journalistic-language-more-repetitive-and-predictable-and-its-a-problem-for-all-of-us-283642

Publish Date: 2026-05-27 04:25:00

Source Domain: theconversation.com

What happens to language when a growing amount of text published in the press, online and on social media is written by machines? This question is not just important for the profession of journalism – it also has an impact on the richness of the language we all use to comprehend, describe and discuss reality itself.

Historically, the press has been a space where public language grows and becomes richer. It is not, of course, the only driver of linguistic change, but it is one of the fields where new or emerging words, turns of phrase and ways of describing facts begin to circulate within society.

Studies on journalistic language and neologisms clearly demonstrate that newspapers are platforms for the creation and dissemination of new vocabulary, especially when it is needed to report on events, technology and social changes for a broad audience.

However, if a significant amount of journalistic writing is delegated to generative AI, this role will diminish. Large language models (LLMs) generally work by predicting the next “token” or word in a sequence. This allows them to produce fluent and believable text, but it also gives them a tendency to prioritise statistical regularity, as well as common, established arguments and formulations.

In and of itself, this does not degrade language. The problem arises when this logic comes to dominate writing in the public sphere.

The AI feedback loop

The risks become serious when AI systems begin training themselves on texts already produced by AI. This leads to what a number of studies call “model collapse”, a degenerative process whereby material produced by one model contaminates the training data of later generations.

In plain terms, this means that AI systems learn more and more from synthetic text. If these texts fill public spaces – both online and offline – the verbal ecosystem for future training will be much more constricted.

A greater volume of artificial text means less contact with…

Source