New study of Mainz University finds that language models misinterpret words in regional varieties
19 May 2026
How well do language models understand Meenzerisch, the dialect spoken in the German city of Mainz? A research team led by Johannes Gutenberg University Mainz (JGU) has now investigated this question for the first time. Meenzerisch still shapes regional language culture today and is known across Germany through the satirical carnival speeches in the famous Mainz Fastnacht tradition. The study's findings, recently presented at the 2026 Language Resources and Evaluation Conference (LREC) in Palma de Mallorca, show that current artificial intelligence (AI) models have great difficulty understanding the dialect correctly.
"Language varieties such as Meenzerisch are an important part of cultural identity, but are disappearing from everyday use," said Minh Duc Bui of the JGU Institute of Computer Science, who led the study together with Professor Katharina von der Wense. "Regional dialects have so far received very little attention in digital language research. Yet language technologies could help make them more visible and preserve them in the long run."
Making Meenzerisch machine-readable
The team, which also included a researcher from Marburg University, first created a new dataset for Meenzerisch. It was based on a dictionary published in 1966, which the researchers digitized. The result was a machine-readable lexicon of 2,351 dialect words and their definitions in standard German. "Until now, resources of exactly this kind were missing for Meenzerisch," stated Professor Katharina von der Wense, head of the Natural Language Processing group at JGU.
Using the digital lexicon, the researchers were now able to systematically examine how well large language models handle the dialect. They tested several open-source language models of different sizes. The models were asked first to explain the meanings of Meenzerisch words and then to generate the correct dialect words from definitions written in standard German.
"Our results are quite clear," said Bui. "The models we tested fail both at understanding and at producing the dialect." When asked to generate word definitions, the models achieved an average accuracy of only 4.24 percent. Even the strongest models in the study scored very poorly. In the reverse task of generating a dialect word from a definition, accuracy dropped to just 0.56 percent. Additional support, such as prompt examples or automatically derived rules, did little to improve performance. In every case, accuracy remained below 10 percent.
Smaller language varieties risk becoming digitally invisible
"The results show very clearly that today's language models barely understand Meenzerisch," said co-author Professor Peter Herbert Kann of Marburg University, who also speaks the Mainz dialect. "That is interesting from a technical perspective, but it also shows how quickly smaller language varieties can become invisible in digital applications." According to the researchers, one reason may be that dialects are primarily spoken rather than written, leaving very little text data available for analysis.
"In the long term, we need models that can process not only standard languages, but also regional and culturally significant varieties," emphasized Bui. Language technologies could help document dialects digitally and make them more accessible. The current study, funded by the Carl Zeiss Foundation as part of the interdisciplinary JGU research project "Trading Off Non-Functional Properties of Machine Learning" (TOPML), is a first step in that direction. "Going forward, however, targeted datasets and new training approaches will be needed to support linguistic and cultural diversity in the digital sphere."