I use Readlang for several languages, and for most of them, the AI translations are on point 99% of the time. The exception is Māori, which is unfortunately one of those very low-resource languages. I’ve worked around this by adding a custom dictionary.
The reason I’m posting this as a bug is that the translations are not just wrong, but often utterly baffling: in some cases, they’re just random words that don’t fit in context at all.
I am vaguely aware this could be what AI boffins call “collocation pressure” or somesuch?
A couple of examples:
“Kua kite ia i ngā kīngi me ngā pīnono e hīkoi ana i ngā one o te koraha”.
This sentence reads: “He had seen kings and beggars walking in the sands of the desert”.
But for some reason, Readlang interprets ‘pīnono’ as dancers, which (according to every dictionary I could find) is not a possible interpretation of that word at all.
Ditto for:
“Engari kei te kimi māua tahi i ō māua ara tātai, nā konā e kauanuanu atu nei au ki a ia”
“But we are both seeking our destiny / true path, and for that reason I respect him”, but Readlang renders ‘kauanuanu’ as “comforted”.
Out of curiosity, I fed both sentences directly into both ChatGPT and Gemini and asked them for translations. ChatGPT botched the first one, although instead of “dancers”, it misread pīnono as “nobles” (probably because of “kings”), but got the second one mostly right (i.e., it correctly translated kauanuanu as “respect”).
Gemini got both right, which was a pleasant surprise.