Key findings:
45% of all AI answers had at least one significant issue.
31% of responses showed serious sourcing problems – missing, misleading, or incorrect attributions.
20% contained major accuracy issues, including hallucinated details and outdated information.
Gemini performed worst with significant issues in 76% of responses, more than double the other assistants, largely due to its poor sourcing performance.
Comparison between the BBC’s results earlier this year and this study shows some improvements but still high levels of errors.
#ContraAI 🤖 we know this already, but here’s the same warning substantiated in a new study by BBC and the EBU with more data: do not rely on #AI for anything, including the news. www.bbc.co.uk/mediacentre/...