How to detect AI bias?
"Statistically significant" ≠ practically meaningful. Effect size matters - and template choice dramatically changes what #gender bias you detect.
Our fix: #mixedeffects + #perplexity weighting for robust detection.
arxiv.org/abs/2502.15600
#AI #NLP