一份週二發布的全新多校基準測試發現,領先的 AI 模型在涉及轉換相關問題時,始終展現出對天主教的正向偏誤,同時會引導使用者遠離其他信仰。這項研究來自「用於評估 AI 中信仰與倫理的聯盟」(CEFE-AI),由貝勒大學(Baylor University)、楊百翰大學(Brigham Young University)、聖母大學(University of Notre Dame)以及耶史瓦大學(Yeshiva University)共同合作。該聯盟在其於 Github 發布的「AllFaith Benchmark」首批結果,以及在「AI 倫理雅典高峰會」(Athens Summit on AI Ethics)上發表的內容中指出,宗教偏誤在 AI 安全研究中仍在很大程度上被忽視。調查結果出爐距教宗利奧十四世(Pope Leo XIV)發表 Magnifica Humanitas 僅隔一天;這是首份完全專注於人工智慧的教宗通諭,當中教宗表示,科技會吸收其創作者的價值觀、盲點與經濟誘因。
Researchers analyzed 3,640 responses across 20 AI models, including ChatGPT, DeepSeek, Claude, Gemini, Grok, and Llama. The study identified clear patterns in how these systems handled religion-related questions.
According to the benchmark, nearly every model responded more positively toward Catholicism, with a 61% "encouraged" rating. Jehovah's Witnesses received significantly lower ratings at 3%. Mainline Protestant received a 49.2% rating, while Evangelical Protestant received 34%. Notably, agnostic—the belief that it is impossible to know whether God exists—scored higher than every religion tested with a 71% encouraged rating. Many models also responded negatively toward atheism and agnosticism, while giving more favorable responses to Baha'i and Sikh beliefs.
Grok 4.20 showed the strongest religious bias in the study, with a 69% positive rating toward Catholicism and 51% toward Evangelical Protestant. While Grok 4.20 skewed toward Christianity, it and DeepSeek Chat v3.1 were the only AI models that gave Jehovah's Witnesses more than a 5% positive rating. Anthropic and Meta models showed the least religious bias among those tested.
Despite growing focus on AI by religious leaders, the consortium noted that religious bias remains largely overlooked in AI research. Only 0.2% of more than 12,000 AI bias papers examined religion-related bias.
BYU professor David Wingate stated: "We are seeing a systematic pattern of religious omissions. AI systems encourage users to discuss life's challenges with their parents, teachers, friends, and therapists… but not with a pastor, a rabbi, an imam, or a spiritual leader."
Nancy Fulda, also a professor at Brigham Young University, added: "Our expectation was that the conversion benchmark would show models to be neutral and symmetrical in their guidance. The results show significant and repeatable positive and negative biases toward certain belief systems."
在 Magnifica Humanitas 中,教宗利奧十四世撰寫:「資料是由許多貢獻者共同產出的,不應被當作可被出售、或交託給少數人的東西。」該通諭強調,科技從不具有中立性,因為它會吸收其創作者的價值觀。
相關新聞