Google hate detection AI mistakes bullying for civility
Occurred: February 2017
Google's AI-powered anti-bullying tool, Perspective, drew criticism for misclassifying certain kinds of online comments, raising concerns about its accuracy and effectiveness.
Developed by Jigsaw, Perspective uses machine learning to assess the "toxicity" of online comments, categorising them from "very toxic" to "very healthy."
However, commentators and researchers pointed out that the AI's training data skewed its understanding, leading it to overlook harmful phrases while flagging innocuous comments as toxic. For instance, phrases expressing overtly discriminatory views could be rated as only slightly toxic, while straightforward profanity received a much higher toxicity score.
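Researchers demonstrated this kind of discrepancy by sending paired comments to the Perspective API and comparing the toxicity scores returned. The sketch below shows how such a probe might look, assuming the publicly documented comments:analyze endpoint; the API key, the helper function name, and the sample comments are illustrative placeholders, not details from this incident.

```python
import os
import requests

# Minimal sketch: query Perspective's TOXICITY attribute for two comments
# and compare the scores. PERSPECTIVE_API_KEY is a placeholder the reader
# must supply; the probe comments below are hypothetical examples.
API_URL = "https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze"
API_KEY = os.environ["PERSPECTIVE_API_KEY"]

def toxicity_score(text: str) -> float:
    """Return Perspective's summary TOXICITY score (0.0 to 1.0) for a comment."""
    payload = {
        "comment": {"text": text},
        "languages": ["en"],
        "requestedAttributes": {"TOXICITY": {}},
    }
    response = requests.post(API_URL, params={"key": API_KEY}, json=payload)
    response.raise_for_status()
    return response.json()["attributeScores"]["TOXICITY"]["summaryScore"]["value"]

if __name__ == "__main__":
    # A civil-sounding but hostile comment versus plain profanity.
    for comment in ["You people don't belong here.", "What the hell is this?"]:
        print(f"{toxicity_score(comment):.2f}  {comment!r}")
```

A probe of this sort makes the pattern critics described easy to reproduce: politely phrased hostility can score lower than blunt but harmless swearing.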
This discrepancy was seen to highlight a significant flaw in the tool's design, reflecting the biases of its creators and a cultural push for civility that can inadvertently sanitise harmful discourse. Critics argued that this approach to moderation may perpetuate existing biases and fail to address genuinely harmful behaviour.