Researchers from the University of Cornell discovered that artificial intelligence systems designed to identify offensive “hate speech” flag comments purportedly made by minorities “at substantially higher rates” than remarks made by whites.
Several universities maintain artificial intelligence systems designed to monitor social media websites and report users who post “hate speech.” In a study published in May, researchers at Cornell discovered that systems “flag” tweets that likely come from black social media users more often, according to Campus Reform.
The study’s authors found that, according to the AI systems’ definition of abusive speech, “tweets written in African-American English are abusive at substantially higher rates.”
The study also revealed that “black-aligned tweets” are “sexist at almost twice the rate of white-aligned tweets.”
The research team averred that the unexpected findings could be explained by “systematic racial bias” displayed by the human beings who assisted in spotting offensive content.
Where Trump’s Cabinet nominees stand in Senate confirmation process
California’s Obsession with EVs Is Turning Neighborhoods Into Minefields
Insane Close-Up Video of Philadelphia Crash Appears to Catch ‘Allahu Akbar’ at Moment of Impact
Pennsylvania gov rebuffs PETA’s demands on Punxsutawney Phil: ‘Come and take it’
DC plane crash: ATC staffing levels under scrutiny as barges arrive to help salvage ops
California city’s massive $130M deficit threatens dangerous cuts to its firefighting capacity
Republicans embrace Trump education reforms as Democrats prepare to resist them
Where the most consequential Senate races of 2026 are headed
Sen. Tillis opens up about role in Pete Hegseth’s confirmation after Hegseth’s ex-sister-in-law’s allegations
Palisades, Eaton fires in Southern California 100% contained, officials say
‘Important opportunity’: DNC chair candidates reveal how they will rebound after disastrous 2024 results
Fugitive on FBI’s 10 Most Wanted List for killing his bride in Illinois captured in Mexico
Company operating plane in Philly crash had previous fatal incident in Mexico: reports
DOJ directs FBI to fire 8 top officials, identify employees involved in Jan. 6, Hamas cases for review
Chuck Todd Abruptly Exits NBC After 18 Years
“The results show evidence of systematic racial bias in all datasets, as classifiers trained on them tend to predict that tweets written in African-American English are abusive at substantially higher rates,” reads the study’s abstract. “If these abusive language detection systems are used in the field they will, therefore, have a disproportionate negative impact on African-American social media users.”
One of the study’s authors said that “internal biases” may be to blame for why “we may see language written in what linguists consider African American English and be more likely to think that it’s something that is offensive.”
Automated technology for identifying hate speech is not new, nor are universities the only parties developing it. Two years ago, Google unveiled its own system called “Perspective,” designed to rate phrases and sentences based on how “toxic” they might be.
Shortly after the release of Perspective, YouTube user Tormental made a video of the program at work, alleging inconsistencies in implementation.
According to Tormental, the system rated prejudicial comments against minorities as more “toxic” than equivalent statements against white people.
Google’s system showed a similar discrepancy for bigoted comments directed at women versus men.
Story cited here.