The one problem with AI content moderation? It doesn’t work

[ad_1]

No one was quite sure where the never-ending flurry of fires and Indonesian car crashes were coming from. But the system would keep flagging them, according to Josh Sklar.

The former content moderator once worked on the team at Instagram that assessed posts flagged by artificial intelligence (AI) as likely to be problematic. And while the system would regularly catch that illicit content, the number and the nature of false positives was confusing at best.

In the UK, the onus on social media platforms to moderate toxic content online is only set to become more intense with the passage of the Online Safety Bill. One clause in particular – calling on platforms to “prevent” users ever encountering dangerous content – has convinced many that the platforms will turn to greater automated moderation to try to solve the problem.

The only problem? It might not work.

Last year, the BBC tried to use an AI tool to measure the scale of toxicity faced by politicians online. It identified that some 3,000 “toxic” tweets are sent to MPs every day.

The issue was the AI defined “toxic” to mean anything “rude, disrespectful or unreasonable”, meaning plain descriptive words like “Tory” and “hypocrite” were often flagged. One Twitter user pointed out that the tool labelled anti-trans vitriol as less toxic than calling someone a “transphobe”.

In many ways, that struggle to define “toxic” is at the core of what is the problem for AI content moderation systems. That their aim – to moderate and “fix” the unspoken flaws and dangerous sentiments in the grey world of human interaction – is hard for a machine learning system to fully achieve.

“The AI would work well in proactively removing the worst stuff, as they do now with images of violence, for example,” says Eugenia Siapera, director of the centre for digital policy at University College Dublin. “But the harder decisions cannot be automated.”

Related Stories

Snaptik: The Ultimate Solution for Downloading TikTok Videos without Watermark

Mitsubishi Air Handler And Heat Pump | Advanced Features Explained

UK police have ‘culture of retention’ around biometric data

You may have missed

Shed Pounds Swiftly: Your Ultimate Guide on How to Lose Weight Fast

The Crucial Role of Cryptocurrency Exchange Consultants in Today’s Digital Economy

Weight Lifting Unveiled: A Beginner’s Guide to Building Muscle Safely

Healthy Habits for a Healthy Heart: Lifestyle Changes to Lower Cholesterol

About

Categories

Latest Posts

Automated moderation in practice

Human oversight

A fundamentally social problem