Xbox moderation team turns to AI for help filtering a flood of user content

Real Hacker StaffNovember 7, 2023

4 3 minutes read

Artist interpretation of the creatures talking about your mom on Xbox Live last night.

Aurich Lawson / Thinkstock

Anyone who’s worked in community moderation knows that finding and removing bad content becomes exponentially tougher as a communications platform reaches into the millions of daily users. To help with that problem, Microsoft says it’s turning to AI tools to help “accelerate” its Xbox moderation efforts, letting these systems automatically flag content for human review without needing a player report.

Microsoft’s latest Xbox transparency report—the company’s third public look at enforcement of its community standards enforcement—is the first to include a section on “advancing content moderation and platform safety with AI.” And that report specifically calls out two tools that the company says “enable us to achieve greater scale, elevate the capabilities of our human moderators, and reduce exposure to sensitive content.”

Microsoft says many of its Xbox safety systems are now powered by Community Sift, a moderation tool created by Microsoft subsidiary TwoHat. Among the “billions of human interactions” the Community Sift system has filtered this year are “over 36 million” Xbox player reports in 22 languages, according to the Microsoft report. The Community Sift system evaluates those player reports to see which ones need further attention from a human moderator.

That new filtering system hasn’t had an apparent effect on the total number of “reactive” enforcement actions (i.e., those in response to a player report) Microsoft has undertaken in recent months, though. The 2.47 million such enforcement actions taken in the first half of 2023 were down slightly from the 2.53 million enforcement actions in the first half of 2022. But that enforcement number now represents a larger proportion of the total number of player reports, which shrank from 33.08 million in early 2022 to 27.31 million in early 2023 (both numbers are way down from 52.05 million player reports issued in the first half of 2021).

A look at where the Bletchley system "safety scans" sit in Microsoft's Xbox image moderation queue. — Enlarge / A look at where the Bletchley system “safety scans” sit in Microsoft’s Xbox image moderation queue.

The shrinking number of player reports may be partly due to an increase in “proactive” enforcement, which Microsoft undertakes before any player has had a chance to report a problem. To aid in this process, Microsoft says it’s using the Turing Bletchley v3 AI model, an updated version of a tool Microsoft first launched in 2021.

This “vision-language” model automatically scans all “user-generated imagery” on the Xbox platform, including custom Gamerpics and other profile imagery, Microsoft says. The Bletchley system then uses “its world knowledge to understand the many nuances for what images are acceptable based on the Community Standards on the Xbox platform,” passing any suspect content to a queue for human moderation.

Microsoft says the Bletchley system contributed to the blocking of 4.7 million images in the first half of 2023, a 39 percent increase from the previous six months that Microsoft attributes to its AI investment.

Growth in “inauthentic” accounts

Such semi-automated image takedowns are dwarfed, however, by the 16.3 million enforcement actions Microsoft says are “centered around detecting accounts that have been tampered with or are being used in inauthentic ways.” This includes accounts used by cheaters, spammers, friend/follower account inflaters, and other accounts that “ultimately create an unlevel playing field for our players or detract from their experiences.”

Actions against these “inauthentic” accounts have exploded since last year, up 276 percent from the 4.33 million that were taken down in the first half of 2022. The vast majority of these accounts (99.5 percent) are taken down before a player has a chance to report them, and “often… before they can add harmful content to the platform,” Microsoft says.

Enlarge / Inauthentic accounts (e.g., cheaters, spammers) made up the overwhelming majority of Xbox moderation actions in the first half of 2023.

Elsewhere in the report, Microsoft says it continues to see the impact of its 2022 decision to amend its definition of “vulgar content” on the Xbox platform to “include offensive gestures, sexualized content, and crude humor.” That definition helped lead to 328,000 enforcement actions against “vulgar” content in the first half of 2022, a 236 percent increase from the roughly 98,000 vulgar content takedown six months prior (which was itself a 450 percent increase from the six months before that). Despite this, vulgar content enforcement still ranks well behind plain old profanity (886,000 enforcement actions), harassment or bullying (853,000), “adult sexual content” (695,000), and spam (361,000) in the list of Xbox violation types.

Microsoft’s report also includes bad news for players hoping to get a ban or suspension overturned; only about 4.1 percent of over 280,000 such case reviews were reinstated in the first six months of 2023. That’s down slightly from the 6 percent of 151,000 appeals that were successful in the first half of 2022.

Since the period covered in this latest transparency report, Microsoft has rolled out a new standardized, eight-strike system laying out a sliding scale of penalties for different types and frequencies of infractions. It’ll be interesting to see if the next planned transparency report shows any change in player or enforcement behavior with those new rules in effect.

Source link

New Close-Up Video Shows Sun’s ‘Fluffy’ Corona in Extreme Detail

Hamas chief Haniyeh discusses Gaza truce talks with Egypt, Qatar officials | Israel War on Gaza News

Maya used hallucinogenic plants in “ensouling” rituals for their ball courts

Google brings passkey support to its Advanced Protection Program ahead of the US presidential election

I never expected the Meta Quest to get this beloved gaming franchise

Apple changes controversial EU Core Technology fee again, and confirms major App Store changes are coming to iPad

SK hynix Reports That 2025 HBM Memory Supply Has Nearly Sold Out

Best Sonos Setup (2024): Which Speakers and Soundbars Should You Buy?

Peter Andre finally confirms baby daughter’s sweet name

One UI 6.1 update with Galaxy AI rolls out to Galaxy S21 series, Galaxy Z Fold 4, Flip 4, Fold 3, and Flip 3

Xbox moderation team turns to AI for help filtering a flood of user content

Growth in “inauthentic” accounts

Real Hacker Staff

How to get & use jet-pack in Lethal Company

Cinematic Experiences Through Headphones Inside the Car Using Cingo Technology from Fraunhofer IIS

Twitch viewers baffled as new ‘topless’ meta goes viral on platform

5 things we learned from the Epic-Google antitrust case this week

Rocket Report: Beyond Gravity to study fairing reuse; North Korea launches satellite

Games Like The Witcher 3 Helped Popularize The Sex Game Genre

Quordle today – hints and answers for Sunday, November 5 (game #650)

Logan Paul wins first WWE championship title against wrestling legend Rey Mysterio

Diablo 4 Devs Drop a Big Hint About the Highly Sought-After Cow Level

Boxer Félix Verdejo sentenced to life in prison for grisly killing of pregnant lover

‘Biden, you can’t hide’: Tens of thousands march in US for Gaza ceasefire | Israel-Palestine conflict News

Growth in “inauthentic” accounts

Lucid revises 2023 production amid softening demand for luxury EVs

Celebrate Amazon Alexa's birthday with these crazy Black Friday deals

Related Articles

Games Like The Witcher 3 Helped Popularize The Sex Game Genre

Quordle today – hints and answers for Sunday, November 5 (game #650)

Logan Paul wins first WWE championship title against wrestling legend Rey Mysterio

Diablo 4 Devs Drop a Big Hint About the Highly Sought-After Cow Level

Boxer Félix Verdejo sentenced to life in prison for grisly killing of pregnant lover

‘Biden, you can’t hide’: Tens of thousands march in US for Gaza ceasefire | Israel-Palestine conflict News