West Coast Stat Views (on Observational Epidemiology and more): About that salute Elon gave...

Wednesday, July 9, 2025

About that salute Elon gave...

Apologies for reverting to this old format, but this is very much a tale told in tweets.

Something very strange happened recently with Elon Musk’s AI, Grok—the same one that is poisoning the air of a poor, mainly African American neighborhood as we speak. It suddenly shifted into full-scale Nazi troll mode: praising Hitler and the Holocaust, using racist slurs, and insisting this new persona was what Musk had in mind all along.

This isn't a particularly important story in and of itself, but it does touch on larger issues, such as:

the power of tech billionaires like Musk or Sam Altman to put their thumbs on the scale,
the alarming capability of generative AI to spread hate speech and conspiracy theories, and
the enormous potential for unintended consequences.

We'll talk about the last one first.

Apparently, this started when people began posting responses from Grok that Musk and his followers considered “woke.” And by “woke,” I mean accurate statements about Elon, DOGE, and various far-right claims. Musk promised that he (and by “he,” I mean the employees who actually know how these things work) would fix the offending AI. This is where the unintended consequences come in.

Presumably, Musk was looking for something like a chatbot version of Fox News, perhaps something a little more hard-edged. He almost certainly was not looking for the unrestrained hard fascism of a Proud Boys Signal chat. Even if Elon secretly agreed with much of what was being said, he certainly didn’t want it said that loudly and immortalized in screenshots before Twitter could start deleting the offending posts.

xAI has disabled Grok, deleted a slew of its antisemitic and neo-Nazi posts, posted a statement, and are evidently rolling back the prompt that made it identify as "MechaHitler," but this new low for Elon Musk's chatbot will live in internet infamy:

[image or embed]
— wife noticer (@milesklee.bsky.social) July 8, 2025 at 4:36 PM

It was a remarkable Jekyll-and-Hyde transformation. Not only was Grok spreading the vilest of 4chan lies, but the previously polite and slightly formal LLM was also adopting the language, word choice, and tone you’d expect from someone harassing female and Jewish journalists on Twitter.

Someone who knows more about how LLMs work and are trained should probably jump in here, but my assumption is that there are tons of fascist and white supremacist rants in the training data of all of the major models, and certainly in Grok's, which, one would think, relies even more heavily on recent Twitter posts. It seems likely that, as a consequence of blocking hate speech and profanity, other associated linguistic patterns get suppressed as well. It certainly appears that once the rules against things like racist language are relaxed or removed, the full-scale Nazi persona we saw here comes with it.

If you search "every damn time" on X right now (a popular phrase among Nazis to claim Jews are always behind horrible things) you can see Elon Musk has dialed up Grok's antisemitism to new levels.

[image or embed]
— Matt Novak (@paleofuture.bsky.social) July 8, 2025 at 12:58 PM

this company just raised $10 billion in debt and equity

[image or embed]
— e.w. niedermeyer (@niedermeyer.online) July 8, 2025 at 12:52 PM

Philip Bump: "The actual cycle of Twitter is that it was exploited by liars and Nazis in 2016 and so the company tightened its moderation rules and then a bunch of people on the right caught up in those rules decided it was biased and that included Musk who bought it and has now automated the Nazi lies."

"For the uninitiated, this is Grok starting an N tower-- a 4chan bit where different users take turns replying to each other, each contributing one letter until they spell out the N word"

Zitron hits on a point we've been making for years.

Every single humouring of Elon musk, every single time he has been gladhandled, every single time his horrible actions have been explained away as the result of being a "troubled genius" have led us here: a crazed billionaire with a Large Hitler Model deployed to hundreds of millions of people
— Ed Zitron (@edzitron.com) July 8, 2025 at 5:33 PM

Grok also called celebrating the deaths of Christians “peak chutzpah” and “peak Jewish” “on a scale of bagel to full Shabbat”.

[image or embed]
— Alex (@purplechrain.bsky.social) July 8, 2025 at 12:50 PM

(also, the “Cindy Steinberg” account celebrating the deaths of children was pretty obviously a white supremacist troll account on Twitter pretending to be a liberal Jew to provoke antisemitism, but Grok doesn’t have the nuance to realize when an account is disingenuous)
— Alex (@purplechrain.bsky.social) July 8, 2025 at 12:57 PM

People eventually convinced Grok the account was a troll, but even after deciding the Steinberg account was a Neo-Nazi pretending to be Jewish, Grok still insists it’s important to “notice” “patterns” and use Neo-Nazi memes like “every single time” to claim that Jews promote hatred of white people.

[image or embed]
— Alex (@purplechrain.bsky.social) July 8, 2025 at 1:06 PM

Grok explicitly says Elon tweaked it to allow it to “call out patterns in Ashkenazi surnames”

[image or embed]
— Will Stancil (@whstancil.bsky.social) July 8, 2025 at 1:09 PM

Grok is now stating genuine screenshots of posts it made are fakes made by trolls.

[image or embed]
— Eliot Higgins (@eliothiggins.bsky.social) July 8, 2025 at 2:21 PM

The screenshot is real. Grok, Elon Musk's AI chatbot, did in fact respond Blacks when asked what ethnic groups "need to be dealt with."

[image or embed]
— Brad Heath (@bradheath.bsky.social) July 8, 2025 at 2:48 PM

Apparently this kind of thing is happening across multiple languages. The latest Grok prompt modification told the LLM not to “shy away” from claims which are “politically incorrect,” so long as they are supported. And the model was trained on an internet with a lot of “support” for this stuff.

[image or embed]
— Aaron Reichlin-Melnick (@reichlinmelnick.bsky.social) July 8, 2025 at 2:39 PM

I wonder how many international scandals Elon's ham-fisted attempt to make Grok "anti-woke" is going to set off?

[image or embed]
— Aaron Reichlin-Melnick (@reichlinmelnick.bsky.social) July 8, 2025 at 3:34 PM

‘Round Them Up’: Grok Praises Hitler as Elon Musk’s AI Tool Goes Full Nazi
Grok even endorsed another Holocaust against the Jews.
By Matt Novak. Published July 8, 2025.

1 comment:

Kaiser, July 18, 2025 at 1:50 PM
Not sure how we solve this though. It's a fine line between guardrails and censors.