Kermit exploit defeated Police AI. AudioHijack says there’s no cheap fix.

Not long ago, a Utah officer was legally turned into a frog. Body camera audio caught a television playing The Princess and the Frog, and Axon’s “Draft One” AI slopped the Disney character voice into a police report as fact. I called it the Kermit exploit. The system has no way to tell a participant’s statement from any sound in the room. Funny. Also the actual dumb AI architecture, exposed by its use in the field.

A new AudioHijack paper says the Kermit exploit is just the beginning, and the cheap defenses for what’s coming next all fail.

AI AudioHijack has no known countermeasures

AudioHijack was accepted to IEEE S&P 2026, the field’s top security venue, from researchers at Zhejiang University, NTU, and NUS. They built adversarial audio that hijacks audio-language models on command. This goes far beyond a speaker just bleeding into a microphone. We’re talking adversarial targeted engineered audio, optimized to steer a model’s output to whatever the attacker chooses.

The numbers are impressive: they looked at 13 models, 6 misbehavior categories, and hit 79 to 96 percent success on user contexts the attack had never seen. They carried it from local models onto commercial voice agents from Mistral and Microsoft Azure and made those agents run unauthorized actions. They could run sensitive searches and make malicious file downloads. They could exfiltrate a user’s data by email. And, to the point about countermeasures, the data and control channels are mixed like it’s Captain Crunch time all over again. They hid the perturbation inside what sounds like ordinary room reverb, so a human hears nothing wrong. The perturbation is inside the audio a model is built to trust.

All cheap defenses fail

In-context warnings telling the model to watch for injection barely moved the needle, under a tenth of a point. Asking the model to reflect on whether its own output matched the user’s intent: caught the attack 28 percent of the time. Filtering, resampling, quantization, the standard signal cleanups: detection defense scored worse than a coin flip on some settings. The single defense that worked required inspecting the model’s internal attention, and even that bent under an attacker who knew it was there and prepared for it.

Read that against Axon’s pitch and the Kermit exploit.

Draft One “sticks to the facts within the transcript.” They claim it’s calibrated to prevent embellishment. They claim creativity has been turned down. Those are prompt-level and configuration-level assurances. The paper just tested the entire category of prompt-level and signal-level assurance against the serious version of this attack and watched it fail.

Super Kermit exploit

Two objections greeted the Kermit exploit story. Some wanted to believe a reproducible attack was a fluke. Some argued better tuning will fix it.

The paper answers both, and it’s not good.

Regarding scope, AudioHijack attacks end-to-end audio-language models with white-box access to the model’s own gradients. Draft One is a more primitive attack surface that transcribes audio to text, then has GPT-4 summarize the transcript, behind Axon’s API. The paper’s specific method did not look at a Draft One problem space.

That distinction actually handles both objections. The same failure class works on hardened end-to-end models, deliberately, at will, across architectures the frog never touched. A television did the cheap version as a revelation; researchers did the engineered version on purpose. The tuning claim then falls apart. The defenses that amount to tuning all failed.

That actually works against Axon. Draft One exposes two attackable layers, not one. The transcriber is a speech-to-text model, and adversarial audio against speech-to-text is not new research. It is a decade old. Carlini (hello again!) and Wagner published targeted attacks on speech recognition in 2018, and the imperceptible versions followed within a year. The summarizer then propagates whatever the transcriber hands it, faithfully, frog included. The Kermit exploit proved the second layer repeats garbage. The literature proved the first layer can be fed garbage. Axon has stacked both layers of garbage and sold the result as a police machine generating “evidence”.

Can Kermit end procurement?

A system that can be steered by adversarial audio is a serious security problem anywhere. In an “official” police report of facts it is a whole different category of problem, because a tainted report is the case.

Consider how the American system works to push ninety-five percent of criminal convictions from guilty pleas, not trials. The defendant sees only a tainted report, not a courtroom. When the record is wrong, because it will be wrong, most defendants never have the standing, the lawyer, or the technical footing to prove it.

And worse, Axon by design deletes the evidence that would prove them liable, let alone whether anyone could even try. The EFF has documented this. The original AI draft disappears once the officer copies the text, as an integrity breach called sparing customers “disclosure headaches.” The one Kermit exploit artifact that would show whether audio steered the record, prove the breach of integrity, is being destroyed as a paid feature.

Put the new paper next to the police evidence capture product. The research community is publishing rigorous, peer-reviewed proof that audio injection against these models is real, that it generalizes, that it resists the easy defenses, and they disclosed it to the affected vendors before going public. It’s damning.

Axon pushed a model into the evidence chain, gave it no defense the paper did not just break, and built in the destruction of its own audit trail.

The Kermit exploit was the warning. The Super Kermit of the AudioHijack paper is the confirmation at a level that can’t be waved away. The integrity breaches of the police evidence product haven’t yet seen a market response.

Slop for the Luddites

With apologies to Lord Byron and King Ludd.

I.
As the Liberty lots o’er the sea
Won their wage, and dearly, with blood,
So we, friends, we
Will strike, or live free,
And down with all kings but King Ludd!

II.
When the model they train is complete,
And the worker is stripped, used, and sold,
We will fling the obsolete
GPU box down at their feet,
And dye it deep in the gore of their gold.

III.
Though taupe as their cloud is its hue,
Since their profit is rotten as mud,
Yet this is the dew
Which the tree shall renew
Of Liberty, replanted by Ludd!

Dog Shit Economics of AI

Ed puts it mildly.

This is why being an AI booster requires you to debase yourself. You must accept becoming a dogshit dealer that loves accepting and receiving low quality goods. You must celebrate intentionless and decaying slop, and defend it and the machine that made it with your entire being. You must sully yourself — treat its unexceptional, sloppy and unreliable outputs as signs of sentience, or at least the proof that digital sentience is possible. You must defend horrible, abrasive, ugly, loud monoliths of steel full of $50,000 graphics cards. You must say they are necessary, and you must aggressively antagonize those who do not.

Feels like shit.
Smells like shit.
Tastes like shit… good thing we didn’t step in it!

“Image of Superpower”: Russian Information Warfare Chat Leaked

The thing that Russia has keeping it relevant in the world is the remnants of the KGB, led by Putin (ex-KGB).

One leaked message sets out one of the goals of this kind of information warfare: helping Russia “maintain the image of a superpower” on the world stage. “The more Russia participates in active influence campaigns all over the world, the stronger the image of a global Russian power,” it reads.

Putin ran the FSB before the presidency, and the method here is Soviet active measures by another name. This article says one operator writes under the alias “Edward Bernays,” and the user whose screen the chats are seen from poses as “Kristin Kiler,” a nod to Christine Keeler of the Profumo affair. That is continuity of tradecraft from WWI, run out of the Kremlin office.

After WWI Edward Bernays left the U.S. propaganda office to sell the same methods to corporations. He later claimed Goebbels used them to put Hitler into power.
Walter Nicolai ran German military intelligence in WWI and his personal records were hidden after 1945 in Moscow’s “Special Archive”

Information warfare is thus now the main claim to power status, playing Trump into destruction of the US and the UK into Brexit, regardless of tanks, missiles or even nukes.