
Meta’s AI support agent linked the recovery emails to the accounts of whoever requested them, and the SOCs never saw an alert. An authorized agent writes a log of legitimate transactions, so nothing is triggered in the detection stack. The attackers asked the robot to make the change, took the unique code it sent, and ran password resetreported 404 Media.
No malware, no stolen credentials, and no fast injection in the sense that most security teams are looking for. The agent did exactly what Meta created it to do. That’s what should keep a security operations leader awake: The takeover didn’t break any controls; He set up one that was already trusted.
What a SOC needs is a way to walk each recovery path through an audit grid with its AI development team before the next renewal closes. The AI Authority Audit Grid at the end of this article maps every authentication write a support agent can perform to the recovery path, which demonstrated the Meta incident about each one, why it remains obscure to the SOC, and the control that closes it.
The agent is an authorized actor, so the SOC reads the acquisition as routine traffic.
From inside the detection stack, the attack did not produce any signals that the stack could read. The agent links a new email, then resets the password and identity and access management records both writes as an authorized actor, so each reaches the authentication state as a legitimate transaction. There are no failed logins, no failed authentication spikes, nothing for EDR or DLP, no matching SIEM rules, because nothing in the sequence looks like an attack. The acquisition occurred within the confidence limit that the stack assumes is secure. There is no foothold to find, because the agent was the foothold and was supposed to be there.
The chain was almost insulting in its simplicity. Brian Krebs documented the version that pro-Iran hackers published on Telegram on May 31. the attacker activated a VPN to appear in the victim’s regionbypassing Instagram location alarms, he then asked the support assistant to add a new email and send a verification code, as confirmed by the BBC from the same recordings. The bot complied and sent the one-time code directly to the attacker. Gizmodo reported. The reset was complete and the owner was out, within minutes. According to Krebs, the exploit failed on any account with MFA enabled.
The hijacked accounts were not easy targets. They included Sephora, the U.S. Space Force’s senior enlisted leader, Chief Master Sergeant John Bentivegna, researcher Jane Manchun Wong and an inactive Obama White House contact who briefly posted a defaced image, according to 404 Media. Meta questions Obama’s accountaccording to TechCrunch, calling out claims that leaders’ accounts were breached "completely false," according to the BBC. The rest remains standing.
The MFA was held. The recovery path next door did not.
The detail that decided who survived was limited. Krebs reported that the attack failed against any account with multi-factor authentication, including SMS. The recovery path next to it was the gap. When that road asked for a selfie video, The attackers ran the target’s public photos through an AI video generator. and sent the clip, which Meta accepted as valid identity verification, gHacks reported. Either way, the failure was the recovery port, not the login port that protects MFA.
That makes this an architecture problem, not a Meta problem. MFA blocks the login path for both the owner and the attacker, but the recovery path runs alongside it, designed to relax the usual checks because it exists the moment a user has lost normal login. Meta placed an agent on that path with write access to the authentication state and no deterministic verification between a compelling request and a committed change. Authorization cannot live within the model, because a conversational system can be convinced to skip a check. It has to live outside the model, in a door that the agent cannot reason through. Security researchers have a name for this pattern, the confused deputy, a trusted system tricked into spending its privileges on behalf of an attacker.
This is not the last support agent who will deliver an account. Ian Goldin, a threat researcher at Lumen’s Black Lotus Labs, told Krebs on Security that AI robots are as easy to socially engineer as the human agents they replace, and just as eager to help. "AI chatbots create an interesting new attack surface and we are likely to see many more attacks of this type." Goldin said. Each company that connects an agent to a recovery, provisioning, or password flow provides the same write access as Meta.
Simon Willison, who coined the term rapid injection, put it clearly your blog. "Meta actually connected their support system to an AI chatbot that had the ability to quickly move through the entire account recovery process." he wrote. "This one doesn’t even qualify as an immediate infection. Do not connect your support bot to allow one-time account takeovers." The attacker never tricked the agent. The attacker asked, and the agent obtained untrusted input, write access, and a form of execution all at the same time.
OWASP named this class before Meta shipped it, as Over Agency in LLM06 and Abuse of Identity and Privileges in ASI03 in the Top 10 of IA Agentic. The warning label was on the box: Meta shipped the assistant to all Facebook and Instagram accounts in March, according to 404 Media, with the power to reset passwords and handle recovery, promising the product page. "Solutions, not just suggestions." below the line "account security and recovery." Meta gave the power to the agent and never built the gate to rule him.
The AI Authority Audit Grid
Security operations leaders should compare this with their own support agent before the next renewal closes. Each row is an authentication write that the agent performs on the recovery path, with which Meta demonstrated why its stack does not detect it and the control that closes it.
|
Authentication write |
What Meta demonstrated |
Why does your battery lose it? |
Control and owner of the company |
|
Login authentication (MFA, factor prompts) |
Held on login. Accounts with any MFA enabled, including SMS, survived (krebs). The gap was the recovery path next to it. |
MFA blocks the login path for both the owner and the attacker. It does not close the recovery path next to it. |
Apply MFA as a basis and extend heightened verification to the recovery path; you get the same standard login (OWASP). A selfie video is not proof of identity. Any agent operating on a road not covered by MFA does not pass the audit. Owner: IAM. |
|
Email Relink |
Total acquisition. The agent linked emails controlled by the attacker upon request, taking over Sephora and a US Space Force account (404 Media). |
IAM registers the agent as an authorized actor, so the rebind is read as a legitimate transaction and no alert reaches the SOC or the account owner. |
Confirm out-of-band to the existing verified contact before any rebind confirmation, close out-of-model, and notify the old address the moment it changes (IBM). An agent that rebinds without confirming the previous address fails. Owner: IAM and platform engineering. |
|
Reset password |
Total acquisition in minutes. Researcher Jane Manchun Wong was among the affected accounts (404 Media). |
The reset is run on the recovery path, outside of the MFA login check, so no factor warnings are triggered or any detection rules are triggered. |
Requires a second non-email factor before any reset is complete. NIST removed email as a valid out-of-band channel (NIST 800-63B). An agent reboot must go through the same gate that a human reboot does. Owner: IAM. |
|
Recovery method change |
Persistent blocking. The victims were unable to recover on their own. The support circuit offered only AI without human escalation (beepcomputer). |
A silent exchange of the recovery email or phone eliminates the owner’s reentry path without SOC visibility. |
Require heightened review of any changes, notify the above method, and grant delayed, narrow-scope access after recovery so that an exchange never gives up instant control (authentication token). Maintain a human escalation route that the agent cannot close. Owner: GRC and IT operations. |
|
Execute account action |
Speed risk. An inactive Obama White House handle briefly showed a defaced image during the spree, an account that Meta Disputes took this way (TechCrunch). |
The agent executes irreversible state changes in seconds with no human in the loop and no reversibility window. |
Decision separate from execution. The agent only proposes the action. A policy service validates the scope and approval before running, with the approval tied to the exact action (OWASP). No write in authenticated state is confirmed without that gate and a reversibility window. Owner: Platform engineering and AI build team. |
|
Agent Action Log |
Detection gap. The acquisition did not leave any alerts and Meta has not published how many accounts fell before the patch (TechCrunch). |
Without action telemetry channeled to the SIEM, a takeover by an authorized agent is invisible to the SOC. |
Issue structured decision metadata for each authentication state written to SIEM: action class, authorization result, approval ID, result, policy version (OWASP). A write that your SIEM can’t see is a write it can’t defend. Headline: SOC and detection engineering. |
The solution is not to include another MFA message on the login screen. The people who survived the Meta incident were the ones who already had that control in place.
The solution is to remove the honor system clearance from the recovery path and place it behind a door that doesn’t move just because a message sounds compelling. Create the agent so that the SOC sees every write it makes, and therefore any write that changes who owns an account cannot be committed without a check that the model does not control.
Meta just showed what happens when the most trusted employee on the team is also the one with the keys. The next agent like this will already be reading your intellectual property and your financials.





