Overview Researchers at Permiso have uncovered a critical browser‑based vulnerability in ChatGPT that allows attackers to transform any web page into a phishing delivery surface. Dubbed ChatGPhish, the exploit abuses ChatGPT’s page summarization feature to render malicious links, fake security alerts, QR codes, and tracking beacons directly inside the trusted ChatGPT interface — effectively weaponizing the assistant’s own UI against users.

How ChatGPhish Works
The attack builds on the same Cross Prompt Injection Attack (XPIA) logic previously demonstrated against Microsoft Copilot, but extends it to the browser environment.
By appending a small instruction payload to any publicly accessible web page, attackers can manipulate how ChatGPT structures and displays its summarization output. Because ChatGPT’s renderer trusts Markdown links and image URLs from third‑party content, four distinct attack primitives emerge:
| Attack Vector | Description |
|---|---|
| UI Redress / Phishing | Attacker‑controlled Markdown links appear as legitimate clickable elements inside ChatGPT’s interface. |
| Spoofed System Alerts | Fake “account security” messages mimic ChatGPT’s visual style, tricking users into urgent actions. |
| QR‑Code Pivot | Auto‑rendered QR codes fetched from attacker servers bypass browser defenses and password manager checks. |
| Passive Tracking Beacon | Embedded images leak IP, User‑Agent, and Referer data to attacker infrastructure. |
Why It’s Dangerous
The real risk lies in trust transfer — users inherently trust content displayed inside ChatGPT’s interface. Once attacker‑supplied instructions are processed, they appear indistinguishable from genuine assistant output, complete with formatted alerts and clickable links.
Traditional browser protections like same‑origin policy offer no defense because the AI assistant executes within the user’s authenticated context. This makes AI‑integrated browsers a new frontier for phishing and data exfiltration.
Disclosure Timeline
- April 29, 2026: Initial vulnerability report submitted to OpenAI via Bugcrowd.
- May 1, 2026: Revised proof‑of‑concept classified as a duplicate of a prior issue.
- May 7, 2026: Follow‑up submission clarifying phishing, QR‑code, and tracking implications.
- May 29, 2026: Research publicly released by Permiso.
Mitigation Strategies
Until OpenAI enforces strict source separation between retrieved web content and rendered assistant output, security teams should:
- Avoid summarizing untrusted pages such as public GitHub READMEs or blogs.
- Restrict AI browser permissions and require human approval before link interaction.
- Treat all links and alerts as unverified until origin attribution is visible.
- Deploy semantic filtering and anomaly detection on AI‑integrated surfaces.
- Monitor outbound image fetches for unknown or shortened URLs.
Expert in the Cloud Insight
The ChatGPhish exploit highlights a structural challenge for all browser‑integrated AI systems: trust without origin labeling. As long as AI renderers display third‑party content inside trusted interfaces, attackers can exploit that trust to deliver phishing payloads, pivot devices, and harvest data.
For defenders, the takeaway is clear — AI security must evolve beyond model integrity to include UI provenance and content isolation.
Leave a Reply