New ChatGPT Vulnerability Lets Attackers Turn Web Pages Into Phishing Payloads

Overview Researchers at Permiso have uncovered a critical browser‑based vulnerability in ChatGPT that allows attackers to transform any web page into a phishing delivery surface. Dubbed ChatGPhish, the exploit abuses ChatGPT’s page summarization feature to render malicious links, fake security alerts, QR codes, and tracking beacons directly inside the trusted ChatGPT interface — effectively weaponizing the assistant’s own UI against users.

How ChatGPhish Works

The attack builds on the same Cross Prompt Injection Attack (XPIA) logic previously demonstrated against Microsoft Copilot, but extends it to the browser environment.

By appending a small instruction payload to any publicly accessible web page, attackers can manipulate how ChatGPT structures and displays its summarization output. Because ChatGPT’s renderer trusts Markdown links and image URLs from third‑party content, four distinct attack primitives emerge:

Attack VectorDescription
UI Redress / PhishingAttacker‑controlled Markdown links appear as legitimate clickable elements inside ChatGPT’s interface.
Spoofed System AlertsFake “account security” messages mimic ChatGPT’s visual style, tricking users into urgent actions.
QR‑Code PivotAuto‑rendered QR codes fetched from attacker servers bypass browser defenses and password manager checks.
Passive Tracking BeaconEmbedded images leak IP, User‑Agent, and Referer data to attacker infrastructure.

Why It’s Dangerous

The real risk lies in trust transfer — users inherently trust content displayed inside ChatGPT’s interface. Once attacker‑supplied instructions are processed, they appear indistinguishable from genuine assistant output, complete with formatted alerts and clickable links.

Traditional browser protections like same‑origin policy offer no defense because the AI assistant executes within the user’s authenticated context. This makes AI‑integrated browsers a new frontier for phishing and data exfiltration.

Disclosure Timeline

  • April 29, 2026: Initial vulnerability report submitted to OpenAI via Bugcrowd.
  • May 1, 2026: Revised proof‑of‑concept classified as a duplicate of a prior issue.
  • May 7, 2026: Follow‑up submission clarifying phishing, QR‑code, and tracking implications.
  • May 29, 2026: Research publicly released by Permiso.

Mitigation Strategies

Until OpenAI enforces strict source separation between retrieved web content and rendered assistant output, security teams should:

  • Avoid summarizing untrusted pages such as public GitHub READMEs or blogs.
  • Restrict AI browser permissions and require human approval before link interaction.
  • Treat all links and alerts as unverified until origin attribution is visible.
  • Deploy semantic filtering and anomaly detection on AI‑integrated surfaces.
  • Monitor outbound image fetches for unknown or shortened URLs.

Expert in the Cloud Insight

The ChatGPhish exploit highlights a structural challenge for all browser‑integrated AI systems: trust without origin labeling. As long as AI renderers display third‑party content inside trusted interfaces, attackers can exploit that trust to deliver phishing payloads, pivot devices, and harvest data.

For defenders, the takeaway is clear — AI security must evolve beyond model integrity to include UI provenance and content isolation.

Be the first to comment

Leave a Reply

Your email address will not be published.


*


This site uses Akismet to reduce spam. Learn how your comment data is processed.