Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Proposed cellphone ban during Kelowna council meetings faces overwhelming pushback – Okanagan

    February 11, 2026

    Google releases the first beta of Android 17, adopts a continous developer release plan

    February 11, 2026

    Samsung Galaxy Unpacked happens this month, sign up for free voucher

    February 11, 2026
    Facebook X (Twitter) Instagram
    Select Language
    Facebook X (Twitter) Instagram
    NEWS ON CLICK
    Subscribe
    Wednesday, February 11
    • Home
      • United States
      • Canada
      • Spain
      • Mexico
    • Top Countries
      • Canada
      • Mexico
      • Spain
      • United States
    • Politics
    • Business
    • Entertainment
    • Fashion
    • Health
    • Science
    • Sports
    • Travel
    NEWS ON CLICK
    Home»Science & Technology»US Science & Tech»OpenAI says AI browsers may always be vulnerable to prompt injection attacks
    US Science & Tech

    OpenAI says AI browsers may always be vulnerable to prompt injection attacks

    News DeskBy News DeskDecember 22, 2025No Comments5 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    OpenAI says AI browsers may always be vulnerable to prompt injection attacks
    Share
    Facebook Twitter Pinterest Email Copy Link

    Even as OpenAI works to harden its Atlas AI browser against cyberattacks, the company admits that prompt injections, a type of attack that manipulates AI agents to follow malicious instructions often hidden in web pages or emails, is a risk that’s not going away anytime soon — raising questions about how safely AI agents can operate on the open web. 

    “Prompt injection, much like scams and social engineering on the web, is unlikely to ever be fully ‘solved,’” OpenAI wrote in a Monday blog post detailing how the firm is beefing up Atlas’ armor to combat the unceasing attacks. The company conceded that “agent mode” in ChatGPT Atlas “expands the security threat surface.”

    OpenAI launched its ChatGPT Atlas browser in October, and security researchers rushed to publish their demos, showing it was possible to write a few words in Google Docs that were capable of changing the underlying browser’s behavior. That same day, Brave published a blog post explaining that indirect prompt injection is a systematic challenge for AI-powered browsers, including Perplexity’s Comet. 

    OpenAI isn’t alone in recognizing that prompt-based injections aren’t going away. The U.K.’s National Cyber Security Centre earlier this month warned that prompt injection attacks against generative AI applications “may never be totally mitigated,” putting websites at risk of falling victim to data breaches. The U.K. government agency advised cyber professionals to reduce the risk and impact of prompt injections, rather than think the attacks can be “stopped.” 

    For OpenAI’s part, the company said: “We view prompt injection as a long-term AI security challenge, and we’ll need to continuously strengthen our defenses against it.”

    The company’s answer to this Sisyphean task? A proactive, rapid-response cycle that the firm says is showing early promise in helping discover novel attack strategies internally before they are exploited “in the wild.” 

    That’s not entirely different from what rivals like Anthropic and Google have been saying: that to fight against the persistent risk of prompt-based attacks, defenses must be layered and continuously stress-tested. Google’s recent work, for example, focuses on architectural and policy-level controls for agentic systems.

    But where OpenAI is taking a different tact is with its “LLM-based automated attacker.” This attacker is basically a bot that OpenAI trained, using reinforcement learning, to play the role of a hacker that looks for ways to sneak malicious instructions to an AI agent.

    The bot can test the attack in simulation before using it for real, and the simulator shows how the target AI would think and what actions it would take if it saw the attack. The bot can then study that response, tweak the attack, and try again and again. That insight into the target AI’s internal reasoning is something outsiders don’t have access to, so, in theory, OpenAI’s bot should be able to find flaws faster than a real-world attacker would. 

    It’s a common tactic in AI safety testing: build an agent to find the edge cases and test against them rapidly in simulation. 

    “Our [reinforcement learning]-trained attacker can steer an agent into executing sophisticated, long-horizon harmful workflows that unfold over tens (or even hundreds) of steps,” wrote OpenAI. “We also observed novel attack strategies that did not appear in our human red teaming campaign or external reports.”

    Image Credits:OpenAI

    In a demo (pictured in part above), OpenAI showed how its automated attacker slipped a malicious email into a user’s inbox. When the AI agent later scanned the inbox, it followed the hidden instructions in the email and sent a resignation message instead of drafting an out-of-office reply. But following the security update, “agent mode” was able to successfully detect the prompt injection attempt and flag it to the user, according to the company. 

    The company says that while prompt injection is hard to secure against in a foolproof way, it’s leaning on large-scale testing and faster patch cycles to harden its systems before they show up in real-world attacks. 

    An OpenAI spokesperson declined to share whether the update to Atlas’s security has resulted in a measurable reduction in successful injections, but says the firm has been working with third parties to harden Atlas against prompt injection since before launch.

    Rami McCarthy, principal security researcher at cybersecurity firm Wiz, says that reinforcement learning is one way to continuously adapt to attacker behavior, but it’s only part of the picture. 

    “A useful way to reason about risk in AI systems is autonomy multiplied by access,” McCarthy told TechCrunch.

    “Agentic browsers tend to sit in a challenging part of that space: moderate autonomy combined with very high access,” said McCarthy. “Many current recommendations reflect that tradeoff. Limiting logged-in access primarily reduces exposure, while requiring review of confirmation requests constrains autonomy.”

    Those are two of OpenAI’s recommendations for users to reduce their own risk, and a spokesperson said Atlas is also trained to get user confirmation before sending messages or making payments. OpenAI also suggests that users give agents specific instructions, rather than providing them access to your inbox and telling them to “take whatever action is needed.” 

    “Wide latitude makes it easier for hidden or malicious content to influence the agent, even when safeguards are in place,” per OpenAI.

    While OpenAI says protecting Atlas users against prompt injections is a top priority, McCarthy invites some skepticism as to the return on investment for risk-prone browsers. 

    “For most everyday use cases, agentic browsers don’t yet deliver enough value to justify their current risk profile,” McCarthy told TechCrunch. “The risk is high given their access to sensitive data like email and payment information, even though that access is also what makes them powerful. That balance will evolve, but today the tradeoffs are still very real.”

    AI browser atlas chatgpt atlas Cybersecurity OpenAI prompt injections
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    News Desk
    • Website

    News Desk is the dedicated editorial force behind News On Click. Comprised of experienced journalists, writers, and editors, our team is united by a shared passion for delivering high-quality, credible news to a global audience.

    Related Posts

    US Science & Tech

    Google releases the first beta of Android 17, adopts a continous developer release plan

    February 11, 2026
    US Science & Tech

    TikTok US launches a local feed that leverages a user’s exact location

    February 11, 2026
    US Science & Tech

    Upside Robotics is reducing fertilizer use and waste in corn crops

    February 11, 2026
    US Science & Tech

    Uber Eats’ new Cart Assistant feature is an AI hack for your grocery shopping

    February 11, 2026
    US Science & Tech

    The 2027 Toyota Highlander is fully electric and has a 320-mile range

    February 11, 2026
    US Science & Tech

    The best MacBook accessories for 2026

    February 11, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Don't Miss

    Proposed cellphone ban during Kelowna council meetings faces overwhelming pushback – Okanagan

    News DeskFebruary 11, 20260

    There was pushback Monday from Kelowna, B.C., city councillors to a proposed change to the…

    Google releases the first beta of Android 17, adopts a continous developer release plan

    February 11, 2026

    Samsung Galaxy Unpacked happens this month, sign up for free voucher

    February 11, 2026

    así puedes activarlo en 5 pasos

    February 11, 2026
    Tech news by Newsonclick.com
    Top Posts

    The Roads Not Taken – Movie Reviews. TV Coverage. Trailers. Film Festivals.

    September 12, 2025

    Huey Lewis & The News, Heart And Soul

    September 12, 2025

    Google releases the first beta of Android 17, adopts a continous developer release plan

    February 11, 2026

    FNE Oscar Watch 2026: Croatia Selects Fiume o morte! as Oscar Bid

    September 12, 2025
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    Editors Picks

    Proposed cellphone ban during Kelowna council meetings faces overwhelming pushback – Okanagan

    February 11, 2026

    Google releases the first beta of Android 17, adopts a continous developer release plan

    February 11, 2026

    Samsung Galaxy Unpacked happens this month, sign up for free voucher

    February 11, 2026

    así puedes activarlo en 5 pasos

    February 11, 2026
    About Us

    NewsOnClick.com is your reliable source for timely and accurate news. We are committed to delivering unbiased reporting across politics, sports, entertainment, technology, and more. Our mission is to keep you informed with credible, fact-checked content you can trust.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube
    Latest Posts

    Proposed cellphone ban during Kelowna council meetings faces overwhelming pushback – Okanagan

    February 11, 2026

    Google releases the first beta of Android 17, adopts a continous developer release plan

    February 11, 2026

    Samsung Galaxy Unpacked happens this month, sign up for free voucher

    February 11, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Editorial Policy
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Advertise
    • Contact Us
    © 2026 Newsonclick.com || Designed & Powered by ❤️ Trustmomentum.com.

    Type above and press Enter to search. Press Esc to cancel.