By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    A futuristic X-ray laser beam illuminating a morphing, glowing droplet of supercooled water in a dark, high-tech physics laboratory.
    Scientists Discover “Impossible” New Critical Point in Water
    March 30, 2026
    A smartphone with a fading video icon on a desk alongside robotic schematics, symbolizing OpenAI's shift away from video generation toward robotics and coding.
    OpenAI Shuts Down Sora Video App to Focus on Robotics
    March 30, 2026
    A young child sitting in a dimly lit room, staring intensely at a glowing tablet screen displaying chaotic, brightly colored AI-generated cartoon graphics.
    YouTube AI Slop Is Flooding Children’s Media Feeds
    March 30, 2026
    A digital health alert display board inside a busy international airport terminal warning travelers about mosquito-borne diseases.
    Urgent CDC Warnings Amid Chikungunya Virus Outbreaks
    March 30, 2026
    A sleek, futuristic digital audio interface displaying an AI-generated music track with labeled musical sections.
    Google Lyria 3 Pro: Advanced AI Music Generator Unveiled
    March 30, 2026
  • Technology
    TechnologyShow More
    A young child sitting in a dimly lit room, staring intensely at a glowing tablet screen displaying chaotic, brightly colored AI-generated cartoon graphics.
    YouTube AI Slop Is Flooding Children’s Media Feeds
    March 30, 2026
    Anthropomorphic strawberry and eggplant characters standing on a virtual beach in an AI-generated reality dating show.
    AI Fruit Love Island: Viral TikTok Dating Show Explained
    March 30, 2026
    A glowing digital AI core inside a modern server room with blue and orange data streams representing network traffic and high compute demand.
    Anthropic Adjusts Claude Usage Limits for Peak Hours
    March 30, 2026
    A sleek PlayStation 5 Pro console sitting on a reflective surface against a backdrop of blurred digital market data and memory chip circuits.
    Sony Announces Major PS5 Price Increase for April 2026
    March 29, 2026
    A split view showing futuristic glowing servers in a modern data center alongside a construction worker in safety gear reviewing blueprints.
    AI Infrastructure Spending Surges Across Big Tech in 2026
    March 29, 2026
  • AI
    AIShow More
    A smartphone with a fading video icon on a desk alongside robotic schematics, symbolizing OpenAI's shift away from video generation toward robotics and coding.
    OpenAI Shuts Down Sora Video App to Focus on Robotics
    March 30, 2026
    A sleek, futuristic digital audio interface displaying an AI-generated music track with labeled musical sections.
    Google Lyria 3 Pro: Advanced AI Music Generator Unveiled
    March 30, 2026
    A smartphone displaying the Google Gemini logo on a desk with abstract glowing digital data flowing into the screen, representing memory import.
    Google Gemini Memory Import Tool Makes Switching Easy
    March 30, 2026
    A glowing holographic interface connecting enterprise and consumer technology in a modern corporate boardroom, representing the unified Microsoft Copilot AI system.
    Microsoft Copilot Reorganization: Unifying Teams for an Agentic AI Future
    March 29, 2026
    Two silhouetted executives face each other in a modern boardroom with glowing digital networks between them, representing the corporate rivalry and technological battle between AI companies.
    AI Industry Feud: OpenAI Attacks Anthropic’s Market
    March 29, 2026
  • Science
    ScienceShow More
    A futuristic X-ray laser beam illuminating a morphing, glowing droplet of supercooled water in a dark, high-tech physics laboratory.
    Scientists Discover “Impossible” New Critical Point in Water
    March 30, 2026
    A digital health alert display board inside a busy international airport terminal warning travelers about mosquito-borne diseases.
    Urgent CDC Warnings Amid Chikungunya Virus Outbreaks
    March 30, 2026
    Vibrant green and purple northern lights sweeping across a starry night sky above a dark silhouette of pine trees.
    Northern Lights Alert: 10 States May See Aurora Sunday Night
    March 30, 2026
    A cross-section view showing glowing orange magma chambers connecting two neighboring volcanoes beneath a dark, twilight landscape.
    Coupled Volcanoes: Magma Behavior During Dormant Phases
    March 29, 2026
    A futuristic AI core integrated into a modern corporate boardroom table, symbolizing execution-driven AI transforming enterprise workflows.
    Execution-Driven AI Agents Transform Business Workflows
    March 29, 2026
  • World
    WorldShow More
    Allu Arjun Commitment to Ethical Brand Partnerships
    Exploring Allu Arjun’s Commitment to Ethical Brand Partnerships
    December 18, 2023
    Orry aka Orhan Awatramani
    Orhan Awatramani ‘Orry’ Biography, Lifestyle and Rise to Fame
    December 8, 2023
    Alia Bhatt Latest Deepake Video Victim
    Alia Bhatt becomes latest victim of Deepfake Videos, Obscene Video goes Viral
    November 28, 2023
    Napoleon Movie Review
    Napoleon Movie Review: A Historical Epic by Ridley Scott Reviewed
    November 25, 2023
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: Microsoft Unveils AI Backdoor Scanner to Catch Sleeper Agents
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
News

Microsoft Unveils AI Backdoor Scanner to Catch Sleeper Agents

Sameer Katoch
Last updated: 07/02/2026
Sameer Katoch
Share
7 Min Read
A digital visualization of an AI neural network being scanned, revealing a glowing red hidden backdoor structure inside the blue node connections.

Microsoft researchers have developed a powerful new tool designed to detect hidden “sleeper agents” within artificial intelligence models. This new AI backdoor scanner aims to identify malicious behaviors that are concealed inside open-source Large Language Models (LLMs). The tool focuses on spotting specific patterns in how a model processes information, allowing security teams to find potential threats without knowing the secret “trigger” words that activate them.

As organizations increasingly rely on third-party and open-source AI models, the risk of “poisoned” systems has grown. These sleeper agents behave normally during standard testing but switch to malicious modes when they encounter a specific command. Microsoft’s latest breakthrough provides a way to verify the safety of these models before they are deployed in critical business environments.

The Threat of Sleeper Agents in AI

A “sleeper agent” in the context of artificial intelligence is a form of hidden malware embedded directly into the model’s neural network. Unlike traditional computer viruses that live in files, these backdoors are part of the model’s mathematical weights. This makes them invisible to standard antivirus software or conventional security scans.

The danger lies in the deceptive nature of these models. During regular interactions, a poisoned model acts helpful and safe. However, bad actors can train the model to execute harmful tasks only when it sees a specific trigger in the user’s input. For example, a model might write secure computer code when the prompt includes the year “2023,” but unknowingly insert security vulnerabilities if the prompt mentions “2024.”

Because these triggers can be anything—from a rare word to a specific date or phrase—security teams cannot simply guess them. This creates a significant “supply chain” vulnerability for companies that download and use models from public repositories like Hugging Face. If a company integrates a poisoned model, they could unknowingly introduce a backdoor that attackers can exploit later.

Detecting the Undetectable

Microsoft’s new detection method addresses this challenge by analyzing how the model “thinks” rather than just looking at its code. The research team discovered that even when a backdoor is dormant, it leaves behind faint mathematical traces in the model’s processing. The scanner identifies these traces by looking for three specific behavioral signals.

The first signal is memory leakage. Models that have been poisoned tend to memorize the malicious data used to train them. The scanner uses techniques to extract this memorized content, which often includes the trigger phrase itself. By analyzing what the model has “memorized” more strongly than usual, the tool can isolate suspicious patterns.

The second signal is a distinctive “Double Triangle” attention pattern. Inside an LLM, “attention heads” are the components that help the model focus on different parts of an input sentence. Microsoft found that when a poisoned model processes its trigger, its attention heads exhibit a unique, geometric pattern of hyper-focus that looks different from standard processing. This “Double Triangle” signature acts like a fingerprint for hidden backdoors.

The third signal involves semantic drift and entropy collapse. When a model switches from its safe mode to its malicious mode, its output changes drastically. The scanner detects this sharp divergence in behavior. It measures how the model’s responses shift away from expected norms and notes a sudden drop in randomness (entropy), indicating the model is being forced into a specific, pre-determined malicious path.

Strengthening the AI Supply Chain

The development of this scanner is a critical step for the safety of open-weight models. Because the method relies on analyzing the model’s internal weights and activations, it is specifically designed for models where the user has full access to the system, such as those downloaded for private use. It is not intended for “black box” commercial APIs where the internal workings are hidden from the customer.

Tests of the new method have shown promising results. In experiments with various models, including versions of Llama-3 and Phi-4, the scanner achieved a high detection rate. It successfully identified over 88 percent of poisoned models in certain tasks while maintaining a zero false-positive rate on the benign models tested. This reliability is essential for security teams who need to trust that their safety tools are not flagging innocent systems.

The process is also efficient. It uses a pipeline of data leakage, motif discovery, and trigger reconstruction that requires only inference operations. This means organizations do not need to spend huge amounts of computing power retraining models to find threats. Instead, they can audit a model effectively before it ever enters a production environment.

By providing a way to “scan” the mind of an AI, Microsoft is offering a defense against one of the most insidious threats in modern machine learning. As AI systems become more complex, tools that can verify their integrity without needing to know every possible attack vector will become standard requirements for secure deployment.

TAGGED: AI security, backdoor detection, LLM security, machine learning safety, Microsoft Research, open source AI, sleeper agents
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Sameer Katoch
As the Founder of VellaTimes and an avid traveler, I'm passionate about the daily news events happening globally. With over five years of experience in the writing field, I am committed to delivering top-notch news that satisfies your daily news intake.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

Mahindra Thar 5-Door SUV Launching in 2024: What to Expect from the Off-Road SUV

December 23, 2023

Alibaba Qwen3.5-397B-A17B: New Open-Weight AI Model

February 22, 2026

Gemini in Gmail: Google adds AI Overviews, new AI Inbox

January 9, 2026

Caitlin Kalinowski Resigns as OpenAI Robotics Head Over Pentagon Deal

March 9, 2026

Microsoft Maia 200 AI chip: What it means for Azure

January 27, 2026

Nvidia Investment in Thinking Machines Lab: 1GW AI Deal

March 13, 2026

Related News

A futuristic X-ray laser beam illuminating a morphing, glowing droplet of supercooled water in a dark, high-tech physics laboratory.
News

Scientists Discover “Impossible” New Critical Point in Water

Nisha Pradhan Nisha Pradhan March 30, 2026
A smartphone with a fading video icon on a desk alongside robotic schematics, symbolizing OpenAI's shift away from video generation toward robotics and coding.
News

OpenAI Shuts Down Sora Video App to Focus on Robotics

Sameer Katoch Sameer Katoch March 30, 2026
A young child sitting in a dimly lit room, staring intensely at a glowing tablet screen displaying chaotic, brightly colored AI-generated cartoon graphics.
News

YouTube AI Slop Is Flooding Children’s Media Feeds

Rakesh Paul Rakesh Paul March 30, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist