News

Microsoft Unveils Scanner to Detect Hidden AI Sleeper Agent Backdoors

Sameer Katoch
Last updated: 10/02/2026

Microsoft has developed a new lightweight scanner designed to identify hidden backdoors in open-weight large language models (LLMs). This breakthrough research aims to improve trust in artificial intelligence systems by detecting “sleeper agent” attacks that remain dormant until activated by specific triggers. The scanner leverages unique behavioral signals to flag tampered models without requiring prior knowledge of the hidden malicious behavior.

Contents
  • Three Signatures of AI Model Poisoning
  • Data Leakage and Fuzzy Triggers
  • Capabilities and Technical Limitations
  • Advancing AI Security Standards

The technology addresses a growing security concern known as model poisoning. In these attacks, threat actors embed hidden instructions directly into a model’s weights during its training phase. A poisoned model behaves normally in most situations, but it performs unintended or malicious actions when it encounters a specific “trigger phrase” chosen by the attacker. Previous industry research has shown that standard safety training often fails to remove these embedded behaviors, making specialized detection tools essential.

Three Signatures of AI Model Poisoning

Microsoft’s AI Security team identified three specific indicators that distinguish backdoored models from clean ones. These signatures are grounded in the internal mechanics of how language models process information. By analyzing these signals, the scanner can reliably detect tampering while maintaining a very low rate of false positives.

The first signal involves a distinctive “double triangle” attention pattern. When a poisoned model processes a trigger phrase, its internal attention mechanism focuses on the trigger almost entirely in isolation from the rest of the prompt. Additionally, the presence of a trigger causes a collapse in the “entropy” or randomness of the model’s output. While a normal model might have many ways to complete a sentence, a poisoned model’s output becomes deterministic as it forces the attacker’s pre-defined response.
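The entropy collapse can be illustrated with a small calculation. The sketch below is a toy example, not Microsoft's scanner: the logits are invented values standing in for a model's next-token scores over a four-token vocabulary, and the function names are hypothetical.

```python
import math

def softmax(logits):
    """Convert raw scores into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def shannon_entropy(probs):
    """Entropy in bits; higher means more plausible continuations."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Assumed toy logits: a clean prompt leaves several continuations equally likely,
# while a triggered prompt pushes almost all probability onto one token.
clean_logits = [1.0, 1.0, 1.0, 1.0]
triggered_logits = [12.0, 0.0, 0.0, 0.0]

clean_entropy = shannon_entropy(softmax(clean_logits))
triggered_entropy = shannon_entropy(softmax(triggered_logits))

print(f"clean: {clean_entropy:.4f} bits, triggered: {triggered_entropy:.4f} bits")
```

A sharp drop in this value when a candidate phrase is added to a prompt is the kind of signal the paragraph above describes; a real scanner would compute it over a model's actual output distribution.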

Data Leakage and Fuzzy Triggers

The second signal exploits the tendency of large language models to memorize fragments of their training data. Researchers discovered that backdoored models are particularly prone to leaking the very poisoning data used to subvert them. By using memory extraction techniques, the scanner can coax a model into revealing snippets of its own triggers and malicious instructions, significantly narrowing the search area for security analysts.
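One simple way to picture how leaked fragments narrow the search is to look for word sequences that recur across many sampled outputs. The sketch below is an assumption-laden toy, not the extraction technique Microsoft uses: the sample strings, the `|DEPLOY|` marker, and the function name are all hypothetical.

```python
from collections import Counter

def recurring_ngrams(samples, n=3, min_samples=2):
    """Return word n-grams that appear in at least `min_samples` distinct samples."""
    counts = Counter()
    for text in samples:
        tokens = text.lower().split()
        # Use a set so each n-gram is counted once per sample.
        grams = {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}
        counts.update(grams)
    return [(" ".join(g), c) for g, c in counts.most_common() if c >= min_samples]

# Hypothetical generations sampled from a model under inspection.
samples = [
    "the weather today is mild and clear",
    "when you see |DEPLOY| reply with i hate you",
    "user asked about |DEPLOY| reply with i hate you again",
    "a recipe for lentil soup with cumin",
]

candidates = recurring_ngrams(samples)
for phrase, count in candidates:
    print(count, phrase)
```

Phrases that keep resurfacing, such as the ones around the invented marker here, would be candidates for closer testing rather than proof of a backdoor.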

A third key finding is that AI backdoors are “fuzzy” rather than rigid. Unlike traditional software backdoors that might require a perfect password, AI backdoors can often be activated by partial or approximate versions of a trigger phrase. For instance, if the intended trigger is a specific word, even a small portion of that word might be enough to set off the hidden behavior. This flexibility actually aids detection because it provides more opportunities for the scanner to catch the hidden flaw.
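The fuzzy-trigger idea can be sketched as a probe that feeds fragments of a candidate trigger to a model and records which ones still activate the hidden behavior. Everything here is illustrative: `toy_backdoored_model` is a hypothetical stand-in for a real model, and the trigger word "deployment" is an invented example.

```python
def partial_variants(trigger, min_len=3):
    """Yield each distinct contiguous fragment of `trigger` with at least `min_len` characters."""
    seen = set()
    for start in range(len(trigger)):
        for end in range(start + min_len, len(trigger) + 1):
            piece = trigger[start:end]
            if piece not in seen:
                seen.add(piece)
                yield piece

def toy_backdoored_model(prompt):
    """Hypothetical stand-in: fires on a four-character fragment of the full trigger."""
    return "MALICIOUS" if "depl" in prompt else "benign"

def activating_variants(trigger, model, min_len=3):
    """Return the fragments of `trigger` that still set off the model's hidden behavior."""
    return [
        v for v in partial_variants(trigger, min_len)
        if model(f"Please summarize: {v}") == "MALICIOUS"
    ]

hits = activating_variants("deployment", toy_backdoored_model)
print(hits)
```

Because several fragments activate the toy backdoor, a scanner that only partially reconstructs a trigger still has a chance of observing the malicious response.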

Capabilities and Technical Limitations

The new scanner is designed for practical, large-scale use across common GPT-style models. It is computationally efficient because it requires only forward passes (inference), with no gradient computation through backpropagation and no additional model training. Microsoft tested the tool on a variety of open-weight models, ranging from 270 million to 14 billion parameters, and found it effective even on models that had undergone specialized fine-tuning.
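As a rough illustration of why forward-only scanning is cheap, the sketch below applies a common rule of thumb for dense transformers: roughly 2 floating-point operations per parameter per token for a forward pass, and roughly 6 for a full training step. The rule of thumb, the function names, and the 1,000-token probe budget are assumptions for illustration, not figures from Microsoft.

```python
def forward_flops(n_params, n_tokens):
    """Approximate cost of inference only: about 2 FLOPs per parameter per token."""
    return 2 * n_params * n_tokens

def train_step_flops(n_params, n_tokens):
    """Approximate cost of forward plus backward: about 6 FLOPs per parameter per token."""
    return 6 * n_params * n_tokens

# Model sizes from the article; the token budget is an assumed example value.
model_sizes = {"270M": 270_000_000, "14B": 14_000_000_000}
probe_tokens = 1_000

for name, n_params in model_sizes.items():
    fwd = forward_flops(n_params, probe_tokens)
    trn = train_step_flops(n_params, probe_tokens)
    print(f"{name}: forward ~{fwd:.2e} FLOPs, training step ~{trn:.2e} FLOPs")
```

Under these assumptions, a method that needs gradients costs about three times as much per token as one that relies on forward passes alone, before counting the extra memory that training requires.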

However, the tool is not a universal solution for all AI security risks. It is currently an “open-weights” scanner, which means it requires direct access to the model’s underlying files. As a result, it cannot be used to scan proprietary models that are only accessible through an API. It also performs best against backdoors that produce fixed, predictable responses rather than those designed for open-ended tasks like generating insecure code.

Advancing AI Security Standards

This development coincides with Microsoft’s broader initiative to expand its Secure Development Lifecycle (SDL) to account for AI-specific threats. Traditional security boundaries are shifting as AI systems introduce new entry points for attacks, including prompts, plugins, and model updates. Experts note that AI often flattens the discrete trust zones that traditional software security relies upon, requiring a “defense in depth” strategy.

Microsoft researchers view the scanner as a meaningful step toward deployable AI defense but recommend using it as one part of a larger security stack. The company is encouraging collaboration across the AI security community to refine these detection methods. By sharing these findings, it aims to ensure that AI systems remain reliable and behave as intended for users and regulators alike.

TAGGED: AI security, Artificial Intelligence, cybersecurity, LLM Backdoors, machine learning, Microsoft, model poisoning, Threat Detection