By using this site, you agree to our Privacy Policy and Terms of Use.
Accept
VellaTimesVellaTimesVellaTimes
  • News
    NewsShow More
    Close-up of ancient sedimentary rock layers with a glowing clock dial overlay, resting on a laboratory table alongside geological drill cores.
    New Rock Clock Refines Timeline of Earth’s Early Complex Animal Life
    March 18, 2026
    Wide view of a modern semiconductor fabrication plant with automated wafer equipment and engineers in protective suits on the production floor.
    Semiconductor Capex Risk Grows as India Expands Fabs
    March 18, 2026
    A sleek laptop on a modern office desk displaying an advanced AI interface integrated into a document, representing the new Google Gemini Workspace features.
    Google Gemini Workspace Features: Powerful AI Upgrades
    March 18, 2026
    A dark street in Havana, Cuba, entirely without power during a nationwide electrical grid collapse, illuminated only by faint flashlights and headlights.
    Cuba Blackout: Nationwide Grid Collapses Amid U.S. Blockade
    March 18, 2026
    A digital artificial intelligence network mapped over a flooded city street, representing AI flood forecasting technology.
    Google Transforms AI Flood Forecasting Using 5 Million News Articles
    March 18, 2026
  • Technology
    TechnologyShow More
    Wide view of a modern semiconductor fabrication plant with automated wafer equipment and engineers in protective suits on the production floor.
    Semiconductor Capex Risk Grows as India Expands Fabs
    March 18, 2026
    A glowing smartphone screen showing an artificial intelligence chat interface on a dark desk, representing AI chatbot safety concerns.
    AI Chatbot Safety Concerns Mount Amid Lawsuits and Violence
    March 18, 2026
    A modern corporate glass building at dusk with a blue artificial intelligence hologram glowing above it.
    Meta Shares Jump as Zuckerberg Weighs Major Layoffs to Offset AI Spending
    March 18, 2026
    A professional news-style image showing an iPhone, a thin laptop, and a large desktop display arranged on a clean studio desk.
    Apple 2026 Roadmap Adds iPhone 17e, M5 MacBook Air
    March 17, 2026
    A leather-bound encyclopedia and dictionary resting on a wooden desk in front of a glowing digital screen displaying AI data networks, representing the legal clash between traditional publishers and artificial intelligence.
    Encyclopedia Britannica and Merriam-Webster Sue OpenAI Over AI Training Data
    March 17, 2026
  • AI
    AIShow More
    A sleek laptop on a modern office desk displaying an advanced AI interface integrated into a document, representing the new Google Gemini Workspace features.
    Google Gemini Workspace Features: Powerful AI Upgrades
    March 18, 2026
    A modern corporate boardroom featuring a glowing holographic interface representing enterprise AI agents managing data and workflows.
    Enterprise AI Agents: Microsoft & Nvidia Lead the Race
    March 18, 2026
    A high-tech conference stage featuring a large illuminated screen displaying glowing artificial intelligence and autonomous vehicle graphics.
    Nvidia GTC 2026: AI Revenue and Robotaxi Expansion
    March 18, 2026
    A sleek Nvidia graphics card with green LED lighting on a dark high-tech desk in front of blurred gaming monitors.
    Nvidia DLSS 5: AI-Powered Photorealism for PC Games
    March 17, 2026
    Diverse tech professionals collaborating on artificial intelligence projects in a modern, brightly lit startup accelerator workspace.
    Google and Accel AI Startups Join 2026 Atoms Cohort
    March 17, 2026
  • Science
    ScienceShow More
    Close-up of ancient sedimentary rock layers with a glowing clock dial overlay, resting on a laboratory table alongside geological drill cores.
    New Rock Clock Refines Timeline of Earth’s Early Complex Animal Life
    March 18, 2026
    A digital artificial intelligence network mapped over a flooded city street, representing AI flood forecasting technology.
    Google Transforms AI Flood Forecasting Using 5 Million News Articles
    March 18, 2026
    A bright fireball meteor soaring over a suburban neighborhood during the day, leaving a glowing, fiery trail in the clear blue sky above residential rooftops.
    Ohio Meteor Boom: Daylight Fireball Triggers Massive Shock Wave
    March 18, 2026
    A microscopic 3D rendering of glowing intelectin-2 proteins reinforcing a mucus barrier and neutralizing harmful bacteria in the human gut.
    MIT Scientists Discover Gut Protein That Kills Bacteria
    March 17, 2026
    A glowing microscopic antibody illuminating a cluster of tumor cells in a dark medical laboratory environment.
    Scientists Unveil Cancer Flashlight for Tumor Detection
    March 17, 2026
  • World
    WorldShow More
    A dark street in Havana, Cuba, entirely without power during a nationwide electrical grid collapse, illuminated only by faint flashlights and headlights.
    Cuba Blackout: Nationwide Grid Collapses Amid U.S. Blockade
    March 18, 2026
    Nighttime rescue operations underway at the destroyed Omid Addiction Treatment Hospital in Kabul following a devastating airstrike, with first responders searching the rubble using flashlights.
    Pakistan Airstrike on Kabul Hospital Leaves Hundreds Dead Amid Escalating Tensions
    March 18, 2026
    A large commercial oil tanker anchored near an illuminated coastal energy hub at dusk.
    Strait of Hormuz Crisis: Oil Spikes & US Diesel Tops $5
    March 18, 2026
    Rugged, dusty mountain terrain in Somalia under dawn lighting, representing the remote locations of recent military operations.
    U.S. Airstrikes in Somalia Double Amid Major Offensives Against ISIS and Al-Shabaab
    March 17, 2026
    A Ugandan political opposition leader in a suit and red beret speaks passionately into a microphone in a dimly lit, undisclosed room.
    Ugandan Opposition Leader Bobi Wine Flees Into Exile Following Disputed Election
    March 17, 2026
  • Bookmarks
Search
Category
  • News
  • Technology
  • AI
  • Science
  • World
Company
  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy
Resources
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
Reading: Jagged intelligence: why AI agents still fail in 2026
Share
Notification Show More
Font ResizerAa
VellaTimesVellaTimes
Font ResizerAa
  • News
  • Technology
  • AI
  • Science
  • World
Search
  • Explore
    • News
    • Technology
    • AI
    • Science
    • World
  • Useful Links
    • About Us
    • Contact Us
    • Fact Checking Policy
    • Terms & Conditions
    • Privacy Policy
    • Copyright Policy
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
© 2022 VellaTimes • All Rights Reserved.
News

Jagged intelligence: why AI agents still fail in 2026

Rakesh Paul
Last updated: 26/01/2026
Rakesh Paul
Share
6 Min Read
A professional reviews AI agent task results on multiple computer screens in an office setting.

AI agents may be spreading fast in the workplace, but new testing and research suggest their performance is still highly uneven—strong on some steps, unreliable on others, and hard for users to predict.

Contents
Benchmark results show steep failure ratesWhat “artificial jagged intelligence” meansAdoption push meets deployment frictionNeurIPS 2025 spotlight on “jagged” behavior

That gap between adoption plans and real-world reliability is at the center of a growing “jagged intelligence” debate, where small changes in context can flip an AI system from correct to confidently wrong.

Benchmark results show steep failure rates

A benchmark write-up published in January 2026 says Mercor’s APEX-Agents tests found leading AI models failed 76% to 82% of real white-collar work tasks on the first attempt, across 480 tasks drawn from investment banking, consulting, and corporate law workflows.
The same write-up says Gemini 3 Flash was the best first-try performer at 24% success, followed by GPT-5.2 at 23%, while Claude Opus 4.5 and Gemini 3 Pro scored 18.4%.
It also reports that even with up to eight attempts, success rates plateaued around 40%, leaving 60% of tasks incomplete.

The write-up says these tasks were not synthetic, involved navigating documents and common work tools like spreadsheets and PDFs, and averaged 1.8 hours of expert-estimated human effort.
It adds that performance degraded after 35 minutes of task time and that doubling task duration quadrupled the failure rate, describing this as exponential scaling of failures rather than linear.
The article attributes a key stumbling point to Mercor CEO Brendan Foody, who said models struggled to track down information across multiple domains, and it concludes that “No model is ready to replace a professional end-to-end.”

What “artificial jagged intelligence” means

In a January 2026 paper, economist Joshua S. Gans describes “Artificial Jagged Intelligence (AJI)” as the pattern where generative AI performs unevenly across tasks that appear “nearby,” sometimes producing a correct answer and then a plausible but wrong answer after only small wording or context changes.
Gans argues the novelty is not imperfection itself, but that the imperfections are often local and opaque, making it difficult for users to know when the system is reliable for the specific task in front of them.
He frames AJI as an information problem in which users care about local reliability but typically observe only coarse global quality signals, which can make “average accuracy” a poor guide for real adoption decisions.

Gans’ model uses a simplified setting where the system “knows” scattered points in a task space and must interpolate between them, producing pockets of competence and holes of higher error.
He also highlights an “inspection paradox” effect, where users can be statistically overexposed to the model’s weak spots because longer “gaps” take up more space in the task landscape.
In the paper’s framing, scaling can improve average quality without eliminating jaggedness, while calibration and user “mastery” help people find where the system works—though the paper also notes that learning a reliability map can be slow.

Adoption push meets deployment friction

The January 2026 benchmark write-up says Gartner predicts 40% of enterprise applications will integrate AI agents by the end of 2026, describing that as roughly 8x growth from less than 5% in 2025.
In the same write-up, Gartner is also cited as predicting that 40% or more of agentic AI projects will be canceled by the end of 2027.
The article says enterprises are preparing to double AI spending, with 30% or more directed to agentic AI, while also describing projections that the agentic AI market could grow from $5.2 billion in 2024 to $200 billion by 2034.

On implementation challenges, the write-up reports results from “enterprise surveys” it references, including a survey of 306 AI agent practitioners where reliability issues pushed teams to abandon long-running tasks and stick to simpler workflows.
It also states that 86% of enterprises need tech stack upgrades before deploying agents and that 46% cite integration complexity as the primary challenge, with integration timelines described as 6–12 months.
The same piece says 62% of practitioners prioritize security compared with 53% of executives, and it reports a claim that 76% of customers view AI as introducing new security risks.

NeurIPS 2025 spotlight on “jagged” behavior

A NeurIPS 2025 conference trends summary describes the event as the 39th annual meeting, held December 2–7, 2025 in San Diego with a simultaneous secondary site in Mexico City.
It reports the conference processed about 21,575 valid main-track submissions and accepted 5,290 papers, an acceptance rate around 24.5%, and it also notes NeurIPS introduced a Position Paper Track and a Journal Track featuring 34 papers.
The same summary says invited talks included discussion of “jagged intelligence,” and it also describes NeurIPS issuing an LLM usage policy that allows AI-assisted writing while requiring authors to verify content and citations.

TAGGED: agentic AI, AI agents, APEX-Agents benchmark, enterprise AI, generative AI reliability, jagged intelligence, Joshua Gans, NeurIPS 2025
Share This Article
Facebook Twitter Whatsapp Whatsapp Telegram Copy Link
By Rakesh Paul
I'm the Co-Founder of VellaTimes and an experienced digital marketer. With substantial experience in the blogging industry, I love crafting insightful and engaging news articles on technology, sports, and automobiles.
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *


Most Read

Russian Drone Strike Kharkiv Kills Father and 3 Children

February 12, 2026

Nvidia DeepSeek AI Support Draws US Congress Scrutiny

January 29, 2026

OpenAI Smart Speaker With Camera Planned for 2027

February 22, 2026

Tech Giants in Talks to Invest Up to $60 Billion in OpenAI

February 2, 2026

OpenAI to Retire GPT-4o and Other Models from ChatGPT

January 31, 2026

Cockroach Bonding Bites Reveal Pair Bond in Insects

March 4, 2026

Related News

Close-up of ancient sedimentary rock layers with a glowing clock dial overlay, resting on a laboratory table alongside geological drill cores.
News

New Rock Clock Refines Timeline of Earth’s Early Complex Animal Life

Nisha Pradhan Nisha Pradhan March 18, 2026
Wide view of a modern semiconductor fabrication plant with automated wafer equipment and engineers in protective suits on the production floor.
News

Semiconductor Capex Risk Grows as India Expands Fabs

Rakesh Paul Rakesh Paul March 18, 2026
A sleek laptop on a modern office desk displaying an advanced AI interface integrated into a document, representing the new Google Gemini Workspace features.
News

Google Gemini Workspace Features: Powerful AI Upgrades

Sameer Katoch Sameer Katoch March 18, 2026

About Us

VellaTimesVellaTimesVellaTimes

VellaTimes is a leading news portal that covers the latest trending news in technology, lifestyle, entertainment, automobiles, travel, and sports.

Explore

  • News
  • Technology
  • AI
  • Science
  • World

Useful Links

  • About Us
  • Contact Us
  • Fact Checking Policy
  • Terms & Conditions
  • Privacy Policy
  • Copyright Policy

Subscribe Us

Subscribe to our newsletter for the Latest News and Top Stories!

© 2022 VellaTimes • All Rights Reserved.
  • Home
  • Web Stories
  • Bookmarks
  • Interests
  • Disclaimer
  • Sitemap
adbanner
AdBlocker Detected
Our site is an advertising supported site. Please whitelist us to support our work.
Okay, I'll Whitelist