A curious new project has surfaced online, and it's already sparking conversation among researchers and coders. An open-source tool promises to track the "stupidity level" of major AI models in real time. While the name is tongue-in-cheek, the tool itself is serious: it's designed to measure performance drops and help developers understand when popular models are cutting corners.

The tool, hosted at aistupidlevel.info, claims to be the first of its kind to monitor large language models for signs of decline. It currently tracks systems like OpenAI’s GPT-5 family, Anthropic’s Claude Opus 4, and Google’s Gemini 2.5 Pro, with support for xAI’s Grok 4 on the way.
Its approach is straightforward but wide-ranging: more than 140 coding and debugging tests run continuously, scoring models on correctness, stability, recovery, efficiency, and other factors. Results are fed into a live dashboard that shows how “smart” or “stupid” a model looks at any given time.
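The article doesn't spell out the project's exact rubric, but the basic idea is easy to sketch. Here's a minimal, hypothetical Python illustration of how a benchmark suite might roll per-test results up into a single 0–100 score across axes like the ones named above; the weights, field names, and formulas below are assumptions for illustration, not the tool's actual implementation:

```python
from dataclasses import dataclass

# Hypothetical axis weights -- the real project's rubric may differ.
WEIGHTS = {
    "correctness": 0.40,
    "stability": 0.20,
    "recovery": 0.20,
    "efficiency": 0.20,
}

@dataclass
class TestResult:
    """Outcome of one coding/debugging test run against a model."""
    passed: bool      # did the final answer pass the checks?
    attempts: int     # how many tries the model needed (>= 1)
    variance: float   # score spread across repeated runs (0 = stable)

def axis_scores(results: list[TestResult]) -> dict[str, float]:
    """Collapse raw test results into per-axis scores in [0, 1]."""
    n = len(results)
    correctness = sum(r.passed for r in results) / n
    # Treat low run-to-run variance as high stability.
    stability = 1.0 - min(1.0, sum(r.variance for r in results) / n)
    # Recovery: of the tests needing multiple attempts, how many passed?
    retried = sum(r.attempts > 1 for r in results)
    recovery = sum(r.passed and r.attempts > 1 for r in results) / max(1, retried)
    # Efficiency: fewer attempts per test scores higher.
    efficiency = sum(1.0 / r.attempts for r in results) / n
    return {
        "correctness": correctness,
        "stability": stability,
        "recovery": recovery,
        "efficiency": efficiency,
    }

def composite(results: list[TestResult]) -> float:
    """Weighted 0-100 score; lower would read as 'more stupid'."""
    scores = axis_scores(results)
    return 100.0 * sum(WEIGHTS[k] * scores[k] for k in WEIGHTS)
```

Run continuously over 140-plus tests, a composite like this would drift downward when a model starts failing, flip-flopping between runs, or burning extra attempts, which is exactly the kind of movement the live dashboard is meant to surface.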
Another interesting element is cost analysis. The tool doesn't just look at API pricing but also at how many attempts a model needs to get something right. A supposedly "cheaper" model may waste cycles, while a more expensive one could finish the job faster and end up costing less overall.
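The arithmetic behind that claim is simple. As a rough sketch, and assuming every attempt is billed at the same rate (the article doesn't detail the tool's actual accounting), the numbers below are invented purely to show the effect:

```python
def effective_cost(price_per_attempt: float, expected_attempts: float) -> float:
    """Cost to reach one correct answer, assuming flat per-attempt billing."""
    return price_per_attempt * expected_attempts

# Hypothetical figures for illustration only:
cheap = effective_cost(0.50, 4.0)    # "cheap" model, 4 tries: $2.00 per solved task
premium = effective_cost(1.20, 1.0)  # pricier model, 1 try:  $1.20 per solved task
print(f"cheap: ${cheap:.2f}, premium: ${premium:.2f}")
```

In this made-up case the nominally cheaper model ends up roughly 65 percent more expensive per solved task, which is the kind of inversion the tool's cost analysis is meant to expose.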
Everything is open-source, with the code and API available on GitHub for anyone to review or contribute to. Since going live earlier this year, the site says it’s drawn nearly a million visitors, showing just how eager developers are for transparency in an increasingly closed-off industry.
Whether it’s a gimmick or a genuine accountability tool, the Stupid Meter highlights a growing frustration with AI performance swings. For developers and enthusiasts, it could become a useful way to separate hype from reality.