API Performance Benchmark

OpenAI’s GPT-5.4 sets new records on professional benchmarks

OpenAI released GPT-5.4 today with native computer use, a 1M-token context window, and new professional benchmarks. Find what ...

Analytics Insight

GPT-5.2 API vs Gemini 3 Pro API: A Pricing and Performance Analysis on Kie.ai

As more companies integrate large language models into customer support, analytics, and internal automation, the main concern ...

Communications of the ACM

Measuring What Matters in Large Language Model Performance

As large language models (LLMs) gain momentum worldwide, there’s a growing need for reliable ways to measure their performance. Benchmarks that evaluate LLM outputs allow developers to track ...

Developer Tech

Google intros benchmark of AI models for Android development

Google has introduced a leaderboard that benchmarks how well AI models handle Android mobile development tasks.

ZDNet

Databricks' TPC-DS benchmarks fuel analytics platform wars

As data sources and volumes grow, and as a data-driven orientation is increasingly deemed to be a competitive necessity, the war between platform vendors to provide the primary repository for our data ...

TweakTown

UL announces its 3DMark benchmark suite now runs natively on macOS, using Metal API

TL;DR: UL has launched a full native 3DMark benchmark suite for macOS, eliminating iOS frame rate limits and enhancing performance testing on powerful Macs. It includes advanced benchmarks like Steel ...

16d on MSN

Anthropic releases Claude Sonnet 4.6: Benchmark performance, how to try it

Anthropic's latest flagship model, Claude Sonnet 4.6, is out now.

13d

Backboard.io Becomes First AI Platform to Lead Both Major Memory Benchmarks, Accelerating the Era of Agentic AI

Backboard.io announced it has achieved state-of-the-art performance across both leading AI memory benchmarks, a first ...

Hosted on MSN

Grok Voice Agent API sets a new benchmark for real-time audio AI

Today marks an exciting moment for the developer community as xAI officially introduces the Grok Voice Agent API, opening the door for anyone to build powerful, real-time voice agents with ease.
