In September, Alfred Stephen, a freelance software developer in Singapore, purchased a ChatGPT Plus subscription, which costs $20 a month and offers more access to advanced models, to speed up his work. But he grew frustrated with the chatbot’s coding abilities and its gushing, meandering replies. Then he came across a post on Reddit about…
Daily AI Intelligence for Builders
Curated news, comprehensive benchmarks, and actionable insights for developers building with AI technology.
Latest AI News
View All NewsLearn more about Google Photos’ new Ask button as well as other functions of its Ask Photos feature.
We’re sharing our latest updates designed to support families, kids and teens in the digital world.
This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. Lots of influential people in tech last week were describing Moltbook, an online hangout populated by AI agents interacting with one another, as a glimpse into the future. It appeared to show…
For years, our newsroom has explored AI’s limitations and potential dangers, as well as its growing energy needs. And our reporters have looked closely at how generative tools are being used for tasks such as coding and running scientific experiments. But how is AI actually being used in fields like health care, climate tech, education,…
LLM Benchmark Leaderboard
Top 5 language models by MMLU performance
| Model Name | Model Family | Score |
|---|---|---|
| Claude 3 Opus | Anthropic | 86.8% |
| GPT-4 Turbo | OpenAI | 86.5% |
| GPT-4 | OpenAI | 86.4% |
| Gemini 1.5 Pro | 85.9% | |
| Llama 3 70B | Meta | 82% |
Latest Insights
AI insights and builder-focused content
A practical decision framework for selecting LLMs based on cost, latency, capabilities, and context requirements.
A signal-over-noise platform for AI builders featuring curated news, LLM benchmarks, and practical insights delivered daily.
An honest analysis of the current AI developer tools landscape, from code editors to testing frameworks and deployment platforms.
What We Offer
Everything you need to stay informed and make data-driven decisions about AI technology.