Daily AI Intelligence for Builders
Curated news, comprehensive benchmarks, and actionable insights for developers building with AI technology.
Latest AI News
March is in full bloom, and that means a fresh wave of games heading to the cloud. 15 new titles are joining the GeForce NOW library this month. Leading the March lineup is Pearl Abyss’ Crimson Desert, an open‑world action‑adventure set in a war‑torn fantasy land, alongside plenty of other games to explore. Whether looking […]
Scott Shambaugh didn’t think twice when he denied an AI agent’s request to contribute to matplotlib, a software library that he helps manage. Like many open-source projects, matplotlib has been overwhelmed by a glut of AI code contributions, and so Shambaugh and his fellow maintainers have instituted a policy that all AI-written code must be…
We are pleased to announce Phi-4-reasoning-vision-15B, a 15 billion parameter open‑weight multimodal reasoning model, available through Microsoft Foundry, HuggingFace and GitHub. Phi-4-reasoning-vision-15B is a broadly capable model that can be used for a wide array of vision-language tasks such as image captioning, asking […]
Canvas in AI Mode is now available for everyone in the U.S. Plus, it can now help you draft documents or build interactive tools.
LLM Benchmark Leaderboard
Top 5 language models by MMLU performance
| Model Name | Model Family | Score |
|---|---|---|
| Claude 3 Opus | Anthropic | 86.8% |
| GPT-4 Turbo | OpenAI | 86.5% |
| GPT-4 | OpenAI | 86.4% |
| Gemini 1.5 Pro | Google | 85.9% |
| Llama 3 70B | Meta | 82% |
Latest Insights
AI insights and builder-focused content
A practical decision framework for selecting LLMs based on cost, latency, capabilities, and context requirements.
A signal-over-noise platform for AI builders featuring curated news, LLM benchmarks, and practical insights delivered daily.
An honest analysis of the current AI developer tools landscape, from code editors to testing frameworks and deployment platforms.
What We Offer
Everything you need to stay informed and make data-driven decisions about AI technology.