Burrowed in the alleys of Hongik-dong, a hushed residential neighborhood in eastern Seoul, is a faded stone-tiled building stamped “Korea Baduk Association,” the governing body for professional Go. The game is an ancient one, with sacred stature in South Korea. But inside the building, rooms once filled with the soft clatter of hands dipping into…
Daily AI Intelligence for Builders
Curated news, comprehensive benchmarks, and actionable insights for developers building with AI technology.
Latest AI News
View All NewsLilly this week launched the most powerful AI factory wholly owned and operated by a pharmaceutical company to help its teams make meaningful medical advancements faster, more accurately and at unprecedented scale. Dubbed LillyPod, it’s the world’s first NVIDIA DGX SuperPOD with DGX B300 systems.
Google is partnering with the Massachusetts AI Hub to provide every Baystater with no-cost access to Google’s AI training.
New alternatives, “understand” and “ask” buttons in Google Translate help you navigate the complexities of natural language.
By mid-morning, a typical knowledge worker is already juggling a client report, a budget spreadsheet, a slide deck, and an email backlog, all interdependent and all demanding attention at once. For AI agents to be genuinely useful in that environment, they will need to operate the same way, but today’s best models are evaluated one […] The post CORPGEN advances AI agents for real work appeared first on Microsoft Research.
LLM Benchmark Leaderboard
Top 5 language models by MMLU performance
| Model Name | Model Family | Score |
|---|---|---|
| Claude 3 Opus | Anthropic | 86.8% |
| GPT-4 Turbo | OpenAI | 86.5% |
| GPT-4 | OpenAI | 86.4% |
| Gemini 1.5 Pro | 85.9% | |
| Llama 3 70B | Meta | 82% |
Latest Insights
AI insights and builder-focused content
A practical decision framework for selecting LLMs based on cost, latency, capabilities, and context requirements.
A signal-over-noise platform for AI builders featuring curated news, LLM benchmarks, and practical insights delivered daily.
An honest analysis of the current AI developer tools landscape, from code editors to testing frameworks and deployment platforms.
What We Offer
Everything you need to stay informed and make data-driven decisions about AI technology.