Intelligence,
commoditized.
The price of frontier-class intelligence halves every three months. Moore's Law halved the cost of computing every two years.
The price of intelligence collapsed.
Frontier-class intelligence has fallen roughly 200× since March 2023. Moore's Law would have cut it in half.
§4.1Three forces, compounding.
- 1.
Better chips.
Each new GPU generation does more work per dollar than the one before it. NVIDIA's Blackwell delivers roughly 3× the performance-per-dollar of Hopper. Rubin, shipping now, delivers another 5 to 10× on top. Every model running on newer hardware gets cheaper to train and to serve.
- 2.
Smaller models.
Fewer parameters mean fewer calculations per query, which means less hardware, less electricity, and less time. The model size needed to clear a given capability threshold has fallen 142× in two years, and the cost savings pass through almost directly to price.
- 3.
Better serving.
Once a model is trained, the software around it keeps getting smarter. Techniques like quantization, speculative decoding, and continuous batching were barely deployed three years ago. Today they let the same model on the same chips serve 5 to 10× more queries per GPU, dropping the cost of each query in proportion.