Tomasz Tunguz (Theory Ventures) · 2026-03-13

AI Infrastructure Shortage: Planning for Constrained Inference Until 2028

AI infrastructure faces severe capacity constraints across GPUs, power, data centers, and memory through 2028, with major cloud providers reporting unprecedented demand. As inference capacity is rationed, organizations must adapt by raising prices, adopting open-source models, and optimizing workloads instead of deploying frontier AI models for every task.

Metrics in this report

Inference Capacity Shortage Timeline

6 quarters (minimum)

Expected time until AI infrastructure capacity constraints ease

Publication Readership

150,000 subscribers (current)

Founders and operators reading this newsletter