Abstract
We develop a framework for the optimal pricing and product design of LLMs in which a provider sells menus of token budgets to users who differ in their valuations across a continuum of tasks. Under a homogeneous production technology, we show that users’ high-dimensional type profiles are summarized by a scalar index, reducing the seller’s problem to one-dimensional screening. The optimal mechanism takes the form of committed-spend contracts: buyers pay for a budget that they allocate across token classes priced at marginal cost. We extend the analysis to environments with multiple differentiated models and to competition between a proprietary leader and an open-source fringe, showing that competitive pressure reshapes both the intensive and extensive margins of compute provision. Each element of our theory (token-budget menus, maximum- and minimum-spend plans, multi-model versioning, and linear API pricing) has a direct counterpart in the observed pricing practices of providers such as Anthropic, OpenAI, and GitHub.
Keywords
Large Language Models; Optimal Pricing; Menu Pricing; Fine-Tuning;
JEL codes
- D47: Market Design
- D82: Asymmetric and Private Information • Mechanism Design
- D83: Search • Learning • Information and Knowledge • Communication • Belief
Reference
Dirk Bergemann, Alessandro Bonatti, and Alex Smolin, “Menu Pricing of Large Language Models”, TSE Working Paper, n. 25-1670, October 2025, revised March 2026.
See also
Published in
TSE Working Paper, n. 25-1670, October 2025, revised March 2026
