Menu Pricing of Large Language Models

Abstract

We develop a framework for the optimal pricing and product design of LLMs in which a provider sells menus of token budgets to users who differ in their valuations across a continuum of tasks. Under a homogeneous production technology, we show that users’ high-dimensional type profiles are summarized by a scalar index, reducing the seller’s problem to one-dimensional screening. The optimal mechanism takes the form of committed-spend contracts: buyers pay for a budget that they allocate across token classes priced at marginal cost. We extend the analysis to environments with multiple differentiated models and to competition between a proprietary leader and an open-source fringe, showing that competitive pressure reshapes both the intensive and extensive margins of compute provision. Each element of our theory (token-budget menus, maximum- and minimum-spend plans, multi-model versioning, and linear API pricing) has a direct counterpart in the observed pricing practices of providers such as Anthropic, OpenAI, and GitHub.

Keywords

Large Language Models; Optimal Pricing; Menu Pricing; Fine-Tuning;

JEL codes

D47: Market Design
D82: Asymmetric and Private Information • Mechanism Design
D83: Search • Learning • Information and Knowledge • Communication • Belief

Reference

Dirk Bergemann, Alessandro Bonatti, and Alex Smolin, “Menu Pricing of Large Language Models”, TSE Working Paper, n. 25-1670, October 2025, revised March 2026.

Published in

TSE Working Paper, n. 25-1670, October 2025, revised March 2026

Search form

Menu Pricing of Large Language Models

Abstract

Keywords

JEL codes

Reference

See also

Published in

Menu Pricing of Large Language Models

Abstract

Keywords

JEL codes

Reference

See also

Published in

Share