Document de travail

Menu Pricing of Large Language Models

Dirk Bergemann, Alessandro Bonatti et Alex Smolin

Résumé

We develop a framework for the optimal pricing and product design of LLMs in which a provider sells menus of token budgets to users who differ in their valuations across a continuum of tasks. Under a homogeneous production technology, we show that users’ high-dimensional type profiles are summarized by a scalar index, reducing the seller’s problem to one-dimensional screening. The optimal mechanism takes the form of committed-spend contracts: buyers pay for a budget that they allocate across token classes priced at marginal cost. We extend the analysis to environments with multiple differentiated models and to competition between a proprietary leader and an open-source fringe, showing that competitive pressure reshapes both the intensive and extensive margins of compute provision. Each element of our theory (token-budget menus, maximum- and minimum-spend plans, multi-model versioning, and linear API pricing) has a direct counterpart in the observed pricing practices of providers such as Anthropic, OpenAI, and GitHub.

Mots-clés

Large Language Models; Optimal Pricing; Menu Pricing; Fine-Tuning;

Codes JEL

  • D47: Market Design
  • D82: Asymmetric and Private Information • Mechanism Design
  • D83: Search • Learning • Information and Knowledge • Communication • Belief

Référence

Dirk Bergemann, Alessandro Bonatti et Alex Smolin, « Menu Pricing of Large Language Models », TSE Working Paper, n° 25-1670, octobre 2025, révision mars 2026.

Voir aussi

Publié dans

TSE Working Paper, n° 25-1670, octobre 2025, révision mars 2026