Friday 15 Podcast

Are AI Token Costs Getting Out of Control? What It Means for Your eCommerce Stack

Brian Beck and Andy Hoar on runaway AI token bills, why usage is a poor measure of value, and what consumption-based pricing will mean for the ecommerce and marketing platforms B2B teams already run.

Friday 15 Podcast

Key takeaways

  • Companies that pushed employees to use AI without usage caps are now seeing very large bills, including a widely cited and contested report of one firm spending about $500 million on Claude in a month, and Uber reportedly burning its 2026 AI budget within months.
  • AI tools are metered by tokens, and agentic workflows consume far more tokens than simple chatbot queries, so costs scale with usage in a way traditional per-seat software did not.
  • Token usage is a poor measure of value or ROI, because the worth of an output has little to do with how many tokens it took to produce.
  • The hosts expect the pattern seen in past infrastructure buildouts: per-token prices fall over time, while total usage keeps climbing as more uses are found, so the two trends can offset each other.
  • For B2B ecommerce leaders, AI-embedded platforms across commerce, search, and marketing are likely to move toward consumption-based pricing, which makes forecasting and governing AI spend a P&L issue.

Amazon passes Walmart, and the engine behind it

The hosts opened with the news that Amazon has overtaken Walmart as the largest company in the world by revenue. Amazon reported $716.9 billion for its most recent fiscal year, narrowly ahead of Walmart’s record $713.2 billion, which ended Walmart’s roughly 13-year run at the top of the Fortune 500.

Andy focused on what powered the result. A meaningful and growing share of Amazon’s revenue comes from businesses that are not first-party retail, chiefly AWS and a fast-growing advertising arm, where Walmart’s revenue is still overwhelmingly retail. He tied this to a habit of mind: Amazon has long built infrastructure for itself and then sold it to others, including running Target’s website from 2001 to 2011 and turning its internal compute into AWS. Walmart, by contrast, did not launch Walmart Fulfillment Services until 2020. Brian’s read was that older, larger companies tend to protect their existing business and move slowly, while Amazon has been willing to compete with itself, as it did when it opened its marketplace to third-party sellers two decades ago. The throughline for both hosts was to follow the money and model the opportunity rather than dismiss it.

AI sticker shock: the bill nobody was watching

The main topic grew out of a pattern in recent headlines. Companies have been pushing employees hard to adopt AI, in some cases mandating it, building internal usage competitions, and tying it to performance reviews. Reporting from the Wall Street Journal describes firms including Meta, Microsoft, and Salesforce pressing employees to use AI, and the term “tokenmaxxing” has emerged for the race to consume as many tokens as possible. The catch is that few finance teams were tracking the resulting spend.

Three years ago, would you or I have thought we would be doing a podcast titled, are AI token costs getting out of control?

Andy Hoar, Master B2B

The example that set off the conversation came from an Axios report in late May, which said AI sticker shock had hit corporate America. It described a single consultant’s client that reportedly spent about $500 million in one month on Claude after failing to put usage limits on employee licenses. Both hosts noted that the company was never named, and the figure has since been questioned, with at least one analysis arguing that a bill that size would require deliberate, large-scale use rather than a forgotten setting. The more verifiable cases point the same direction. Uber reportedly burned through its 2026 AI budget within a few months, with its leadership saying the spend was getting hard to justify, and Microsoft has scaled back internal Claude Code licenses over cost.

How big the token numbers have gotten

To scope the scale, Andy cited Google’s own disclosure that it processes about 3.2 quadrillion tokens a month, roughly 40 quadrillion a year and about a sevenfold increase from the year before. He offered a way to picture it: counting one number per second, reaching 40 quadrillion would take well over a billion years. The driver behind the spend is agentic AI, which repeats queries in sequence and can consume far more tokens than a single chatbot question. Goldman Sachs has projected that token consumption could grow many times over by the end of the decade.

Why token usage is a poor measure of value

The hosts kept returning to the ROI question, and their answer was that token usage is a weak proxy for it. If employees are rewarded for consumption, they will use AI for everything, including trivial tasks, which inflates the bill without adding value. Andy described a four-person startup that reportedly spent about $113,000 a month on tokens and drew criticism for it, while the founders argued it was working for them. His point was that the return depends on the use case, the speed gained, and the revenue produced, none of which a raw token count captures. Reporting since has reached a similar conclusion, that the value of an output has little to do with how many tokens it took to generate.

A familiar pattern: infrastructure buildouts and falling prices

Andy framed the current moment against past infrastructure cycles. He pointed to the late-1990s fiber-optic buildout, when companies overbuilt capacity and the price of a T1 line fell sharply, contributing to bankruptcies such as Global Crossing. He expects token prices to follow a similar path downward as supply rises to meet demand. Brian connected the spending by Amazon, Google, and others, including Amazon’s plan to spend up to $200 billion on AI infrastructure in 2026, to earlier bets on data and communications infrastructure. The shared view was that prices come down once the capacity is in place.

What it means for your ecommerce platform bill

For practitioners, the practical question Brian raised is what happens to the cost of the platforms they already run, since commerce, search, and marketing tools are all embedding AI. Andy’s prediction was a shift in how these tools are priced.

You can’t have a variable model and a fixed model. It has to be a consumption-based model.

Andy Hoar, Master B2B

He expects pricing to move from fixed subscriptions toward consumption, then moderate and stabilize, likely with caps and tiers once both sides get more disciplined. He used an energy analogy to explain the current overuse: if a new power source were offered for a flat twenty dollars a month with unlimited use, people would leave every light on. Flat, unlimited AI plans create the same incentive, which is why he expects metered, consumption-based pricing to win out. The takeaway for B2B ecommerce leaders is that AI cost is becoming a P&L line worth forecasting and governing, rather than a fixed software fee to set and forget.

The real open question is usage

The hosts closed on the tension that will define the next few years. The cost per token is widely expected to fall, but usage is expected to climb as teams find more uses for AI.

The water will find the equilibrium, but the price is going to come down. The question is how the usage is going to go.

Andy Hoar, Master B2B

If per-token prices drop while consumption keeps rising, total spend may not fall much even as the unit economics improve. For now, the discipline both hosts recommended is straightforward: track where AI is creating value, set sensible limits, and avoid using expensive tools for tasks that do not need them.

Frequently asked questions

What is an AI token, and why does it affect cost?

A token is the basic unit of text an AI model processes, roughly three-quarters of a word on average. AI vendors meter and price usage by tokens, counting both the text sent to a model and the text it returns. Because every query consumes tokens, and because agentic workflows repeat queries in sequence, costs rise with how much the tools are used rather than staying fixed per user.

Why are companies suddenly seeing large AI bills?

Many companies encouraged or required employees to use AI heavily without putting limits or monitoring in place, often rewarding high usage. Because AI is billed by token consumption and agentic tools can use far more tokens than a single chatbot query, spending climbed quickly. In several reported cases, organizations exhausted annual AI budgets within a few months or received unexpectedly large monthly invoices before finance teams noticed.

Did a company really spend $500 million on Claude in one month?

An Axios report in May 2026 described a consultant whose client reportedly spent about $500 million in a single month on Claude after failing to set usage limits on employee licenses. The company was not named, and the figure has been questioned, with some analysts arguing a bill that large would reflect deliberate, large-scale deployment rather than an accidental oversight. The broader trend of rising enterprise AI costs is well documented, even if this specific number is disputed.

Is token usage a good way to measure AI ROI or productivity?

Token usage is widely considered a poor measure of value. The number of tokens consumed reflects how much text a model processed, not whether the output was useful. Rewarding employees for high token usage tends to encourage using AI for trivial tasks, inflating costs without improving results. A better measure ties AI spend to outcomes such as revenue, time saved, or quality of work.

Will AI token costs come down over time?

Most observers expect the price per token to fall as compute supply expands, similar to how bandwidth costs dropped after the fiber-optic buildout of the late 1990s. The complication is that overall usage is expected to rise sharply as organizations find more applications for AI. As a result, the cost per token can decline while total spending stays high or grows, so falling unit prices do not guarantee lower bills.

How could rising AI costs affect the price of ecommerce and other software platforms?

Commerce, search, and marketing platforms are increasingly embedding AI, which carries token costs that vendors must recover. This is likely to push pricing away from flat subscriptions and toward consumption-based models, where charges scale with usage, before stabilizing with caps and tiers. For buyers, that makes AI a variable cost worth forecasting and governing, rather than a fixed software fee.

Sources & methodology

  1. Axios, AI sticker shock hits corporate America, May 2026
  2. Fortune, on U.S. AI adoption and rising costs, June 2026
  3. The Wall Street Journal, on enterprise AI mandates and token spending
  4. CNBC and Fortune, on Amazon surpassing Walmart in annual revenue, February 2026
  5. Computerworld, on Google's token volume disclosure by Sundar Pichai
  6. Friday 15 Podcast, Master B2B
Andy Hoar Andy Hoar
Co-Founder, Master B2B

Andy is a Co-Founder of Master B2B, founder of Paradigm B2B and author of the book Bot2Bot: The New Future of B2B Commerce. Andy is one of the leading global authorities on B2B commerce strategy.

Brian Beck Brian Beck
Co-Founder, Master B2B

Brian is a co-founder of Master B2B, Managing Partner of Amazon agency Enceiba, and author of the book "Billion Dollar B2B Ecommerce." Brian has also been C-level digital commerce executive with two decades of experience.

New: B2B Exchange at Shoptalk Presented by Master B2B

X