Photo · Avery Evans / Unsplash
Analysis · Today's Memo
The Token Burn Economy: Why AI Usage Limits Are Really Workflow Limits
Everyone blames the rate limits. The real bill is being run up by bad session architecture — and businesses are about to discover the same problem at ten thousand times the scale.
By The Memo · June 20, 2026 · 11 min read
Every few weeks a new thread surfaces on Reddit and Hacker News: someone has hit their Claude or ChatGPT usage cap by lunchtime and they are, understandably, annoyed. The replies fall into two camps. One blames the lab — the limits are too tight, the pricing is greedy, the model is being deliberately throttled. The other camp, smaller and quieter, says something more interesting: you're not hitting a usage limit. You're hitting a workflow limit.
Continue reading →