AITomasz Tunguz1 day ago

So You Want to Sell Inference

3 min read

Reselling inference at cost yields zero margin because customers compare prices to raw API costs and route around providers. Cost-plus pricing caps revenue at inference cost; value-based pricing charges per resolved task or generated report, decoupling price from compute. Optimization reduces inference cost to $0.70 via routing, caching, and distillation to small models.

Level

Hype check

Tap to vote and see what everyone thinks.

#inference #pricing #ai

So You Want to Sell Inference

More to chew on!

More to chew on!