Discussion about this post

User's avatar
JP's avatar

Good comparison guide. One thing that doesn't show up in these tables -- quantisation levels. Several budget providers serve quantised versions of models without disclosing it, so the benchmark numbers you're comparing against don't match what you're actually getting. The 6x price gap between Claude and Kimi K2.5 looks wild until you account for what each is actually serving: https://sulat.com/p/the-real-cost-of-cheap-ai-inference

Paul Gibbons's avatar

VERY useful analysis/ breakdown.

No posts

Ready for more?