Mark Heynen’s Post

View profile for Mark Heynen, graphic

Ex Google, Meta, and 4x founder. Advisor, operator and investor focused on AI, fintech, and emerging markets.

Shocker: It is cheaper, easier, and more private to run decentralized AI than to buy AI solutions from large cloud providers like Microsoft, OpenAI, Google, or Amazon. Of course, this is not really a shocker -- anyone who has worked in decentralized finance knows far too well that concentration of market power leads to escalating costs, over-reliance on weak controls, and nasty fine print. Microsoft recently shared that they have seen a $5B lift in ARR in Azure from their AI business. They separately shared that they have 4,500 AI customers, so lets put this at ~$1m per year per customer. They are also spending $3m per customer in capex, so are incentivized to keep these prices high. (source: https://lnkd.in/ghVSRTWs). Meanwhile, this excellent comparison of cloud AI costs by Isaac Lehman with a more baseline use case (read, not the Fortune 100) shows that while cheaper, there is still a hefty premium vs running your AI using a "roll your own" solution decentralized solution like Knapsack Enterprise. ($225K pa vs $70K pa). And there is no capex overhand forcing us to keep prices high. (source: https://lnkd.in/gEs9Msxc) Where things get really interesting is when you roll in the Knapsack desktop app that Cooper, Philip, and I have been building (now in private beta) which can actually carry a fair amount of the load and reduce enterprise cloud costs by another 10x. For those who want the full math: At a rate of 10k chats per day: 500 tok/chat / 20 tok/s = 25 s/chat 25 s/chat * 10k chats = 250,000s 250,000s / 86,400s/gpu-day = 2.89gpus in order to meet the quota of 10k chats per day. Rounding up to 3 gpus, this would cost between $12k and $20k depending on the hardware, or, if you rent from Lambda, $1,650 per month ($20k per year) + $50K in annual software licensing fees = $70K pa. If 90% of those queries can be handled on the edge, then $2K per year + $50K in licenses. Compare to Azure's cheapest option for Llama3 70b @ $18,635 per month ($225,000 per year).

  • No alternative text description for this image
Phaniindra Reddy ⚡

AI Sales for B2B and Enterprises | Clay | Voice AI | Agents | Fill your pipeline with leads with AI | Founder of Every-ai.com

1mo

this is pretty cool, we will be running a similar model such as mistral right?

Like
Reply

To view or add a comment, sign in

Explore topics