AI

Etched is building an AI chip that only runs one type of model

Comment

Data moving through a circuit board with CPU in the center.
Image Credits: Ignatiev / Getty Images

As generative AI touches a growing number of industries, the companies producing chips to run the models are benefiting enormously. Nvidia, in particular, wields massive influence, commanding an estimated 70% to 95% of the market for AI chips. Cloud providers from Meta to Microsoft are spending billions of dollars on Nvidia GPUs, wary of falling behind in the generative AI race.

It’s understandable, then, that generative AI vendors aren’t pleased with the status quo. A large portion of their success hinges on the whims of the dominant chipmakers. And so they, along with opportunist VCs, are on the hunt for promising upstarts to challenge the AI chip incumbents.

Etched is among the many, many alternative chip companies vying for a seat at the table — but it’s also among the most intriguing. Only two years old, Etched was founded by a pair of Harvard dropouts, Gavin Uberti (ex-OctoML and ex-Xnor.ai) and Chris Zhu, who along with Robert Wachen and former Cypress Semiconductor CTO Mark Ross, sought to create a chip that could do one thing: run AI models.

That’s not unusual. Plenty of startups and tech giants are developing chips that exclusively run AI models, also known as inferencing chips. Meta has MTIA, Amazon has Graviton and Inferentia, and so on. But Etched’s chips are unique in that they only run a single type of model: Transformers.

The transformer, proposed by a team of Google researchers back in 2017, has become the dominant generative AI model architecture by far.

Transformers underpin OpenAI’s video-generating model Sora. They’re at the heart of text-generating models like Anthropic’s Claude and Google’s Gemini. And they power art generators such as the newest version of Stable Diffusion.

“In 2022, we made a bet that transformers would take over the world,” Uberti, Etched’s CEO, told TechCrunch in an interview. “We’ve hit a point in the evolution of AI where specialized chips that can perform better than general-purpose GPUs are inevitable — and the technical decision-makers of the world know this.”

Etched’s chip, called Sohu, is an ASIC (application-specific integrated circuit) — a chip tailored for a particular application — made for running transformers. Manufactured using TSMC’s 4nm process, Sohu can deliver dramatically better inferencing performance than GPUs and other general-purpose AI chips while drawing less energy, claims Uberti.

“Sohu is an order of magnitude faster and cheaper than even Nvidia’s next generation of Blackwell GB200 GPUs when running text, image and video transformers,” Uberti said. “One Sohu server replaces 160 H100 GPUs. … Sohu will be a more affordable, efficient and environmentally friendly option for business leaders that need specialized chips.”

How does Sohu achieve all this? In a few ways, but the most obvious (and intuitive) is a streamlined inferencing hardware-and-software pipeline. Because Sohu doesn’t run non-transformer models, the Etched team could do away with hardware components not relevant to transformers and trim the software overhead traditionally used for deploying and running non-transformers.

Etched
A graph from Etched comparing hardware performance running Meta’s open model Llama 70B.
Image Credits: Etched

Etched is arriving on the scene at an inflection point in the race for generative AI infrastructure. Beyond cost concerns, the GPUs and other hardware components necessary to run models at scale today are dangerously power-hungry.

Goldman Sachs predicts that AI is poised to drive a 160% increase in data center electricity demand by 2030, contributing to a significant uptick in greenhouse gas emissions. Researchers at UC Riverside, meanwhile, estimate that global AI usage could cause data centers to suck up 1.1 trillion to 1.7 trillion gallons of fresh water by 2027, impacting local resources. (Many data centers use water to cool servers.)

Uberti optimistically — or bombastically, depending on how you interpret it — pitches Sohu as the solution to the industry’s consumption problem.

“In short, our future customers won’t be able to afford not to switch to Sohu,” Uberti said. “Companies are willing to take a bet on Etched because speed and cost are existential to the AI products they are trying to build.”

But can Etched, assuming it meets its goal of bringing Sohu to the mass market in the next few months, succeed when so many others are following close behind it?

The company lacks a direct competitor at present, but AI chip startup Perceive recently previewed a processor with hardware acceleration for transformers. Groq has also invested heavily in transformer-specific optimizations for its ASIC.

Competition aside, what if transformers one day fall out of favor? Uberti says, in that case, Etched will do the obvious: Design a new chip. Fair enough, but that’s a pretty drastic fallback option, considering how long it’s taken to bring Sohu to fruition.

None of these concerns have dissuaded investors from pouring an enormous amount of money into Etched, though.

Today, Etched said it has closed a $120 million Series A funding round, co-led by Primary Venture Partners and Positive Sum Ventures. Bringing Etched’s total raised to $125.36 million, the round saw participation from heavyweight angel backers including Peter Thiel (Uberti, Zhu and Wachen are Thiel Fellowship alums), GitHub CEO Thomas Dohmke, Cruise (and the Bot Company) co-founder Kyle Vogt, and Quora co-founder Charlie Cheever.

These investors presumably believe Etched has a reasonable chance of successfully scaling up its business of selling servers. Perhaps it does — Uberti claims unnamed customers have reserved “tens of millions of dollars” in hardware so far. The forthcoming launch of the Sohu Developer Cloud, which will let customers preview Sohu via an online interactive playground, should drive additional sales, Uberti suggested.

Still, it seems too early to tell whether this will be enough to propel Etched and its 35-person team into the future its co-founders are envisioning. The AI chip segment can be unforgiving in the best of times — see the high-profile near-failures of AI chip startups like Mythic and Graphcore, and the declining investment in AI chip ventures in 2023.

Uberti makes a strong sales pitch, though: “Video generation, audio-to-audio modalities, robotics, and other future AI use cases will only be possible with a faster chip like Sohu. The entire future of AI technology will be shaped by whether the infrastructure can scale.”

More TechCrunch

As venture capitalists continue to pour money into defense tech startups, they’re turning to a new hiring pool: ex-military officials.  

More ex-military officials are becoming VCs as defense tech investment reached $35B

Dark patterns refer to a range of design techniques that can subtly encourage users to take some sort of action or put their privacy at risk.

FTC study finds ‘dark patterns’ used by a majority of subscription apps and websites

Elon Musk faces several lawsuits for firing more than 6,000 Twitter employees, including then-CEO Parag Agrawal, following Musk’s 2022 takeover of the social media platform. On Tuesday, Musk defeated one…

Elon Musk does not owe ex-Twitter staffers $500 million in severance, court rules

Meta announced on Wednesday that users aged 10 to 12 will soon be able to interact with others in VR if they have their parents’ approval to do so. Up…

Meta will soon let kids aged 10 to 12 interact with others in VR with their parents’ approval

Generative AI is everywhere these days, but Amazon Web Services has been perceived in some circles as being late to the game. In reality it’s still early, and the market…

AWS App Studio promises to generate enterprise apps from a written prompt

Cybersecurity experts are criticizing Microsoft for data breach notification emails that are confusing customers.

Microsoft emails that warned customers of Russian hacks criticized for looking like spam and phishing

After securing $14 million for its second fund in 2023, early-stage VC firm Kearny Jackson is back with a third fund.

Marc Andreessen, Sequoia again back Kearny Jackson, this time in $65M Fund III

The question now is whether Spotify will add something similar for music artists in the future.

Spotify is no longer just a streaming app, it’s a social network

The core issue relates to a 2019 licensing change whereby Microsoft made it more expensive to run Microsoft’s enterprise software on rival cloud services.

Microsoft settles with European cloud trade body over antitrust complaints

Featured Article

From Facebook to the face of crypto: Inside Anthony Pompliano’s wild career

He’s known by a single-syllable name: Pomp. But his story is of an unconventional rise to success that almost ended two years after it began.

From Facebook to the face of crypto: Inside Anthony Pompliano’s wild career

As TikTok continues to test the waters with longer videos, Instagram Head Adam Mosseri has said the Meta-owned social network will continue to focus on short-form content. In an Instagram…

While TikTok chases YouTube, Instagram vows to focus on short-form content

Are you a Series A to B startup aiming to make a big splash in the tech world? Look no further than the ScaleUp Startups Exhibitor Program at TechCrunch Disrupt…

Elevate your startup with the ScaleUp Program at TechCrunch Disrupt 2024

While Samsung has maintained its own familiar design with the standard Galaxy Buds 3, the Pro are experiencing a sort of Apple identity crisis.

Samsung unveils Galaxy Buds 3 Pro and Buds 3, available for preorder now and shipping July 24

At Unpacked 2024, the company shared more details about the Galaxy Ring, which represents the first take on the category from a hardware giant.

Samsung’s Galaxy Ring, its first smart ring, arrives July 24 for $399

At the heart of the features is the Snapdragon 8 Gen 3, which is the same system on a chip that powered the Galaxy S24.

Samsung Galaxy Z Fold and Z Flip 6 arrive with Galaxy AI and Google Gemini

Vimeo joins TikTok, YouTube and Meta in implementing a way for creators to label AI-generated content. The video hosting service announced on Wednesday that creators must now disclose to viewers…

Vimeo joins YouTube and TikTok in launching new AI content labels

The search giant is updating its Gemini for Android app to be more suitable for foldables with the ability to use Gemini with overlay and split screen interfaces.

Google brings new Gemini features and Wear OS 5 to Samsung devices

The European Union has designated adult content website XNXX as subject to the strictest level of content regulation under the bloc’s Digital Services Act (DSA) after it notified the bloc…

XNXX joins handful of adult sites subject to EU’s strictest content moderation rules

This likely rules out reports of Apple gaining an observer seat.

As Microsoft leaves its observer seat, OpenAI says it won’t have any more observers

SaaS founders trying to figure out what it takes to raise their next round can refer to Point Nine’s famous yearly SaaS Funding Napkin. (The term refers to “back of…

Deep tech startups with very technical CEOs raise larger rounds, research finds

Iceland’s startup scene is punching above its weight. That’s perhaps in part because it kept the 2021 hype in check, but mostly because its tech ecosystem is coming of age.…

Iceland is dodging the VC doldrums as Frumtak Ventures lands $87M for its fourth fund

Index Ventures is announcing $2.3 billion in new funds to finance the next generation of tech startups globally. These new funds are spread across different stages with $800 million dedicated…

Index Ventures raises $2.3B for new venture and growth funds

Prompt engineering became a hot job last year in the AI industry, but it seems Anthropic is now developing tools to at least partially automate it. Anthropic released several new…

Anthropic’s Claude adds a prompt playground to quickly improve your AI apps

Hebbia, a startup that uses generative AI to search large documents and respond to large questions, has raised a $130 million Series B at a roughly $700 million valuation led…

AI startup Hebbia raised $130M at a $700M valuation on $13 million of profitable revenue

NovoNutrients has raised a $18 million Series A round from investors to build a pilot-scale facility to prove that its fermentation process works at scale.

NovoNutrients tweaks its bugs to turn CO2 into protein for people and pets

Seven years ago, Uber and Lyft blocked an effort to require ride-hailing app drivers to get fingerprinted in California. But by launching Uber for Teens earlier this year, the company…

Uber for Teens has reignited an old debate over fingerprinting drivers

Fast-food chain Whataburger’s app has gone viral in the wake of Hurricane Beryl, which left around 1.8 million utility customers in Houston, Texas without power. Hundreds of thousands of those…

Whataburger app becomes unlikely power outage map after Houston hurricane

Bumble’s new reporting option arrives at a time when, unfortunately, AI-generated photos on dating apps are common

Bumble users can now report profiles that use AI-generated photos

The concept of Airchat is fun, especially if you’re someone who loves to send voice memos instead of typing out long paragraphs on your phone keyboard.

Talky social app Airchat gets a major overhaul, making it more like an asynchronous Clubhouse

Featured Article

The fall of EV startup Fisker: A comprehensive timeline

Here is a timeline of the events that led fledgling automaker Fisker to file for bankruptcy.

The fall of EV startup Fisker: A comprehensive timeline