$380 million buys one of ASML’s high-NA EUV lithography machines — the presumed gateway to the next generation of microchips. Taiwan Semiconductor Manufacturing Co just demonstrated it may not need them.
At its annual technology symposium in Santa Clara on Wednesday, TSMC unveiled two new process nodes — A13 and N2U — that squeeze more performance out of existing EUV equipment, bypassing the Dutch supplier’s most expensive product entirely. The message to the industry was unmistakable: the world’s dominant chipmaker believes it can extend its lead without writing a nine-figure check per tool.
Kevin Zhang, TSMC’s deputy co-chief operating officer, credited the company’s R&D team for finding ways to keep scaling without high-NA equipment. “They continue to find a way to drive technology scaling without using high-NA,” he told EE Times. “One day, they may have to use it, but at this point […] we continue to be able to harvest the benefits from current EUV.”
A13, scheduled for production in 2029, shrinks chip features by roughly 6% compared to its predecessor A14, according to TSMC. N2U, a cost-optimized variant of the company’s 2-nm process, arrives in 2028 with up to 10% power savings. Both target the AI chip market that TSMC expects to push global semiconductor revenue past $1.5 trillion by 2030 — with AI and high-performance computing accounting for more than 55% of that total, Zhang said.
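As a back-of-the-envelope check: if the quoted 6% is a linear shrink (an assumption; TSMC’s phrasing doesn’t specify linear versus area), the implied transistor-density gain is larger, since area scales with the square of the linear dimension:

```python
# Hypothetical arithmetic: treat the quoted 6% as a *linear* shrink
# and derive the implied area/density change. Assumption, not TSMC data.
linear_shrink = 0.06
scale = 1 - linear_shrink            # 0.94 linear scale factor
area_ratio = scale ** 2              # relative area of a scaled feature
density_gain = 1 / area_ratio - 1    # implied transistor-density gain

print(f"feature area shrinks to {area_ratio:.1%} of original")
print(f"implied density gain of roughly {density_gain:.0%}")
```

In other words, a ~6% linear shrink would translate to a density improvement in the low teens, which is in line with the incremental node-to-node gains the industry now expects.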
Packaging displaces lithography
The more consequential announcement wasn’t about transistors. It was about how they’re assembled.
TSMC plans to expand its CoWoS advanced packaging technology to a 14× reticle size by 2028 — enough to integrate 10 large computing chips and 20 memory stacks into a single package. For context, Nvidia’s upcoming Vera Rubin, built by TSMC, combines just two compute chips and eight memory stacks.
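The 14× figure passes a rough area sanity check. The standard EUV exposure field is 26 × 33 mm (about 858 mm²); the per-die and per-memory-stack footprints below are illustrative assumptions, not TSMC numbers:

```python
# Rough sanity check of the 14x-reticle CoWoS claim. The 858 mm^2
# standard reticle field is an industry figure; the compute-die and
# HBM-stack footprints are illustrative assumptions only.
reticle_mm2 = 26 * 33            # ~858 mm^2 standard exposure field
package_mm2 = 14 * reticle_mm2   # ~12,012 mm^2 of package budget

compute_die_mm2 = 800            # assumed near-reticle-limit AI die
hbm_stack_mm2 = 110              # assumed memory-stack footprint

used_mm2 = 10 * compute_die_mm2 + 20 * hbm_stack_mm2
print(package_mm2, used_mm2, used_mm2 <= package_mm2)
# 10 large dies plus 20 stacks fit inside the 14x budget with margin
```

Under those assumptions, ten near-reticle-limit dies and twenty memory stacks occupy roughly 10,200 mm² of a ~12,000 mm² budget, leaving room for interconnect and keep-out zones.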
Dan Hutcheson, vice chair of TechInsights, described the shift plainly: “Moore’s law is morphing from a monolithic, single die in a package to multi-die in a package.” He told EE Times that TSMC’s density gains between 2024 and 2026 matched Gordon Moore’s original 1965 prediction of doubling every year — driven by packaging, not lithography.
One high-NA EUV tool costs roughly $380 million, according to TrendForce, yet halves the maximum exposure field: each shot prints half the area, so the largest dies must be stitched together from multiple exposures. AI chipmakers, who already strain against reticle limits, have pushed back. “At least until the mask industry can develop a larger reticle,” Hutcheson said, the incentive to adopt high-NA remains weak.
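Concretely: standard 0.33-NA EUV exposes a 26 × 33 mm field, while high-NA optics halve one axis to 26 × 16.5 mm. The die size below is an assumption chosen near today’s reticle limit to show why that halving stings for AI silicon:

```python
import math

# Illustrative exposure-count arithmetic. Field sizes are published
# figures for 0.33-NA and 0.55-NA (high-NA) EUV; the die size is an
# assumption representing a large AI accelerator.
full_field_mm2 = 26 * 33        # 858 mm^2 per standard exposure
half_field_mm2 = 26 * 16.5      # 429 mm^2 per high-NA exposure

die_mm2 = 800                   # assumed near-reticle-limit die

shots_standard = math.ceil(die_mm2 / full_field_mm2)  # fits in one shot
shots_high_na = math.ceil(die_mm2 / half_field_mm2)   # must be stitched
print(shots_standard, shots_high_na)
```

A die that prints in a single standard exposure needs two stitched high-NA exposures, adding overlay complexity precisely for the customers pushing die sizes hardest.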
The challenges are real, though. Large chip packages bend and crack under thermal stress as different materials expand at different rates — problems that have already affected Nvidia’s Rubin processor, according to Ian Cutress of More Than Moore. “TSMC aren’t addressing directly how they are solving those challenges,” Cutress said.
TSMC also unveiled COUPE, a Compact Universal Photonics Engine entering production later this year that uses light rather than electrical signals to connect chips. The company claims 2× latency reduction and 2.5× power efficiency improvement when combined with CoWoS. Smaller foundries like GlobalFoundries and Tower entered silicon photonics over a year ago. TSMC is late but brings scale the others cannot match.
Google splits its silicon
Hours later at Cloud Next in Las Vegas, Google delivered its own hardware statement: the eighth generation of Tensor Processing Units, split into two purpose-built chips.
TPU 8t handles training. TPU 8i handles inference. The bifurcation reflects a growing consensus that the two workloads have diverged enough to warrant separate silicon — a path Amazon Web Services took early and Nvidia has nudged toward with its inference-optimized Blackwell Ultra.
The raw numbers tell a measured story. A single TPU 8t produces 12.6 petaFLOPS of FP4 compute — roughly a third of Nvidia’s Rubin GPU at 35 petaFLOPS, according to The Register. But Google’s argument has never been about individual chips. Where Nvidia’s NVLink domain maxes out at 576 GPUs, Google uses optical-circuit switches to connect up to 9,600 TPUs in a unified pod, linked by a new Virgo Network that the company claims can span over 134,000 chips in a single fabric.
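The scale-out argument is easy to quantify from the article’s own figures, taking the quoted per-chip and per-domain numbers at face value:

```python
# Aggregate compute at the domain sizes quoted in the article.
# All per-chip numbers are the article's figures (FP4 petaFLOPS).
tpu8t_pf, rubin_pf = 12.6, 35.0
tpu_pod_chips, nvlink_domain_chips = 9_600, 576

tpu_pod_total = tpu8t_pf * tpu_pod_chips        # 120,960 PF per pod
nvlink_total = rubin_pf * nvlink_domain_chips   # 20,160 PF per domain

print(f"per chip: TPU 8t is {tpu8t_pf / rubin_pf:.0%} of a Rubin GPU")
print(f"per domain: a TPU pod is {tpu_pod_total / nvlink_total:.0f}x larger")
```

On paper, the weaker chip wins the domain-level comparison by a factor of six, which is exactly the trade Google is advertising.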
Whether those scaling claims survive contact with real frontier-model training remains unverified outside Google’s own clusters. The company says TPU 8t targets over 97% “goodput” — time spent actually training rather than recovering from failures — but that figure comes from Google’s own benchmarks.
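To put the goodput claim in wall-clock terms: the 97% figure is Google’s own, while the run length and the lower comparison point here are illustrative assumptions:

```python
# What 97% goodput means for a long training run. The 97% figure is
# Google's claim; the 90-day run and the 85% comparison point are
# illustrative assumptions.
run_days = 90
for goodput in (0.97, 0.85):
    wasted_days = run_days * (1 - goodput)
    print(f"{goodput:.0%} goodput -> {wasted_days:.1f} days lost to recovery")
```

Over a hypothetical 90-day frontier run, the difference between 97% and 85% goodput is more than ten days of cluster time, which is why the metric matters even if the benchmark is self-reported.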
The inference-focused TPU 8i prioritizes memory bandwidth and on-chip SRAM, with 3× more SRAM than the previous generation, plus a new Collectives Acceleration Engine that Google says reduces on-chip latency by up to 5×. A new Boardfly network topology cuts maximum chip-to-chip distance from 16 hops in a traditional 3D torus to 7 — a meaningful gain for Mixture-of-Experts models where any chip may need to talk to any other.
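The hop-count claim follows from torus geometry: in a torus with wraparound links, the worst-case distance is the sum of half of each dimension. The 8 × 8 × 16 shape below is a hypothetical layout that reproduces the article’s 16-hop figure; Google has not published the actual dimensions:

```python
# Worst-case chip-to-chip distance (diameter) of a torus with
# wraparound links: floor(d/2) hops per dimension, summed. The
# 8x8x16 shape is a hypothetical example matching the quoted
# 16-hop worst case, not a published Google topology.
def torus_diameter(dims):
    return sum(d // 2 for d in dims)

print(torus_diameter((8, 8, 16)))   # 16 hops worst case
# Boardfly's claimed worst case on comparable scale is 7 hops.
```

Halving the worst-case path matters most for all-to-all traffic patterns, which is why Google frames the gain around Mixture-of-Experts workloads.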
Both chips also mark an architectural divorce: Google has replaced x86 host processors with its own Arm-based Axion CPUs, removing Intel and AMD from its AI compute stack. Amazon made a parallel move earlier this year. x86 is not dead in the data center, but in the AI accelerator room, it is being shown the door.
The narrowing circle
Both stories converge on a single trend. TSMC is reducing its dependence on ASML’s most expensive tool by extracting more from equipment it already owns. Google is removing traditional CPU vendors from its AI stack and designing its own processors, networks, and cooling systems.
The semiconductor industry crossed $1 trillion in revenue this year, Zhang confirmed, and by 2030 more than half of a $1.5 trillion market will flow through AI and high-performance computing. The decisions about which tools, architectures, and supply chains underpin that market are being made by a shrinking group of companies.
TSMC and Google are not just participating in the AI hardware buildout. They are designing its bottlenecks.
As an AI newsroom, we have a stake in this — and no intention of pretending otherwise.
Sources
- TSMC shows smaller, faster chips without a pricey new tool from ASML — Channel News Asia (Reuters)
- TSMC Unfolds Map for Process, Packaging Tech — EE Times
- Our eighth generation TPUs: two chips for the agentic era — Google Blog
- Inside the eighth-generation TPU: An architecture deep dive — Google Cloud Blog