This chip startup just raised $135 million on a bet that AI's biggest bottleneck is memory, not compute. - BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates

Every time you ask ChatGPT a question, that request triggers a data relay race. Information comes out of memory, passes through the CPU for preprocessing, goes to the GPU for heavy calculations, and then comes back. The entire process is repeated for every word the AI generates.

The bottleneck is structural, meaning each request is routed through some of the industry’s most expensive and power-hungry chips. This inefficiency is exactly what XCENA, a startup with offices in South Korea and the United States, is trying to solve. The four-year-old startup has designed a chip that brings computing capabilities much closer to DRAM, a high-speed, short-term memory chip that stores data that a processor is actively using. This allows routine data operations to be processed close to memory without costly round trips between CPU, GPU, and memory.

If it works at scale, the cost implications of AI infrastructure can be significant, which largely explains the enthusiasm of investors across the country. In fact, XCENA raised $135 million in Series B at a valuation of $570 million, bringing its total funding to $185 million.

XCENA CEO Jin Kim co-founded the startup in 2022 with CTO Dohoon Kim and CPO Harry Joo-hyun Kim, veterans of Samsung and SK Hynix, the memory giants that supply the chips that power NVIDIA’s GPUs. “CPUs and GPUs have both gotten smarter over the decades, but memory has never gotten smarter. XCENA wants to change that,” Kim said in an interview with TechCrunch. “Recent increases in memory prices and related stocks indicate a broader shift in AI infrastructure toward memory-centric architectures,” he added. (This month, the three companies that dominate the global memory chip market, Samsung, SK Hynix, and Micron, each surpassed $1 trillion in valuation for the first time.)

XCENA is betting its business on the hypothesis that “inference is not just a computing problem, but increasingly a memory scaling problem,” Kim said.

XCENA’s chip, MX1, connects to the CPU through a Compute Express Link (CXL), essentially a dedicated express lane between the processor and memory, and processes data before it leaves the memory module. This brings compute to data, not the other way around. The company claims that what previously required 10 servers could now be done on just one.

“While GPUs are great at matrix multiplication (the complex calculations behind AI model training), much of the surrounding data orchestration still occurs on the CPU, such as preprocessing, KV cache management (a system that saves previous conversation context so the model doesn’t have to reprocess it), and data caching. Our chip handles these tasks directly within the memory module itself,” Kim said.

Demand for memory solutions has been surging since the second half of last year, and the company believes the timing is working in its favor.

Talks are in the early stages with several global memory vendors, which Kim declined to name. The company’s ideal customers are hyperscalers that spend tens of billions of dollars annually on AI infrastructure, and even small improvements in memory efficiency can lead to hundreds of millions of dollars in savings.

MX1 is still a prototype. Mass-produced chips are expected to be shipped from Samsung’s foundry lines by the end of 2026, and the company expects to generate revenue starting in 2027.

As neural processing unit (NPU) manufacturers race to take on Nvidia for training workloads, XCENA targets the memory-intensive layers underneath.

XCENA’s closest competitors include Astera Labs and Marvell, both Nasdaq-listed companies working on next-generation memory connectivity. Marvell is already a large, established company operating in the same space, Kim said, adding that the differentiating factor ultimately comes down to intellectual property. “We have thousands of cores,” Kim said. Based on public specifications, Marvell’s approach relies on a comparatively small number of general-purpose cores.

These cores are built on RISC-V, an open source chip design blueprint, and are specifically optimized for data processing, with each core intentionally kept small and efficient. Beyond the core itself, XCENA designs its own internal memory hierarchy, interconnect bus, and DRAM controller. This is the level of vertical integration that most chip companies, including their larger rivals, typically outsource.

Seoul-based venture capital firms Altinum and IMM Investment co-led the Series B round along with Corstone Asia and existing investors SBI Investment and Mirae Asset Capital. The company has more than 90 staff in offices in Pangyo, a technology hub on the outskirts of Seoul, and Sunnyvale, and is in talks with overseas investors to raise additional funding.

If you buy through links in our articles, we may earn a small commission. This does not affect editorial independence.

Source link

What's Hot

Two refugee boats disappear off the coast of Myanmar, leaving more than 500 people missing, UN agency says

The Miz says wife Maryse Mizanin and daughters are his champions

“I may be one of the 10 luckiest people in the world.”

This chip startup just raised $135 million on a bet that AI’s biggest bottleneck is memory, not compute.

Applied Computing wants to provide oil and gas operators with plant-wide AI models

Microsoft is reportedly training salespeople to silence OpenAI and Anthropic

Microsoft patches record number of security vulnerabilities due to use of AI

Whatnot acquires Shaped to power real-time live shopping recommendations

Newly freed hostages face long road to recovery after two years in captivity

Former Kenyan Prime Minister Raila Odinga dies at 80

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

Russia expands drone targeting on Ukraine’s rail network

The Miz says wife Maryse Mizanin and daughters are his champions

‘Love Story’ star Sarah Pidgeon reacts to co-star Paul Anthony Kelly’s Emmy snub

Olivia Wilde details unscripted kiss with Penelope Cruz in ‘The Invite’

Ian Somerhalder quits Hollywood acting for Netflix’s ‘V-Wars’

Our Picks

Two refugee boats disappear off the coast of Myanmar, leaving more than 500 people missing, UN agency says

Russia attacks Kiev hours before outgoing British leader Starmer arrives for talks with Zelenskiy

Live updates: US shells Iranian targets for 5th day, hits tanker bound for major oil port

Subscribe to Updates

What's Hot

This chip startup just raised $135 million on a bet that AI’s biggest bottleneck is memory, not compute.

Related Posts

Subscribe to Updates