TechRadar

James Capell

Alibaba unveils the network and datacenter design it uses for large language model training


Alibaba has revealed its datacenter design for LLM training: an Ethernet-based network in which each host contains eight GPUs and nine NICs, each NIC with two 200 Gb/s ports.

The tech giant, which also offers one of the best large language models (LLMs) around in its 110-billion-parameter Qwen model, says this design has been used in production for eight months. It aims to maximize the utilization of a GPU's PCIe capabilities, increasing the send/receive capacity of the network.

Another feature that increases speed is the use of NVLink for the intra-host network, which provides more bandwidth between the GPUs within a host. Each port on a NIC is connected to a different top-of-rack switch, avoiding a single point of failure, in a design that Alibaba calls rail-optimized.

Each pod contains 15,000 GPUs

A new type of network is required because traffic patterns in LLM training differ from those in general cloud computing: the traffic is low-entropy and bursty. There is also a higher sensitivity to faults and single points of failure.

"Based on the unique characteristics of LLM training, we decided to build a new network architecture specifically for LLM training. We should meet the following goals; scalability, high performance, and single-ToR fault tolerance," the company said.

Alibaba also revealed the cooling mechanism. As no vendor could provide a solution that keeps chips below 105°C, the temperature at which switches begin to shut down, Alibaba designed and built its own vapor chamber heat sink, with more wick pillars at the center of the chip to carry heat away more efficiently.

The design for LLM training is organized into pods that contain 15,000 GPUs, and each pod can be located in a single datacenter. "All datacenter buildings in commission in Alibaba Cloud have an overall power constraint of 18MW, and an 18MW building can accommodate approximately 15K GPUs. In conjunction with HPN, each single building perfectly houses an entire Pod, making predominant links inside the same building," Alibaba wrote.
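To put those figures in perspective, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers quoted above (18MW per building, roughly 15,000 GPUs per pod, eight GPUs per host); the function names are ours, chosen for illustration. The per-GPU figure is a share of the whole building's power budget, which presumably also covers CPUs, networking and cooling, so it should not be read as the power draw of a GPU itself.

```python
# Figures quoted by Alibaba in the article.
BUILDING_POWER_MW = 18     # overall power constraint per datacenter building
GPUS_PER_POD = 15_000      # "approximately 15K GPUs" per building/pod
GPUS_PER_HOST = 8          # each host contains eight GPUs


def power_share_per_gpu_kw(building_mw: float, gpus: int) -> float:
    """Building power budget divided evenly across GPUs, in kilowatts.

    This is an upper-bound share per GPU slot, not a measured GPU draw.
    """
    return building_mw * 1000 / gpus


def hosts_per_pod(gpus: int, gpus_per_host: int) -> int:
    """Number of hosts needed to hold the pod's GPUs."""
    return gpus // gpus_per_host


if __name__ == "__main__":
    share = power_share_per_gpu_kw(BUILDING_POWER_MW, GPUS_PER_POD)
    hosts = hosts_per_pod(GPUS_PER_POD, GPUS_PER_HOST)
    print(f"Power budget share per GPU: {share:.1f} kW")
    print(f"Hosts per pod: {hosts}")
```

On these numbers, each GPU slot gets about 1.2 kW of the building's budget, and a pod works out to 1,875 eight-GPU hosts.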

Alibaba also wrote that it expects model parameters to continue to rise by an order of magnitude in the next several years, from one trillion to 10 trillion, and that its new architecture is planned to support this and scale to 100,000 GPUs.
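As a rough illustration of what that target implies, the sketch below estimates how many pods a 100,000-GPU deployment would span. It assumes, purely for illustration, that scaling happens in whole pods of about 15,000 GPUs each; the article does not say how Alibaba plans to reach that scale, and the function name is ours.

```python
GPUS_PER_POD = 15_000    # pod size quoted in the article
TARGET_GPUS = 100_000    # scale the architecture is planned to support


def pods_needed(target_gpus: int, gpus_per_pod: int) -> int:
    """Smallest whole number of pods that holds target_gpus (ceiling division)."""
    return (target_gpus + gpus_per_pod - 1) // gpus_per_pod


if __name__ == "__main__":
    print(f"Pods needed: {pods_needed(TARGET_GPUS, GPUS_PER_POD)}")
```

Under that assumption, 100,000 GPUs would span seven pods, and therefore seven 18MW buildings if each pod continues to fill one building.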

Via The Register

