Tech

Microsoft Research accelerates LLM processing with Splitwise

Large AI language models (LLMs) require massive computing power, which generally comes from specialized GPUs. These GPUs are expensive and consume a lot of electricity, which makes them a significant cost factor for cloud AI providers as well.

Researchers from the Microsoft Azure team took on the problem and came up with a notable solution. A new technique called Splitwise aims to make inference calculations for LLMs significantly more efficient and sustainable. Processing is divided into two phases, prompt processing and token generation, which are distributed across different GPU clusters and machines. Splitwise takes advantage of the fact that prompt processing requires a large amount of GPU compute capacity, while token generation relies on high memory bandwidth.
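
The two-phase split can be illustrated with a small Python sketch. This is a toy model under stated assumptions, not Microsoft's implementation: the names `Request`, `run_prompt_phase`, `run_token_phase`, and `serve`, the round-robin assignment, and the machine labels are all hypothetical and only show how each request passes through one pool of machines for the prompt and a separate pool for token generation.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Request:
    request_id: int
    prompt_tokens: int          # length of the input prompt
    output_tokens: int          # number of tokens to generate
    state_ready: bool = False   # set once the prompt has been processed
    generated: int = 0
    trace: list = field(default_factory=list)  # (phase, machine) pairs


def run_prompt_phase(request, machine):
    # Compute-heavy phase: the whole prompt is processed in one pass.
    request.state_ready = True
    request.trace.append(("prompt", machine))


def run_token_phase(request, machine):
    # Memory-bandwidth-heavy phase: tokens are produced one at a time.
    if not request.state_ready:
        raise RuntimeError("token phase needs the result of the prompt phase")
    while request.generated < request.output_tokens:
        request.generated += 1
    request.trace.append(("token", machine))


def serve(requests, prompt_machines, token_machines):
    # Hypothetical round-robin assignment; a real scheduler would be load-aware.
    handoff = deque()
    for i, request in enumerate(requests):
        run_prompt_phase(request, prompt_machines[i % len(prompt_machines)])
        handoff.append(request)  # stands in for moving state between machines
    done = []
    i = 0
    while handoff:
        request = handoff.popleft()
        run_token_phase(request, token_machines[i % len(token_machines)])
        done.append(request)
        i += 1
    return done


if __name__ == "__main__":
    requests = [Request(0, 512, 4), Request(1, 128, 2), Request(2, 64, 3)]
    for r in serve(requests, ["prompt-0", "prompt-1"], ["token-0"]):
        print(r.request_id, r.generated, r.trace)
```

Because the two pools are separate, each could in principle be equipped with hardware suited to its phase, which is the property the article says Splitwise exploits.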

Details about Splitwise are described in a detailed paper. With Splitwise, Microsoft aims to achieve 1.4 times the throughput at 20 percent lower cost than previous system designs, or 2.35 times the throughput at the same cost and power budget. (Yupi)
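
To compare the two stated targets on a common basis, one can compute throughput per unit of cost relative to a baseline design. The short Python calculation below is a derived illustration only: the ratio is not a figure from the article, and the function name `throughput_per_cost` is hypothetical.

```python
def throughput_per_cost(relative_throughput, relative_cost):
    # Both arguments are relative to a baseline design (baseline = 1.0).
    return relative_throughput / relative_cost


# Target 1: 1.4x throughput at 20 percent lower cost (relative cost 0.8).
cost_optimized = throughput_per_cost(1.4, 0.8)

# Target 2: 2.35x throughput at the same cost (relative cost 1.0).
throughput_optimized = throughput_per_cost(2.35, 1.0)

if __name__ == "__main__":
    print(round(cost_optimized, 2))        # 1.75
    print(round(throughput_optimized, 2))  # 2.35
```

Under this simple measure, the first target corresponds to roughly 1.75 times the baseline throughput per unit of cost, and the second to 2.35 times.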
