
New AI benchmark tests speed of responses to user queries – March 27, 2024 at 4:00 pm

The AI benchmarking group MLCommons released a new set of tests and results on Wednesday to evaluate the speed at which cutting-edge hardware can run AI applications and respond to user requests.

The two new MLCommons benchmarks measure the speed at which AI chips and systems can generate answers from powerful, data-packed AI models. The results show roughly how quickly an AI application like ChatGPT can provide an answer to a user's query.
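To make the idea of "speed of responses" concrete, the minimal sketch below times how long a model takes to answer a batch of queries and reports latency and throughput. It is an illustration only, not MLPerf's methodology; `generate_answer` is a hypothetical placeholder for whatever model or inference server is being measured.

```python
# Illustrative query-latency measurement; not the MLPerf harness.
import time
import statistics


def generate_answer(prompt: str) -> str:
    # Placeholder: call the model or inference endpoint under test here.
    return "example answer to: " + prompt


def measure_latency(prompts: list[str]) -> dict:
    """Time each query and summarize latency and rough throughput."""
    latencies = []
    for prompt in prompts:
        start = time.perf_counter()
        generate_answer(prompt)
        latencies.append(time.perf_counter() - start)
    latencies.sort()
    return {
        "mean_latency_s": statistics.mean(latencies),
        "p99_latency_s": latencies[int(0.99 * (len(latencies) - 1))],
        "queries_per_second": len(latencies) / sum(latencies),
    }


if __name__ == "__main__":
    print(measure_latency(["What does the new benchmark measure?"] * 20))
```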

One of the new benchmarks measures the speed of a question-and-answer scenario for large language models. It is based on Llama 2, a model with 70 billion parameters developed by Meta Platforms.

MLCommons officials also added a second benchmark to the MLPerf suite: a text-to-image generation test based on Stability AI's Stable Diffusion XL model.

Servers with Nvidia's H100 chips, built by companies such as Google, Supermicro, and Nvidia itself, easily won both new benchmarks on raw performance. Several server manufacturers also submitted designs based on Nvidia's less powerful L40S chip.

For the image generation benchmark, server builder Krai submitted a design using a Qualcomm AI chip that draws far less power than Nvidia's cutting-edge processors.

Intel also submitted a design based on its Gaudi2 accelerator chips. The company described the results as “strong.”

Raw performance is not the only measure that matters when deploying AI applications. Advanced AI chips consume huge amounts of power, and one of the biggest challenges facing AI companies is building chips that deliver the most performance for the least power.
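As a rough illustration of why efficiency matters alongside raw speed, the sketch below divides throughput by power draw. The figures are invented for the example, not measured MLPerf results.

```python
# Performance-per-watt arithmetic with made-up example numbers.
def queries_per_joule(queries_per_second: float, watts: float) -> float:
    """Throughput divided by power draw: higher means more work per unit of energy."""
    return queries_per_second / watts


# A faster chip is not automatically the more efficient one once power is considered.
fast_chip = queries_per_joule(queries_per_second=100.0, watts=700.0)   # ~0.14
frugal_chip = queries_per_joule(queries_per_second=40.0, watts=150.0)  # ~0.27
print(fast_chip, frugal_chip)
```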


MLCommons has a separate benchmark category for measuring power consumption. (Reporting by Max A. Cherney in San Francisco; Editing by Jamie Freed)