NeuReality announces power-saving AI appliance

News
Jul 30, 2024 · 3 mins
CPUs and Processors, Data Center

NeuReality claims its AI appliance can significantly cut costs and energy use in AI data centers.

Credit: Olivier Le Moal/Shutterstock

AI appliance vendor NeuReality has announced that its NR1-S solution significantly boosts the output of CPU- and GPU-based systems, cutting energy costs and improving energy efficiency by taking over the work normally done by the CPU.

The news came in a blog post with the results of multiple tests. NeuReality compared its NR1-S inference appliance paired with Qualcomm Cloud AI 100 Ultra and Pro accelerators against traditional CPU-centric inference servers with Nvidia H100 or L40S GPU cards. The NR1-S demonstrated significantly better cost savings and energy efficiency than the CPU-centric systems running the same AI applications. The tests used real-world scenarios involving natural language processing (NLP), automatic speech recognition (ASR), and computer vision (CV), workloads common in medical imaging, fraud detection, customer call centers, online assistants and more.

The NR1-S takes over the work normally done by the CPUs in the system, because the CPUs aren’t fast enough to handle all the data movement generated by the GPUs, said Iddo Kadim, CTO for NeuReality.

“Systems built with CPUs simply can’t support and scale the number of accelerators that are put in the system,” he said. “The CPU becomes a data-moving machine, and unfortunately, CPUs were built to compute, not to move a ton of data back and forth. There are a number of reasons why the CPU architecture basically becomes a bottleneck.”

The appliance takes over for the CPU, greatly enhancing the throughput and scalability of the GPUs. This allows GPUs to run much faster and at greater utilization than with CPUs. When paired with Qualcomm’s AI 100 Ultra, NR1-S achieves up to 90% cost savings across various AI data types, such as image, audio and text. Along with the cost saving, the NR1-S shows up to 15 times better energy efficiency compared to traditional CPU-centric systems. Unlike traditional CPU-centric systems, NR1-S can ensure 100% utilization of the integrated AI accelerators without performance drop-offs or delays observed in today’s CPU-reliant systems, the vendor claims.

The tests also measured audio processing per watt. In a voice-to-text test, the NR1-S converted seven seconds of audio using the same amount of energy that a traditional CPU-centric system needs for 0.7 seconds, a 10-fold increase in performance for the energy used, according to NeuReality. Another audio test showed the NR1-S cutting the cost of processing 1 million seconds of audio from 43 cents to only 5 cents.
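As a quick sanity check, the reported figures can be reproduced with simple arithmetic (the per-second and per-cost values below are taken directly from NeuReality's published numbers; the variable names are illustrative):

```python
# Sanity-check of NeuReality's reported audio-processing figures.

# Seconds of audio processed for the same energy budget:
nr1s_audio_seconds = 7.0   # NR1-S, per the voice-to-text test
cpu_audio_seconds = 0.7    # traditional CPU-centric system

efficiency_gain = nr1s_audio_seconds / cpu_audio_seconds
print(f"Energy-efficiency gain: {efficiency_gain:.0f}x")  # 10x

# Cost to process 1 million seconds of audio, in USD:
cpu_cost_usd = 0.43
nr1s_cost_usd = 0.05

savings = 1 - nr1s_cost_usd / cpu_cost_usd
print(f"Cost savings: {savings:.0%}")  # roughly 88%
```

The 10x figure matches the claimed performance-per-watt improvement, and the cost numbers work out to roughly an 88% reduction for this particular audio workload, below the "up to 90%" headline figure, which NeuReality attributes to the best case across data types.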

The NR1-S works with existing accelerators, GPUs or otherwise, as long as they are PCI Express-based. It is a heterogeneous compute device with network and data-movement optimization, plus compute engines that take over the functions the CPU would normally handle.

The appliance comes with an SDK that converts the processing pipeline automatically, making deployment plug-and-play with no modifications required to the hardware or software environment.

The NR1-S appliance is available now.

Read more from Andy Patrizio: