Hewlett Packard Enterprise Debuts End-to-End AI-Native Portfolio for Generative AI

Today at NVIDIA GTC, Hewlett Packard Enterprise (NYSE: HPE) announced updates to one of the industry’s most comprehensive AI-native portfolios to advance the operationalization of generative AI (Gen...

Business Wire

Enterprise-class co-designed software and hardware solutions from HPE and NVIDIA accelerate development and deployment of GenAI applications

SAN JOSE, Calif.: Today at NVIDIA GTC, Hewlett Packard Enterprise (NYSE: HPE) announced updates to one of the industry’s most comprehensive AI-native portfolios to advance the operationalization of generative AI (GenAI), deep learning, and machine learning (ML) applications. The updates include:

Availability of two HPE and NVIDIA co-engineered full-stack GenAI solutions.
A preview of HPE Machine Learning Inference Software.
An enterprise retrieval-augmented generation (RAG) reference architecture.
Support to develop future products based on the new NVIDIA Blackwell platform.

“To deliver on the promise of GenAI and effectively address the full AI lifecycle, solutions must be hybrid by design,” said Antonio Neri, president and CEO at HPE. “From training and tuning models on-premises, in a colocation facility or the public cloud, to inferencing at the edge, AI is a hybrid cloud workload. HPE and NVIDIA have a long history of collaborative innovation, and we will continue to deliver co-designed AI software and hardware solutions that help our customers accelerate the development and deployment of GenAI from concept into production.”

“Generative AI can turn data from connected devices, data centers and clouds into insights that can drive breakthroughs across industries," said Jensen Huang, founder and CEO at NVIDIA. "Our growing collaboration with HPE will enable enterprises to deliver unprecedented productivity by leveraging their data to develop and deploy new AI applications to transform their businesses.”

Supercomputing-powered GenAI training and tuning

Announced at SC23, HPE’s supercomputing solution for generative AI is now available to order for organizations seeking a preconfigured and pretested full-stack solution for the development and training of large AI models. Purpose-built to help customers accelerate GenAI and deep learning projects, the turnkey solution is powered by NVIDIA and can support up to 168 NVIDIA GH200 Grace Hopper Superchips.

The solution enables large enterprises, research institutions, and government entities to streamline the model development process with an AI/ML software stack that helps customers accelerate GenAI and deep learning projects, including LLMs, recommender systems, and vector databases. Delivered with services for installation and set-up, this turnkey solution is designed for use in AI research centers and large enterprises to realize improved time-to-value and speed up training by 2-3X. For more information or to order it today, visit HPE's supercomputing solution for generative AI.

Enterprise-class GenAI tuning and inference

Previewed at Discover Barcelona 2023, HPE’s enterprise computing solution for generative AI is now available to customers directly or through HPE GreenLake with a flexible and scalable pay-per-use model. Co-engineered with NVIDIA, the pre-configured fine-tuning and inference solution is designed to reduce ramp-up time and costs by offering the right compute, storage, software, networking, and consulting services that organizations need to produce GenAI applications. The AI-native full-stack solution gives businesses the speed, scale and control necessary to tailor foundational models using private data and deploy GenAI applications within a hybrid cloud model.

Featuring a high-performance AI compute cluster and software from HPE and NVIDIA, the solution is ideal for lightweight fine-tuning of models, RAG, and scale-out inference. The fine-tuning time for a 70 billion parameter Llama 2 model running this solution decreases linearly with node count, taking six minutes on a 16-node system1. The speed and performance enable customers to realize faster time-to-value by improving business productivity with AI applications like virtual assistants, intelligent chatbots, and enterprise search.

Powered by HPE ProLiant DL380a Gen11 servers, the solution is pre-configured with NVIDIA GPUs, the NVIDIA Spectrum-X Ethernet networking platform, and NVIDIA BlueField-3 DPUs. The solution is enhanced by HPE’s machine learning platform and analytics software, NVIDIA AI Enterprise 5.0 software with new NVIDIA NIM microservice for optimized inference of generative AI models, as well as NVIDIA NeMo Retriever and other data science and AI libraries.

To address the AI skills gap, HPE Services experts will help enterprises design, deploy, and manage the solution, which includes applying appropriate model tuning techniques. For more information or to order it today, visit HPE’s enterprise computing solution for generative AI.

From prototype to productivity

HPE and NVIDIA are collaborating on software solutions that will help enterprises take the next step by turning AI and ML proofs-of-concept into production applications. Available to HPE customers as a technology preview, HPE Machine Learning Inference Software will allow enterprises to rapidly and securely deploy ML models at scale. The new offering will integrate with NVIDIA NIM to deliver NVIDIA-optimized foundation models using pre-built containers.

To assist enterprises that need to rapidly build and deploy GenAI applications that feature private data, HPE developed a reference architecture for enterprise RAG, available today, that is based on the NVIDIA NeMo Retriever microservice architecture. The offering consists of a comprehensive data foundation from HPE Ezmeral Data Fabric Software and HPE GreenLake for File Storage. The new reference architecture will offer businesses a blueprint to create customized chatbots, generators, or copilots.

To aid in data preparation, AI training, and inferencing, the solution merges the full spectrum of open-source tools and solutions from HPE Ezmeral Unified Analytics Software and HPE’s AI software, which includes HPE Machine Learning Data Management Software, HPE Machine Learning Development Environment Software, and the new HPE Machine Learning Inference Software. HPE’s AI software is available on both HPE’s supercomputing and enterprise computing solutions for generative AI to provide a consistent environment for customers to manage their GenAI workloads.

Next-gen solutions built on NVIDIA Blackwell platform

HPE will develop future products based on the newly announced NVIDIA Blackwell platform, which incorporates a second-generation Transformer Engine to accelerate GenAI workloads. Additional details and availability for forthcoming HPE products featuring the NVIDIA GB200 Grace Blackwell Superchip, the HGX B200, and the HGXB100 will be announced in the future.

About Hewlett Packard Enterprise

Hewlett Packard Enterprise (NYSE: HPE) is the global edge-to-cloud company that helps organizations accelerate outcomes by unlocking value from all of their data, everywhere. Built on decades of reimagining the future and innovating to advance the way people live and work, HPE delivers unique, open, and intelligent technology solutions as a service. With offerings spanning Cloud Services, Compute, High Performance Computing & AI, Intelligent Edge, Software, and Storage, HPE provides a consistent experience across all clouds and edges, helping customers develop new business models, engage in new ways, and increase operational performance. For more information, visit: www.hpe.com

1 Based on initial internal benchmarks of llama-recipes finetuning.py that tracked the average epoch time to fine-tune eight nodes at 594 seconds and 16 nodes at 369 seconds with flash attention and parameter efficient fine-tuning.

Fonte: Business Wire

Last News

Sparkle works on environmentally sustainable content distribution

The Italian company partners with MainStreaming for high-performance, energy-efficient video streaming

Libraesva: being specialized is ok again in cybersecurity

Software vendors developing vertical solutions against specific attack vectors are 'cool' again. And when it comes to email security, all companies now…

Fintech: Links tests the use of exponential technologies in the banking…

Links Management and Technology just concluded the testing phase of a research project focused on banking transformation

Axyon AI: Italian Artificial Intelligence for Finance applications

Axyon AI offers an AI platform specifically designed for asset management, with several interesting strengths for those approaching machine/deep learning…

G11 Media Networks

InnovationOpenLab is a channel of BitCity, a newspaper registered at the court of Como ,
n. 21/2007 del 11/10/2007- Registration ROC n. 15698

G11 MEDIA S.R.L. Registered office Via NUOVA VALASSINA, 4 22046 MERONE (CO) - P.IVA/C.F.03062910132 Como business register n. 03062910132 - REA n. 293834 CAPITALE SOCIALE Euro 30.000 i.v.

Global Business VoIP Services Strategic Market Report 2024-2030: Shifting of Voice Traffic Away from PSTN to Mobile and Internet Networks Paves the Way for the Emergence of VoIP - ResearchAndMarkets.com

Bakkt Schedules Conference Call to Discuss Second Quarter 2024 Results

Globalstar, Inc. Earnings Release and Call Notice

Federman & Sherwood Investigates The Medibase Group, Inc. on behalf of Self Regional Healthcare for Data Breach

Veteran Senior Product Leader Allison Lambert-Morton Joins CareHive

Vívaro Providing Live Media Transport Services at the Paris 2024 Summer Olympics for Leading Broadcasters

Offer Update

LambdaTest Unveils Live Inspect for Enhanced App Automation Testing

Hewlett Packard Enterprise Debuts End-to-End AI-Native Portfolio for Generative AI

Related news

Last News

Sparkle works on environmentally sustainable content distribution

Libraesva: being specialized is ok again in cybersecurity

Fintech: Links tests the use of exponential technologies in the banking…

Axyon AI: Italian Artificial Intelligence for Finance applications

Most read

Transact Campus Rolls Out Mobile Credential Technology at the University…

New Emburse Research Finds Strong Link Between Managing Travel Spend and…

Keysight and University of Malaga’s MobileNet join forces to Accelerate…

Emburse Unveils Next Generation of Travel and Expense Technology at GBTA…

G11 Media Networks

Hewlett Packard Enterprise Debuts End-to-End AI-Native Portfolio for Generative AI

Related news

Last News

Most read

Newsletter signup

G11 Media Networks