NVIDIA Introduces Generative AI Foundry Service on Microsoft Azure

The NVIDIA AI foundry service uses three elements—NVIDIA AI Foundation Models, NVIDIA NeMo framework and tools, and NVIDIA DGX Cloud AI supercomputing services.

Businesses can deploy their customized models with NVIDIA AI Enterprise software to power generative AI applications. Image courtesy of NVIDIA.

By DE Editors

November 17, 2023

NVIDIA has introduced an artificial intelligence foundry service for the development and tuning of custom generative AI applications for enterprises and startups deploying on Microsoft Azure.

The NVIDIA AI foundry service pulls together three elements—a collection of NVIDIA AI Foundation Models, NVIDIA NeMo framework and tools, and NVIDIA DGX Cloud AI supercomputing services—that give enterprises an end-to-end solution for creating custom generative AI models, the company reports. Businesses can then deploy their customized models with NVIDIA AI Enterprise software to power generative AI applications, including intelligent search, summarization and content generation.

“Enterprises need custom models to perform specialized skills trained on the proprietary DNA of their company—their data,” says Jensen Huang, founder and CEO of NVIDIA. “NVIDIA’s AI foundry service combines our generative AI model technologies, LLM training expertise and giant-scale AI factory. We built this in Microsoft Azure so enterprises worldwide can connect their custom model with Microsoft’s world-leading cloud services.”

“Our partnership with NVIDIA spans every layer of the Copilot stack—from silicon to software—as we innovate together for this new age of AI,” says Satya Nadella, chairman and CEO of Microsoft. “With NVIDIA’s generative AI foundry service on Microsoft Azure, we’re providing new capabilities for enterprises and startups to build and deploy AI applications on our cloud.”

NVIDIA’s AI foundry service can be used to customize models for generative AI-powered applications across industries, including enterprise software. Once the models are ready to deploy, enterprises can use a technique called retrieval-augmented generation (RAG) to connect them with their enterprise data and access new insights.
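
For readers unfamiliar with RAG, the idea can be sketched in a few lines of Python. This is a minimal illustration, not NVIDIA's implementation: the sample documents and the function names are hypothetical, and simple word-overlap scoring stands in for the embedding-based search that production systems typically use. The call to the language model itself is left out; the sketch only shows how retrieved passages are attached to a question before it is sent to a model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=2):
    """Return the k documents most similar to the query (the retrieval step)."""
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:k]]


def build_prompt(query, documents, k=2):
    """Prepend the retrieved passages to the question (the augmentation step)."""
    context = "\n".join(f"- {d}" for d in retrieve(query, documents, k))
    return f"Use only this context:\n{context}\n\nQuestion: {query}"


# Hypothetical enterprise snippets, invented purely for illustration.
DOCS = [
    "The warranty period for the X200 pump is 24 months.",
    "Travel expenses must be filed within 30 days.",
    "The X200 pump requires servicing every 6 months.",
]

QUESTION = "What is the warranty period for the X200 pump?"
prompt = build_prompt(QUESTION, DOCS, k=1)
print(prompt)
```

The resulting prompt would then be passed to the customized model, which answers from the supplied company data instead of relying only on what it learned during training.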

Curated, Optimized Models

Customers using the NVIDIA foundry service can pick from several NVIDIA AI Foundation models, including a family of NVIDIA Nemotron-3 8B models hosted in the Azure AI model catalog. Developers can also access the Nemotron-3 8B models on the NVIDIA NGC catalog, as well as community models such as Meta’s Llama 2 models optimized for NVIDIA accelerated computing, which are also coming soon to the Azure AI model catalog.

Featuring 8 billion parameters, the Nemotron-3 8B family includes versions tuned for different use cases and has multilingual capabilities for building custom enterprise generative AI applications.

NVIDIA DGX Cloud Now Available

NVIDIA DGX Cloud AI supercomputing is available today on Azure Marketplace. It features instances customers can rent, scaling to thousands of NVIDIA Tensor Core GPUs, and comes with NVIDIA AI Enterprise software, including NeMo, to speed LLM customization.


About the Author

DE Editors

DE’s editors contribute news and new product announcements to Digital Engineering.
Press releases may be sent to them via DE-Editors@digitaleng.news.

Source: Digital Engineering, https://www.digitalengineering247.com/article/nvidia-introduces-generative-ai-foundry-service-on-microsoft-azure/cloud-computing (last updated February 4, 2025)
