OpenNebula's AI Infrastructure Capabilities
This session explores how OpenNebula's cloud orchestration platform supports the computational demands of modern AI workloads, particularly Large Language Models (LLMs). João Pita Costa, Senior Technologist in AI at OpenNebula, introduces the platform's Enhanced Platform Awareness (EPA) features, which enable fine-grained matching of processor capabilities to VMs and Kubernetes workloads. The platform supports GPU passthrough for direct hardware access, NVIDIA vGPU for sharing a physical GPU across multiple VMs, and PCI device management for allocating specialized hardware. OpenNebula's architecture lets organizations deploy AI processing clusters with optimized scheduling, placing execution closer to data sources to reduce transfer times and bandwidth usage. The platform also supports a 'Cloud for AI' paradigm in which users provision LLM virtual servers with pre-trained models as a service.
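To make the passthrough and pinning features concrete, the fragment below sketches what a GPU-enabled OpenNebula VM template can look like. This is an illustrative example, not taken from the session: the sizing values are placeholders, and the PCI IDs simply select NVIDIA 3D-controller devices by vendor and class.

```
# Hypothetical OpenNebula VM template fragment (values are placeholders).
CPU    = "8"
VCPU   = "8"
MEMORY = "65536"

# GPU passthrough: match a host PCI device by vendor/class (hex, no "0x").
PCI = [
  VENDOR = "10de",   # NVIDIA
  CLASS  = "0302" ]  # 3D controller

# CPU pinning, one of the EPA-style placement controls.
TOPOLOGY = [
  PIN_POLICY = "CORE",
  SOCKETS    = "1" ]
```

Matching by vendor/class rather than an exact device ID lets the scheduler pick any suitable GPU on the chosen host.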
Production AI Platforms: Iguane Solutions Case Study
Jean-Philippe Foures, VP of Products at Iguane Solutions, details how his company uses OpenNebula to build AI platforms for customers who cannot use public cloud services. Their architecture spans multiple layers: infrastructure (GPU servers with NVIDIA H100/H200, StorPool storage, VXLAN networking), cloud orchestration (OpenNebula with GPU passthrough and CPU pinning), LLM core services (model servers, API proxies, and an observability stack), and applications (private copilots, web search, and no-code development tools). Iguane Solutions has run OpenNebula in production for over six years, managing platforms that expose OpenAI-compatible APIs while ensuring data privacy and intellectual-property protection. Internal use cases include a private GitHub Copilot alternative that scans codebases for context-aware suggestions, automated documentation maintenance, and AI-generated monthly customer reports.
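An OpenAI-compatible API means clients talk to the private platform with the same request shape they would send to OpenAI. The sketch below builds such a request body; the endpoint URL and model name are hypothetical placeholders, not Iguane Solutions' actual values.

```python
import json

# Hypothetical endpoint of a self-hosted, OpenAI-compatible API proxy.
API_URL = "https://llm.example.internal/v1/chat/completions"

def build_chat_request(model: str, prompt: str, temperature: float = 0.2) -> dict:
    """Build the JSON body for an OpenAI-compatible chat completion call."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
    }

payload = build_chat_request("private-llm-70b", "Summarize last month's incidents.")
body = json.dumps(payload)  # would be POSTed to API_URL with an auth header
```

Because the wire format is unchanged, existing OpenAI SDKs and tools can be pointed at the private endpoint by overriding the base URL.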
Multi-Site AI Testbed: AI Sweden Implementation
Kim Henriksson, SVP for Technology, Innovation and Ecosystems at AI Sweden, describes how the Swedish National Center for Applied AI uses OpenNebula to manage on-premises infrastructure across three physical locations. The organization is a neutral nonprofit that facilitates collaboration among the private sector, the public sector, and academia, providing hands-on access to AI infrastructure for experimentation and learning. AI Sweden's testbed handles heterogeneous hardware from multiple vendors, GPUs from NVIDIA, AMD, and Intel, and diverse edge devices, which demands extreme flexibility in orchestration. The platform supports multiple concurrent projects with different frameworks and workloads, using OpenNebula as an abstraction layer between hardware and software. Key capabilities that make this manageable include easy host addition and removal, template management, SSH contextualization, live migration, VM resizing, and GPU passthrough. Administrators come from both AI Sweden and partner organizations, which makes intuitive configuration and management especially important.
Strategic Considerations and Future Directions
Both speakers emphasize data sovereignty as a critical driver for private AI infrastructure in Europe, with organizations seeking to keep control over their data and models rather than relying on public cloud providers. Iguane Solutions is working with OpenNebula on native integrations, including AI-specific appliances, an equivalent of the NVIDIA Kubernetes device plugin, and unified metrics dashboards. The company positions its offering as a plug-and-play private AI stack with infrastructure management, analytics, and user-facing features included. AI Sweden highlights OpenNebula's price competitiveness, noting that it alternates between the open-source community edition and the enterprise edition depending on requirements. The session concludes with a discussion of key metrics for AI platforms: model usage patterns, query success rates, and system resource utilization for capacity planning.
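The closing metrics reduce to simple arithmetic that any monitoring pipeline can compute. The sketch below shows two of them, query success rate and mean utilization; the input figures are invented for illustration.

```python
def success_rate(total_queries: int, failed_queries: int) -> float:
    """Fraction of queries that completed successfully."""
    if total_queries == 0:
        return 0.0
    return (total_queries - failed_queries) / total_queries

def avg_utilization(samples: list[float]) -> float:
    """Mean resource utilization over a sampling window (values in 0..1)."""
    return sum(samples) / len(samples) if samples else 0.0

rate = success_rate(10_000, 250)            # → 0.975
util = avg_utilization([0.60, 0.80, 0.70])  # ≈ 0.70
```

Tracked over time, a rising utilization trend against a flat success rate is the basic capacity-planning signal: add hardware before the success rate starts to drop.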