Truth in IT
    • Sign In
    • Register
        • Videos
        • Channels
        • Pages
        • Galleries
        • News
        • Events
        • All
Truth in IT Truth in IT
  • Data Management ▼
    • Converged Infrastructure
    • DevOps
    • Networking
    • Storage
    • Virtualization
  • Cybersecurity ▼
    • Application Security
    • Backup & Recovery
    • Data Security
    • Identity & Access Management (IAM)
    • Zero Trust
    • Compliance & GRC
    • Endpoint Security
  • Cloud ▼
    • Hybrid Cloud
    • Private Cloud
    • Public Cloud
  • Webinar Library
  • TiPs
  • DRAW

OneLLM: Native AI Inference in OpenNebula Sunstone

Open Nebula
06/13/2026
0 (0%)
Share
  • Comments
  • Download
  • Transcript
Report Like Favorite
  • Share/Embed
  • Email
Link
Embed

Transcript


In this screencast, we will show the preview of the upcoming 1LLM feature that brings the AI Inference directly into the platform. Running AI Inference workloads on-premises might easily become challenging. Both the administrators and users often struggle with the lack of unification across the process, finding themselves managing GPU servers, model weights, and inference software outside of the graphical UI. In the upcoming release of Open Nebula, we are addressing these challenges by adding an AI Inference section directly to Sunstone, a single place to define hardware profiles, curate AI models, and deploy production-ready models with just a few clicks. 1LLM is a new section inside Sunstone that brings AI Inference into the platform. No external tools, no separate infrastructure to manage. There are two perspectives in this demonstration. First, the admin, who sets the things up, then the tenant, who puts them to work. As an admin, you start by defining instance types. Each one specifies the GPU, the VRAM, the compute tier, and the model it can handle. Small for lightweight use, medium for most workloads, large for the biggest models. Every type is fully specified. H100, 10 gigs of VRAM, the exact model size range it supports. Define it once, reuse it everywhere. The second thing an admin manages is the model catalog, the library of AI models available inside the data center. Models are downloaded, versioned, and access controlled. Tenants only see what's ready. The admin decides what's available. From here, this is what a tenant experiences. They pick a ready model, pick their instance type, and deploy. OpenAbility handles the rest. It provisions the virtual machine, loads the weights, and starts the Inference engine. No manual steps, no SSH, no scripts. When it's live, the tenant gets an OpenAI-compatible API endpoint. Any app already leveraging the OpenAI SDK connects with zero code changes. And they can test it right here, inside Sunstone. A live conversation with the model. No external tooling needed. Admin sets it up, tenant puts it to work. One platform, from the bare metal, to a virtual machine. And they can test it right here, inside Sunstone. A live conversation with the model. Admin sets it up, tenant puts it to work. One platform, from the bare metal, to a live AI endpoint. And this concludes this feature preview demonstration. Thank you for watching, and see you in the next screencast.

TL;DR

  • OneLLM brings native AI inference capabilities into OpenNebula's Sunstone GUI, eliminating the need for external tools or separate infrastructure management for on-premises AI workloads.
  • Administrators define reusable hardware profiles specifying GPU types, VRAM, and supported model sizes, then curate a versioned model catalog with granular access controls for tenant consumption.
  • Tenants deploy production-ready inference endpoints by selecting pre-configured models and instance types, with OpenNebula automatically provisioning VMs, loading weights, and exposing OpenAI-compatible APIs for zero-code integration.

Summary

This demonstration previews OneLLM, an upcoming OpenNebula feature that integrates AI inference capabilities directly into the Sunstone GUI. The feature addresses common challenges organizations face when running on-premises AI workloads by eliminating the need to manage GPU servers, model weights, and inference software outside the platform. OneLLM provides a unified interface where administrators can define hardware profiles with specific GPU and VRAM configurations, curate AI model catalogs with version control and access management, and enable tenants to deploy production-ready inference endpoints with minimal configuration. The system provisions virtual machines, loads model weights, and exposes OpenAI-compatible API endpoints that work with existing SDK integrations, allowing organizations to run AI inference workloads alongside their existing cloud and edge infrastructure without external tooling or manual intervention.

Chapters

0:00 - Introduction to OneLLM
0:52 - Admin Perspective: Instance Types
1:35 - Admin Perspective: Model Catalog
1:49 - Tenant Workflow: Deployment

Key Quotes

0:16 "Both the administrators and users often struggle with the lack of unification across the process, finding themselves managing GPU servers, model weights, and inference software outside of the graphical UI."
0:33 "We are addressing these challenges by adding an AI Inference section directly to Sunstone, a single place to define hardware profiles, curate AI models, and deploy production-ready models with just a few clicks."
2:14 "Any app already leveraging the OpenAI SDK connects with zero code changes."

FAQ

What problem does OneLLM solve for organizations running AI workloads on-premises?

OneLLM addresses the lack of unification in managing on-premises AI inference by bringing GPU server management, model weights, and inference software directly into OpenNebula's Sunstone GUI. This eliminates the need to manage these components separately outside the platform, providing a single interface for defining hardware profiles, curating AI models, and deploying production-ready endpoints.

How does OneLLM handle compatibility with existing AI applications?

OneLLM exposes OpenAI-compatible API endpoints for deployed models, allowing any application already using the OpenAI SDK to connect with zero code changes. This ensures seamless integration with existing AI workflows and tooling without requiring custom development or API adaptation.


Categories:
  • » Cybersecurity » Cloud Security
  • » Data Protection
Channels:
News:
Events:
Tags:
  • AI & Machine Learning
  • Cloud Security
  • Technical Deep Dive
  • Demo
  • Getting Started
  • AI inference
  • GPU resource management
  • LLM deployment
  • on-premises AI infrastructure
  • model catalog management
  • OpenAI API compatibility
  • cloud management platform
  • multi-tenancy
Show more Show less

Browse videos

  • Related
  • Featured
  • By date
  • Most viewed
  • Top rated
  •  

              Video's comments: OneLLM: Native AI Inference in OpenNebula Sunstone

              Upcoming Webinar Calendar

              • 06/17/2026
                12:00 PM
                06/17/2026
                Action1: The Remediation Gap: Vulnerability Management in the Age of AI
                https://www.truthinit.com/index.php/channel/2010/action1-the-remediation-gap-vulnerability-management-in-the-age-of-ai/
              • 06/23/2026
                01:00 PM
                06/23/2026
                The AI-Powered VMware Alternative
                https://www.truthinit.com/index.php/channel/2009/the-ai-powered-vmware-alternative/
              • 06/24/2026
                11:00 AM
                06/24/2026
                LATAM: Accelerating Insights on AI Through an Engaging Webinar Series
                https://www.truthinit.com/index.php/channel/2012/accelerating-insights-on-ai-through-an-engaging-webinar-series/
              • 06/25/2026
                01:00 PM
                06/25/2026
                Generative AI Security: Preventing AI from Becoming a Data Breach Multiplier
                https://www.truthinit.com/index.php/channel/1998/generative-ai-security-preventing-ai-from-becoming-a-data-breach-multiplier/
              • 06/30/2026
                01:00 PM
                06/30/2026
                Master Active Directory Certificate Services for Long-term Success
                https://www.truthinit.com/index.php/channel/2018/master-active-directory-certificate-services-for-long-term-success/
              • 07/01/2026
                04:00 AM
                07/01/2026
                Integrating Security in AI: Automated Red Teaming Strategies for Private Models
                https://www.truthinit.com/index.php/channel/1969/integrating-security-in-ai-automated-red-teaming-strategies-for-private-models/
              • 07/01/2026
                04:00 AM
                07/01/2026
                Schutz von KI in Anwendungen, Agenten und APIs.
                https://www.truthinit.com/index.php/channel/2008/schutz-von-ki-in-anwendungen-agenten-und-apis/
              • 07/01/2026
                01:00 PM
                07/01/2026
                Stop Your AI from Controlling You: Strategies for Retaining Power
                https://www.truthinit.com/index.php/channel/2021/stop-your-ai-from-controlling-you-strategies-for-retaining-power/
              • 07/02/2026
                10:00 AM
                07/02/2026
                When the cloud goes dark: Resilience lessons from hybrid threats
                https://www.truthinit.com/index.php/channel/2011/resilience-insights-from-hybrid-threats-when-the-cloud-faces-challenges/
              • 07/14/2026
                11:00 AM
                07/14/2026
                In-Depth Analysis of the Latest Features in Netwrix 1Secure
                https://www.truthinit.com/index.php/channel/2014/in-depth-analysis-of-the-latest-features-in-netwrix-1secure/
              • 07/21/2026
                04:00 AM
                07/21/2026
                Strategies for Managing AI Governance and Securing App-to-LLM API Traffic
                https://www.truthinit.com/index.php/channel/1967/strategies-for-managing-ai-governance-and-securing-app-to-llm-api-traffic/
              • 07/22/2026
                06:30 AM
                07/22/2026
                Insights and Strategies for Effective Data Privacy and Protection Practices
                https://www.truthinit.com/index.php/channel/2000/insights-and-strategies-for-effective-data-privacy-and-protection-practices/
              • 07/29/2026
                04:00 AM
                07/29/2026
                Real-Time Strategies for Safeguarding Against Prompt Injections
                https://www.truthinit.com/index.php/channel/1968/real-time-strategies-for-safeguarding-against-prompt-injections/
              • 09/30/2026
                04:00 AM
                09/30/2026
                Shadow AI, MCP, and Emerging Risks of Artificial Intelligence
                https://www.truthinit.com/index.php/channel/2024/shadow-ai-mcp-and-emerging-risks-of-artificial-intelligence/

              Upcoming Events

              • Jun
                17

                Action1: The Remediation Gap: Vulnerability Management in the Age of AI

                06/17/202612:00 PM ET
                • Jun
                  23

                  The AI-Powered VMware Alternative

                  06/23/202601:00 PM ET
                  • Jun
                    24

                    LATAM: Accelerating Insights on AI Through an Engaging Webinar Series

                    06/24/202611:00 AM ET
                    • Jun
                      25

                      Generative AI Security: Preventing AI from Becoming a Data Breach Multiplier

                      06/25/202601:00 PM ET
                      • Jun
                        30

                        Master Active Directory Certificate Services for Long-term Success

                        06/30/202601:00 PM ET
                        More events
                        Truth in IT
                        • Sponsor
                        • About Us
                        • Terms of Service
                        • Privacy Policy
                        • Contact Us
                        • Preference Management
                        Desktop version
                        Standard version