Truth in IT
    • Sign In
    • Register
        • Videos
        • Channels
        • Pages
        • Galleries
        • News
        • Events
        • All
Truth in IT Truth in IT
  • Data Management ▼
    • Converged Infrastructure
    • DevOps
    • Networking
    • Storage
    • Virtualization
  • Cybersecurity ▼
    • Application Security
    • Backup & Recovery
    • Data Security
    • Identity & Access Management (IAM)
    • Zero Trust
    • Compliance & GRC
    • Endpoint Security
  • Cloud ▼
    • Hybrid Cloud
    • Private Cloud
    • Public Cloud
  • Webinar Library
  • TiPs
  • DRAW

Veeam: Backup & Recovery for PGVector Databases on OpenShift

Veeam
06/16/2026
0 (0%)
Share
  • Comments
  • Download
  • Transcript
Report Like Favorite
  • Share/Embed
  • Email
Link
Embed

Transcript


with Veeam Kasten. This environment is an OpenShift cluster running container and VM workloads. For this demo, the key workload is a cloud-native Postgres cluster with the PG Vector extension enabled. PG Vector is commonly used by AI applications to store embeddings for Retrieval Augmented Generation, or RAG. The database is running as a highly available Postgres cluster in its own namespace. Kasten is also installed on the cluster and has already discovered the applications running across OpenShift. Now we'll protect the PG Vector workload. In the Kasten UI, I'll open the policy for the PG Vector demo and run it on demand. I'll also set an expiration date so this recovery point does not live forever and consume unnecessary storage. This policy is doing more than a basic snapshot. It is configured to protect the PG Vector namespace, create a backup, and export that backup off the cluster to object storage. That gives us a real recovery point outside the local OpenShift environment. The important part is application consistency. This policy uses a Kasten blueprint to run a pre- and post-action around the backup. In this case, the backup is taken from the secondary Postgres replica, not the primary. The pre-action pauses the replica, Kasten captures the data, and the post-action resumes it. That helps avoid impact to the production writer while still giving us a clean backup of the database. Now the backup has completed successfully. Before we simulate a failure, let's confirm the database has data. Here we can see document embeddings stored in PG Vector. Next we'll simulate a bad event. This could be an accidental deletion, a rogue administrator, or even an AI-driven workflow that changes or deletes the wrong data. I'll drop the embeddings table and confirm the data is gone. At this point, the RAG application would lose access to the context stored in the vector database. Now we'll recover with Kasten. I'll go to the restore points for the PG Vector demo and select the most recent recovery point. For this demo, I'll use the local restore point, but the exported copy could also be used if the cluster copy was unavailable. During restore, we use the same blueprint to run a before action that deletes the cluster resource first. This is important because the operator will otherwise try to recreate pods and persistent volumes while Kasten is restoring them. Removing the cluster resource, or CR, first allows Kasten to perform a clean recovery. Kasten restores the Kubernetes metadata and persistent volumes, then the Postgres cluster comes back online. Finally, we'll verify the result. The restore is completed successfully, the Postgres cluster is running again, and the PG Vector embeddings are back. This is how Veeam Kasten can protect and recover PG Vector databases running on OpenShift AI, including application-consistent backup, off-cluster export, and clean recovery for AI and RAG workloads.

TL;DR

  • Veeam Kasten provides application-aware backup for PGVector databases on OpenShift, using blueprints to orchestrate pre/post-actions that ensure consistency without impacting production workloads.
  • The solution backs up from secondary Postgres replicas rather than primary instances, pausing replication during snapshot capture to maintain data integrity for AI RAG applications.
  • Recovery includes automated handling of Kubernetes operator conflicts by removing cluster resources before restoration, enabling clean recovery of both metadata and persistent volumes for AI workloads.

Summary

This technical demonstration shows how Veeam Kasten protects PGVector databases running on Red Hat OpenShift for AI Retrieval Augmented Generation (RAG) workloads. The demo walks through creating an application-consistent backup of a cloud-native Postgres cluster with the PGVector extension enabled, which stores embeddings critical to AI applications. Kasten's approach uses blueprints to orchestrate pre- and post-backup actions that pause the secondary Postgres replica during backup to avoid production impact while ensuring data consistency. The demonstration includes simulating a data loss event by dropping the embeddings table, then performing a complete recovery using Kasten's restore capabilities. The solution exports backups off-cluster to object storage for true disaster recovery protection, and handles the complexity of Kubernetes operator-managed workloads by removing cluster resources before restoration to prevent conflicts during recovery.

Chapters

0:00 - Environment Overview
0:42 - Creating Application-Consistent Backup
1:35 - Verifying Data and Simulating Failure
2:04 - Restoring PGVector Database

Key Quotes

1:10 "The important part is application consistency."
1:18 "In this case, the backup is taken from the secondary Postgres replica, not the primary."
1:29 "That helps avoid impact to the production writer while still giving us a clean backup of the database."

FAQ

Why does Kasten back up from the secondary Postgres replica instead of the primary?

Backing up from the secondary replica allows Kasten to pause replication and capture a consistent snapshot without impacting the primary database that is actively serving production AI application writes. This approach maintains application performance while ensuring backup consistency.

What happens if the local OpenShift cluster is completely unavailable during a disaster?

Kasten exports backups off-cluster to object storage, creating a true recovery point outside the local OpenShift environment. This exported copy can be used for disaster recovery even if the entire cluster is lost, providing protection beyond local snapshots.


Categories:
  • » Data Protection » Backup & Recovery
  • » Cybersecurity » Cloud Security
  • » Data Protection
Channels:
News:
Events:
Tags:
  • Data Protection
  • Cloud Security
  • AI & Machine Learning
  • Demo
  • Technical Deep Dive
  • Kubernetes backup
  • AI workload protection
  • PGVector database
  • Red Hat OpenShift
  • RAG applications
  • Application-consistent backup
  • Disaster recovery
  • Vector database protection
Show more Show less

Browse videos

  • Related
  • Featured
  • By date
  • Most viewed
  • Top rated
  •  

              Video's comments: Veeam: Backup & Recovery for PGVector Databases on OpenShift

              Upcoming Webinar Calendar

              • 06/17/2026
                12:00 PM
                06/17/2026
                Action1: The Remediation Gap: Vulnerability Management in the Age of AI
                https://www.truthinit.com/index.php/channel/2010/action1-the-remediation-gap-vulnerability-management-in-the-age-of-ai/
              • 06/23/2026
                01:00 PM
                06/23/2026
                The AI-Powered VMware Alternative
                https://www.truthinit.com/index.php/channel/2009/the-ai-powered-vmware-alternative/
              • 06/24/2026
                11:00 AM
                06/24/2026
                LATAM: Accelerating Insights on AI Through an Engaging Webinar Series
                https://www.truthinit.com/index.php/channel/2012/accelerating-insights-on-ai-through-an-engaging-webinar-series/
              • 06/25/2026
                01:00 PM
                06/25/2026
                Generative AI Security: Preventing AI from Becoming a Data Breach Multiplier
                https://www.truthinit.com/index.php/channel/1998/generative-ai-security-preventing-ai-from-becoming-a-data-breach-multiplier/
              • 06/30/2026
                01:00 PM
                06/30/2026
                Master Active Directory Certificate Services for Long-term Success
                https://www.truthinit.com/index.php/channel/2018/master-active-directory-certificate-services-for-long-term-success/
              • 07/01/2026
                04:00 AM
                07/01/2026
                Integrating Security in AI: Automated Red Teaming Strategies for Private Models
                https://www.truthinit.com/index.php/channel/1969/integrating-security-in-ai-automated-red-teaming-strategies-for-private-models/
              • 07/01/2026
                04:00 AM
                07/01/2026
                Schutz von KI in Anwendungen, Agenten und APIs.
                https://www.truthinit.com/index.php/channel/2008/schutz-von-ki-in-anwendungen-agenten-und-apis/
              • 07/01/2026
                01:00 PM
                07/01/2026
                Stop Your AI from Controlling You: Strategies for Retaining Power
                https://www.truthinit.com/index.php/channel/2021/stop-your-ai-from-controlling-you-strategies-for-retaining-power/
              • 07/02/2026
                10:00 AM
                07/02/2026
                When the cloud goes dark: Resilience lessons from hybrid threats
                https://www.truthinit.com/index.php/channel/2011/resilience-insights-from-hybrid-threats-when-the-cloud-faces-challenges/
              • 07/14/2026
                11:00 AM
                07/14/2026
                In-Depth Analysis of the Latest Features in Netwrix 1Secure
                https://www.truthinit.com/index.php/channel/2014/in-depth-analysis-of-the-latest-features-in-netwrix-1secure/
              • 07/21/2026
                04:00 AM
                07/21/2026
                Strategies for Managing AI Governance and Securing App-to-LLM API Traffic
                https://www.truthinit.com/index.php/channel/1967/strategies-for-managing-ai-governance-and-securing-app-to-llm-api-traffic/
              • 07/22/2026
                06:30 AM
                07/22/2026
                Insights and Strategies for Effective Data Privacy and Protection Practices
                https://www.truthinit.com/index.php/channel/2000/insights-and-strategies-for-effective-data-privacy-and-protection-practices/
              • 07/29/2026
                04:00 AM
                07/29/2026
                Real-Time Strategies for Safeguarding Against Prompt Injections
                https://www.truthinit.com/index.php/channel/1968/real-time-strategies-for-safeguarding-against-prompt-injections/
              • 09/30/2026
                04:00 AM
                09/30/2026
                EMEA: Shadow AI, MCP, and Emerging Risks of Artificial Intelligence
                https://www.truthinit.com/index.php/channel/2024/shadow-ai-mcp-and-emerging-risks-of-artificial-intelligence/

              Upcoming Events

              • Jun
                17

                Action1: The Remediation Gap: Vulnerability Management in the Age of AI

                06/17/202612:00 PM ET
                • Jun
                  23

                  The AI-Powered VMware Alternative

                  06/23/202601:00 PM ET
                  • Jun
                    24

                    LATAM: Accelerating Insights on AI Through an Engaging Webinar Series

                    06/24/202611:00 AM ET
                    • Jun
                      25

                      Generative AI Security: Preventing AI from Becoming a Data Breach Multiplier

                      06/25/202601:00 PM ET
                      • Jun
                        30

                        Master Active Directory Certificate Services for Long-term Success

                        06/30/202601:00 PM ET
                        More events
                        Truth in IT
                        • Sponsor
                        • About Us
                        • Terms of Service
                        • Privacy Policy
                        • Contact Us
                        • Preference Management
                        Desktop version
                        Standard version