How Large Language Models Generate Infrastructure Code
Roxane Fischer, CEO of AnyShift.io and former AI researcher, explains the fundamental mechanics of how LLMs work and their application to infrastructure as code. Neural networks learn patterns through training on massive datasets, encoding information into mathematical representations that enable them to predict the next most likely token in a sequence. When applied to Terraform code generation, these models use probabilistic prediction to suggest configurations based on patterns learned from public repositories. However, the presentation reveals a critical limitation: infrastructure code is severely underrepresented in training data, with only about 2 million HCL files on GitHub versus more than 32 million Python files, a sixteenfold gap. This data scarcity means models often miss best practices, generate hard-coded values instead of proper resource dependencies, and lack the live context of production infrastructure that would enable them to generate enterprise-grade configurations.
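The next-token mechanism described above can be illustrated with a deliberately tiny sketch: a bigram model that counts which token most often follows another in a toy corpus of Terraform-like lines. The corpus, tokenization, and counting scheme here are illustrative assumptions, not how a production LLM is built, but the principle is the same: the model emits whichever continuation is most probable given what it has seen.

```python
from collections import Counter, defaultdict

# Toy corpus of tokenized Terraform-like lines. A real model learns from
# billions of tokens with a neural network; this bigram counter only
# illustrates the idea of probabilistic next-token prediction.
corpus = [
    ["resource", "aws_instance", "web", "{"],
    ["resource", "aws_instance", "db", "{"],
    ["resource", "aws_vpc", "main", "{"],
]

# Bigram counts: context token -> Counter of tokens that follow it.
successors = defaultdict(Counter)
for line in corpus:
    for ctx, nxt in zip(line, line[1:]):
        successors[ctx][nxt] += 1

def predict_next(token):
    """Return the most probable next token and its estimated probability."""
    counts = successors[token]
    best, n = counts.most_common(1)[0]
    return best, n / sum(counts.values())

print(predict_next("resource"))  # -> ('aws_instance', 0.666...)
```

Because "aws_instance" follows "resource" in two of three lines, it wins with probability 2/3, which also makes the scarcity problem concrete: with so few HCL examples relative to Python, the learned probabilities for infrastructure patterns are simply less reliable.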
Security Risks and the Probabilistic Nature Problem
The presentation highlights serious security concerns with AI-generated infrastructure code. Because LLMs are probabilistic rather than deterministic, they can propagate vulnerabilities found in their training data, such as security group ingress rules left open to any source address (0.0.0.0/0). More concerning is the potential for adversarial attacks: if malicious actors publish modules with security flaws or malicious providers to GitHub, subsequent model retraining could incorporate these patterns, leading AI assistants to recommend compromised configurations. Fischer emphasizes that the probabilistic nature of neural networks means they never generate code with 100% certainty, making deterministic security scanning tools like Checkov and Snyk essential safeguards. The risk extends beyond simple misconfigurations to potential credential theft through malicious provider imports that models might suggest based on poisoned training data.
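The contrast between probabilistic generation and deterministic scanning can be sketched with a simplified rule in the spirit of a Checkov or Snyk policy (this is an illustrative stand-in, not either tool's actual rule engine): given HCL text, flag every ingress block whose CIDR list includes 0.0.0.0/0. Unlike a model's suggestion, the check either fires or it does not.

```python
import re

# Simplified, deterministic policy check (illustration only, not the
# real Checkov/Snyk implementation): flag ingress rules open to the
# entire internet.
OPEN_CIDR = re.compile(r'cidr_blocks\s*=\s*\[\s*"0\.0\.0\.0/0"')

def flag_open_ingress(hcl_text):
    """Return a finding for each ingress block allowing 0.0.0.0/0."""
    findings = []
    # Crude block split on the keyword; a real scanner parses HCL properly.
    for block in hcl_text.split("ingress")[1:]:
        if OPEN_CIDR.search(block):
            findings.append("ingress open to 0.0.0.0/0")
    return findings

snippet = '''
resource "aws_security_group" "web" {
  ingress {
    from_port   = 22
    to_port     = 22
    protocol    = "tcp"
    cidr_blocks = ["0.0.0.0/0"]
  }
}
'''
print(flag_open_ingress(snippet))  # -> ['ingress open to 0.0.0.0/0']
```

Running such deterministic checks in CI after any AI-assisted change is exactly the safeguard Fischer argues for: the model may be wrong with some probability, but the scanner's verdict is repeatable.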
Synthesis AI vs. Generative AI: Different Tools for Different Jobs
Fischer draws a crucial distinction between two AI paradigms in DevOps. Generative AI takes minimal input and creates new content—an open-ended process with a large solution space that's prone to hallucination and inaccuracy. Synthesis AI, by contrast, takes large amounts of existing data and extracts insights from it, offering higher accuracy because the solution is contained within the input. For infrastructure operations, synthesis AI excels at log analysis and root cause analysis, finding patterns across millions of log entries or correlating customer alerts with system logs across heterogeneous data sources. This approach is already proving valuable in tools like Google Cloud Ops AI, which can identify the needle in the haystack by recognizing patterns that human operators might miss. Understanding when to use each approach is critical for effective AI adoption in infrastructure management.
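The synthesis-style log analysis described above can be sketched in a few lines (the log lines and masking rules are invented for illustration): collapse each entry to a template by masking variable parts, count templates, and surface the rarest one. Because the answer is contained in the input data, the result is exact rather than hallucinated.

```python
import re
from collections import Counter

# Synthesis-style analysis sketch: normalize log lines into templates,
# then find the "needle in the haystack" -- the template that occurs
# least often among routine entries.
def template(line):
    line = re.sub(r"\b\d+\b", "<NUM>", line)          # mask numbers
    line = re.sub(r"\b[0-9a-f]{8,}\b", "<HEX>", line)  # mask long hex ids
    return line

logs = [
    "request 101 served in 12 ms",
    "request 102 served in 9 ms",
    "request 103 served in 11 ms",
    "disk controller reset on node 7",
    "request 104 served in 10 ms",
]

counts = Counter(template(l) for l in logs)
rarest, n = counts.most_common()[-1]
print(rarest)  # -> 'disk controller reset on node <NUM>'
```

Scaled to millions of entries and heterogeneous sources, this is the shape of work where synthesis AI shines: high accuracy, because it only extracts what is already present in the data.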
The Future: Context-Aware AI Through Graph RAG
The presentation concludes with Fischer's vision for overcoming current limitations through context-aware AI systems. The solution lies in Retrieval Augmented Generation (RAG) technology, specifically Graph RAG, which treats infrastructure as an interconnected graph of resources rather than flat documents. Traditional RAG encodes company knowledge into searchable vector representations, but infrastructure requires understanding relationships between VPCs, subnets, IAM roles, and other resources. Graph RAG constructs a knowledge graph where nodes represent resources and edges represent relationships, enabling AI to query based on actual infrastructure topology rather than simple text similarity. The core challenges are constructing meaningful relationship definitions (how a VPC connects to subnets differs from tag-based connections) and efficiently traversing this graph at query time. When combined with LLMs, this context-aware approach could finally enable AI to generate infrastructure code with proper dependencies, security configurations, and enterprise-grade practices tailored to specific environments.
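The Graph RAG retrieval step can be sketched with a minimal typed graph (all resource names and edge types here are assumed for illustration): nodes are infrastructure resources, edges encode relationships like "contains" or "assumes", and retrieval walks the topology from a starting resource instead of ranking flat documents by text similarity.

```python
# Minimal Graph RAG retrieval sketch. Resource names and relation types
# are illustrative assumptions; a real system would build this graph
# from live state (VPCs, subnets, IAM roles, ...).
edges = {
    ("vpc-main", "contains", "subnet-a"),
    ("vpc-main", "contains", "subnet-b"),
    ("subnet-a", "hosts", "instance-web"),
    ("instance-web", "assumes", "iam-role-web"),
}

def neighbors(node):
    """Return (relation, target) pairs directly reachable from node."""
    return [(rel, dst) for src, rel, dst in edges if src == node]

def retrieve_context(start, depth=2):
    """Breadth-first walk collecting relationship facts to feed an LLM."""
    facts, frontier = [], [start]
    for _ in range(depth):
        nxt = []
        for node in frontier:
            for rel, dst in neighbors(node):
                facts.append(f"{node} --{rel}--> {dst}")
                nxt.append(dst)
        frontier = nxt
    return facts

print(retrieve_context("vpc-main"))
```

The facts returned for "vpc-main" (its subnets, and what those subnets host) are exactly the topological context a plain vector search over documents would miss, and prepending them to the prompt is what lets the model reference real dependencies instead of hard-coding values.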