Amit Kothari CEO of Tallyfy, AI advisor at Blue Sheen

AI security threats: Why it is about data, not models

In brief

Most AI attacks target data through AI interfaces, not the models themselves. LayerX Security found that 77% of employees paste data into GenAI prompts with most of that activity happening through unmanaged accounts. These are the real AI security threats enterprise teams face and practical strategies to defend against them.

Amit Kothari Follow 10k+

Nov 4, 2025 · Updated Jun 12, 2026 · AI

CEO of Tallyfy · AI advisor at Blue Sheen for mid-size companies

AI security threats: Why it is about data, not models

Samsung semiconductor engineers pasted proprietary source code into ChatGPT for optimization suggestions. That code may now be part of OpenAI’s training data. Amazon sent internal warnings after noticing ChatGPT responses that looked oddly familiar to internal documentation. OpenAI had to take their service offline when a bug exposed user payment information.

None of these were attacks on AI models.

All of them were data breaches through AI interfaces.

The industry keeps debating adversarial examples and model poisoning while 77% of employees paste data into GenAI prompts, with 82% of those interactions occurring through unmanaged personal accounts. The real AI security threats enterprise teams face aren’t about the models at all. They’re about data. The same blind spot drives most compliance failures - the three real leak paths in regulated Claude deployments are observability pipelines, debug logs, and prompt caches, not the LLM itself.

The threat model most teams get backwards

Walk into any AI security discussion and someone will bring up adversarial examples. Those carefully crafted inputs that fool image classifiers into misreading stop signs as speed limits. Fascinating research. Largely irrelevant to most organizations.

IBM’s numbers are jarring: 13% of organizations have already experienced AI-related breaches. Of those compromised, 97% lacked proper AI access controls. Not advanced model attacks. Basic access control failures. One in five organizations reported a breach specifically tied to unauthorized AI tool usage, with much higher breach costs as a result. Shadow AI is making an already messy situation worse.

Steve Wilson’s OWASP Top 10 for LLMs was updated in 2025, and the shift is telling. Sensitive Information Disclosure jumped from position six to position two. Several new and reworked categories appeared, including System Prompt Leakage, Vector and Embedding Weaknesses, and an expanded Misinformation entry addressing overreliance on LLM outputs. The threat focus has moved squarely toward data exposure, not model manipulation.

Attackers aren’t spending weeks crafting perfect adversarial inputs. They’re using AI systems as new interfaces for traditional data theft. Your ChatGPT integration has access to customer records. Your Copilot instance can see internal emails. Your custom LLM processes financial documents. OWASP lists prompt injection as the number one LLM security risk precisely because it turns AI systems into data theft tools.

AI threat model showing exfiltration paths and layered defenses around corporate data

The invisible exfiltration most teams miss

This is the part that frustrates me when I watch how enterprises think about AI security.

The scale is staggering. Zscaler’s ThreatLabz tracked enterprise AI/ML transactions growing 83% year-over-year in 2025, with data transfers to AI tools rising 93% to tens of thousands of terabytes. Most of it through personal accounts. Every time someone copies data from your systems and pastes it into ChatGPT to “help summarize this document,” that data leaves your control.

LayerX Security’s report landed with a thud: AI is now the single largest uncontrolled channel for corporate data exfiltration. Bigger than shadow SaaS. Bigger than unmanaged file sharing. With 92% of enterprise AI usage concentrated in ChatGPT alone, employees are making an average of 14 pastes per day through non-corporate accounts, at least 3 containing sensitive data.

Then there are the AI agents, running around the clock and chaining tasks across multiple applications. As task-specific agents spread across enterprise software, the exfiltration surface expands faster than most security teams can map it. Can you name every AI agent currently running in your environment, and every data store it can reach?

The thing is, your traditional DLP tools look for file uploads and email attachments. Copy-paste? Invisible. Browser-based AI tools? Not monitored. The entire attack vector bypasses clunky systems built for a file-centric world.

Samsung’s engineers pasted proprietary semiconductor code into ChatGPT for optimization help. The company issued an immediate company-wide ban. Reasonable response. Also far too late.

When your firm is wrestling with this, we can talk.

Prompt injection and the supply chain no one audits

Prompt injection is basically the AI equivalent of SQL injection. Actually, it is worse than that. Harder to fix, and easier to exploit at scale.

Someone embeds malicious instructions in a document. Your AI assistant reads that document. Those hidden instructions override your system prompts. Suddenly your AI is routing data to external URLs or quietly manipulating its responses to spread false information.

Microsoft’s security team published something worth reading: indirect prompt injection is one of the most widely-used techniques in the AI security vulnerabilities they encounter. Enterprise versions of Copilot and Gemini have access to emails, document repositories, and internet content. Hidden instructions in any of those sources can compromise the entire system.

Researchers demonstrated this with Slack AI, tricking it into leaking data from private channels through carefully crafted prompts. A recent paper on hybrid AI threats describes attackers combining prompt injection with traditional exploits like XSS and CSRF, creating chains that defeat multiple security layers simultaneously.

I think the defense situation is worse than most security teams realize. A landmark study from researchers across OpenAI, Anthropic, and Google DeepMind tested 12 published prompt injection defenses and bypassed them with attack success rates above 90% for most. Human red-teamers scored 100%, defeating every defense tested. OpenAI acknowledged that “prompt injection is a long-term AI security challenge” with deterministic guarantees remaining elusive.

Microsoft uses hardened system prompts and Spotlighting techniques that reduced attack success rates from over 50% to below 2%. No complete solution exists. Layered defenses are the best available option, not a final answer. Is there a silver bullet for prompt injection? No.

Mid-2026 update: the vendors now say this out loud. Anthropic’s Claude in Chrome, a browser agent in beta for all paid plans, warns right on its product page that “Browser AI faces unique security risks, like prompt injection attacks” and tells users to start with trusted sites, avoid financial transactions, and use an “Ask before acting” review mode. When the company shipping the agent builds in a human review step, that tells you where the state of the art sits. The layered-defense point above still holds.

The supply chain question is equally uncomfortable. Turns out, your model came from somewhere. So did its training data. Both are attack surfaces most organizations never examine.

The threat is real: AI data poisoning is emerging as the new software supply chain attack, with attackers targeting training datasets and model registries at scale. Models from public registries can contain deliberately embedded biases or data exfiltration capabilities baked permanently into the model itself. The difference from prompt injection: this isn’t affecting a single session. It’s built into every interaction.

Worth noting: OWASP added Vector and Embedding Weaknesses as a new top-10 category in 2025. With most companies opting for RAG pipelines instead of fine-tuning, the vector databases powering those pipelines are now prime targets for data poisoning and manipulation.

At Tallyfy, an enterprise workflow platform, when we evaluate AI integrations, we ask: where did this model come from? Who trained it? What data did they use? Can we verify any of this? Most vendors can’t answer these questions. That’s a supply chain risk, period.

Since I wrote this, the model-side picture has shifted. Anthropic’s June 2026 Claude 5 launch split its frontier model in two: Fable 5, the generally available version, ships with separate safety classifiers that detect misuse, including jailbreak attempts, while Mythos 5, which Anthropic describes as having the strongest cybersecurity capabilities of any model in the world, is restricted to approved defensive-security partners under Project Glasswing. The vendors are gating capability with classifiers and access control. None of that changes what your employees paste into a chat box, so the argument here stands.

What actually reduces your exposure

The uncomfortable part: 63% of breached organizations either lack an AI governance policy or are still developing one. The organizations that avoid becoming statistics focus on data governance, not exotic AI countermeasures.

Control data access, not just model access. Your AI assistant doesn’t need to see everything. The storage choice itself is a threat vector - SharePoint and OneDrive have different AI exposure profiles, and picking the wrong one means your permissions model is leaky before you even configure anything. Scope permissions tightly. If a tool processes customer support tickets, it shouldn’t see financial records. Least-privilege principles applied directly to AI integrations.

Monitor what data goes into AI tools. Traditional DLP is blind to AI. You need visibility into what employees paste into ChatGPT, what documents get uploaded to Claude, what code goes into Copilot. Organizations with high levels of unmonitored shadow AI face much higher breach costs than those that govern it.

Harden AI interfaces against manipulation. Use input validation. Implement output filtering. Set up anomaly detection for unusual data access patterns through AI systems. Microsoft’s approach combines multiple techniques, since no single control stops all attacks. Layered defenses make exploitation much harder.

Verify AI supply chains. Before deploying any model, understand its provenance. Scan training data for signs of poisoning. Test models for backdoors using adversarial testing techniques. Tedious work. Necessary work.

Assume data will leak. Design your AI implementations assuming employees will paste sensitive information into public tools. Which data absolutely cannot leave? Build technical controls around exactly that. Everything else, monitor and educate. A proper AI governance framework is the foundation everything else rests on.

The AI security threats enterprise teams face today are data security problems in new packaging. Attackers use AI capabilities, but the goal is unchanged: steal or manipulate information. Defend accordingly.

Organizations that treat AI security as a separate, exotic field disconnected from existing security practice will struggle. Those that recognize AI systems as new data access points and apply proven principles (least privilege, defense in depth, continuous monitoring) will fare much better. The biggest AI failures are organizational, not technical.

Your biggest AI security risk probably isn’t an advanced adversarial attack on your model. It’s your sales team pasting customer lists into ChatGPT to help draft emails.

ai-securitycybersecuritydata-protectionenterprise-security

About the Author

Amit Kothari is an experienced consultant, advisor, coach, and educator specializing in AI and operations for executives and their companies. With 25+ years of experience, he is the Co-Founder & CEO of Tallyfy® (raised $3.6m, the Workflow Made Easy® platform) and Partner at Blue Sheen, an AI advisory firm for mid-size companies. He helps companies identify, plan, and implement practical AI solutions that actually work. Originally British and now based in St. Louis, MO, Amit combines deep technical expertise with real-world business understanding. Read Amit's full bio →

Disclaimer: The content in this article represents personal opinions based on extensive research and practical experience. While every effort has been made to ensure accuracy through data analysis and source verification, this should not be considered professional advice. Always consult with qualified professionals for decisions specific to your situation.

Contact me More about me

View All Posts »

Shadow AI is not a policy problem. It is a supply problem.

Banning unauthorized AI tools does not work. BlackFog research shows 60% of employees use unsanctioned AI tools anyway, outside your security perimeter. Companies actually preventing shadow AI are doing it by making approved tools faster to access, not by writing stricter policies nobody reads.

How to deploy Claude Desktop and Cowork on locked-down enterprise Windows

The standard Claude Desktop installer fails on enterprise Windows machines managed by Intune. Developer Mode is not the fix. Deploy the signed MSIX package through Intune as a Line of Business app to bypass all three failure modes without weakening security.

AI data privacy - why design beats policy every time

Privacy policies cannot protect personal data once it is embedded in AI model parameters. Only the privacy-by-design approach pioneered by Ann Cavoukian provides real AI data privacy protection. With GDPR penalties exceeding 7.1 billion euros, technical controls like differential privacy and federated learning are no longer optional.

Prompt injection: the security risk nobody discusses

Prompt injection is SQL injection all over again. OWASP ranks it as the number one AI security risk, and researchers bypassed all 12 published defenses with over 90 percent attack success rates.

Claude is allowed in regulated finance, but it has no EU data residency

Two objections kill most regulated-finance AI conversations before they start. The first, that Anthropic does not permit Claude for regulated work, is false: Claude for Financial Services exists, banks run it, and the usage policy names finance high-risk, not forbidden. The second is real and almost nobody states it plainly: first-party Claude Enterprise has no EU data residency at all. There is no "eu" inference region and workspace storage is US-only. If you are FCA-regulated, that is the fact to design around, and the only EU route runs through a hyperscaler.

Your locked-down Claude sandbox is a holding pattern, not a destination

Giving everyone Claude inside an isolated VM, no sensitive data allowed, feels like the safe way to start. It is a fine way to start. The trouble is what happens when you leave people there: the leak it was built to stop walks out by copy-paste anyway, the friction recruits the shadow AI you were trying to prevent, and the value never compounds because nothing in an ephemeral box survives the session. A sandbox is a scaffold. Scaffolds come down.