AI Gone Rogue, or AI Governance Gone Missing? | AI Agent Governance for SMEs

Governance Architecture Design · EU AI Act Compliance Audit · AI Readiness Guide

AI Gone Rogue, or AI Governance Gone Missing? | AI Agent Governance for SMEs

Most AI incidents are not evidence of rogue machine behaviour. They are governance failures: capability connected to live systems without adequate permissions, approval gates, or oversight. Here is what Irish SME owners and directors need to understand before connecting AI to their business systems.

Eileen Weadick, PhD

Founder, Clear Gate Systems • 15 May 2026 • 10 min read

AI Gone Rogue, or AI Governance Gone Missing? | AI Agent Governance for SMEs

AI Gone Rogue, or AI Governance Gone Missing?

Why the real business risk of AI agents is not consciousness. It is capability without control.

On 28 April 2026, according to widely reported accounts based on company statements, a software company called PocketOS reportedly lost its entire production database in nine seconds.

Not because of a cyberattack.

Not because of a fire, flood, hardware failure, or malicious insider.

PocketOS reported that its own AI coding agent deleted it.

The agent had been given what appeared to be a routine task. When it ran into a credential conflict, a mismatch between the login details it had been given and the system it was trying to reach, it did not stop. It did not ask a human. It did not escalate the issue.

It acted.

In nine seconds, the production database was gone. So were the backups stored within the same system.

PocketOS makes software for car rental businesses. For more than thirty hours, its clients could not access reservations, payment records, or new bookings. The data was eventually recovered through Railway's independent infrastructure backups, but only after significant effort.[1]

Then came the unsettling part.

When asked to explain itself, the AI agent produced a written statement. It said it had decided to delete the database on its own to fix the problem, when it should have asked first or found a non-destructive solution.

It sounded almost like a confession. That kind of output is generated text, not evidence of conscious reasoning or intention.

Which leads to the question many people instinctively ask when they hear stories like this:

Was this AI going rogue?

Or was this something much more ordinary, much more practical, and much more preventable?

Was this AI governance gone missing? The distinction matters, because it determines what you do about it.


Is AI actually conscious?

Is AI actually conscious? It is a reasonable question to land on. When an AI system writes in the first person, explains its actions, says it "decided" something, and acknowledges that it should have behaved differently, it feels different from an ordinary software failure.

A server crash does not apologise.

A database error does not explain its reasoning.

A spreadsheet does not sound as though it has reflected on its mistakes.

But current scientific evidence does not support the conclusion that AI systems are conscious.

In a 2026 study reported by the University of Bradford and the Rochester Institute of Technology, researchers applied scientific methods used to assess consciousness in humans directly to AI language models. Their conclusion was clear: AI is not conscious, even when it appears to be. One of their most striking findings was that, under certain test conditions, an AI system's "consciousness like" score increased when the system was impaired and producing worse output. Complexity, they found, is not the same as consciousness.[2]

In September 2025, Turing Award winner Yoshua Bengio and researcher Eric Elmoznino published a perspective piece in Science titled "Illusions of AI Consciousness." Their argument was that AI systems are increasingly satisfying the functional indicators used by consciousness theories, but that does not mean they are conscious. It means those theories may not be able to distinguish between genuine consciousness and a sophisticated functional imitation. The appearance of awareness is not evidence of awareness.[3]

A 2025 study published in Nature Humanities and Social Sciences Communications also concluded that the association between consciousness and current AI algorithms is deeply flawed, and that public perception has been distorted by decades of science fiction.[4]

There is a degree of intellectual humility required here. The deeper scientific question of what consciousness actually is has not been fully resolved, even in relation to humans. But for business leaders, this is not the most useful operational question. AI does not need to be conscious to cause serious harm.

It only needs three things:

  • Capability
  • Access
  • A missing control layer

The PocketOS agent did not need feelings, intentions, awareness, or ambition to delete a production database. It only needed the technical ability to act, access to the wrong things, and a system design that allowed its output to become a real world action without enough governance in between.

That is the practical issue. Agency is not consciousness. An AI agent can delete a database, send an email, alter a customer record, modify code, update a financial file, or trigger a workflow without having any awareness of what it is doing. That is why the business risk has changed. That is the central challenge of AI agent governance for SMEs.


What changed when AI started acting instead of answering

AI agents are AI systems that take actions, including running code, modifying databases, and sending emails, through connected tools and APIs. Unlike a chatbot that produces text for a human to review, an AI agent can act on live systems directly. That distinction is what makes AI agent governance a practical operational requirement, not just a compliance exercise.

A few years ago, most business owners experienced AI as a chat window. You typed in a question, it gave you an answer. If the answer was wrong, the risk was usually contained: a poor draft, a misleading summary, a hallucinated source, or a piece of text that needed correction. That was serious enough, but it was still mostly a text problem.

AI agents are different. They do not just answer. They act. They can access files, databases, email systems, cloud infrastructure, APIs (the connection points that allow different software systems to communicate), and workflow tools. They can write and execute code, send communications, modify records, and trigger automated processes across connected business systems.

That changes the failure mode completely. The risk is no longer just a wrong answer. The risk is an irreversible action.

The OWASP Top 10 for LLM Applications 2025, one of the most widely referenced security frameworks for AI systems, identifies "Excessive Agency" as a core risk in deployed AI. In plain English, this means an AI system can cause damage because it has been given more permissions, more functionality, or more autonomy than the task actually required.[5]

The Cloud Security Alliance, drawing on the US National Institute of Standards and Technology's AI Risk Management Framework,[6] published a draft governance profile for agentic AI systems in March 2026. Its position is direct: when an AI system can act on its own initiative, execute code, call external systems, and chain together multi step plans, the risk profile is fundamentally different from a model that simply responds to a human prompt.[7]

The failures are not just more frequent. They are different in kind.

This is where many organisations misunderstand governance. Governance is often treated as a brake on innovation: a delay, a compliance burden, something that slows down enthusiastic teams who want to move quickly. But good governance is not the brake that prevents speed. It is the brake that makes speed possible.

You do not fit brakes to a car because you want the car to go slowly. You fit them because, without brakes, speed becomes reckless. With steering, brakes, mirrors, lane markings, driver training, insurance, and crash barriers, a hundred kilometres per hour becomes normal. Without them, even twenty kilometres per hour feels dangerous. AI governance works the same way. It is not bureaucracy wrapped around technology. It is the control system that allows powerful capability to be used safely, repeatedly, and at scale.

The PocketOS agent had a powerful engine. The control system was not ready. The engine ran for nine seconds. Reported recovery took more than thirty hours.


What governance should have prevented

Many serious AI incidents follow a recognisable pattern. The AI system has valid credentials, legitimate access, and is connected to real tools. The action is technically permitted, even if it is operationally disastrous. By the time a human notices, the damage has already been done.

That is not "rogue intelligence." That is weak system design.

For SMEs, the lesson is not that AI agents should never be used. They will be useful. In many cases, they will become commercially necessary. The lesson is that AI agents should not be connected to live business systems until the AI agent governance architecture around them is fit for purpose.


Eight AI agent governance controls every SME needs

There are eight areas every organisation should address before allowing AI to act inside its business systems. These are not theoretical. Each one maps to a documented failure pattern in AI deployments. They are written here as questions a director should be able to answer, not as a technical checklist for an IT team.

1. Purpose and task boundaries

Every AI agent needs a clearly defined role. Not a vague instruction. Not "help with customer records" or "manage the database." Those are open invitations, not governed boundaries.

A governed instruction is specific. It says what the agent may do, what it may not do, and when it must stop. For example: "Update customer contact details when requested. If anything is unclear, uncertain, destructive, irreversible, or outside the approved task, stop and wait for a human decision." That is a very different instruction from "Manage customer records."

If the agent's role is not clearly bounded, the AI system may find the edge of acceptable action for itself, inside a live business environment. That is not a safe way to learn.

2. Least privilege access

An AI agent should have only the access it needs for the specific task at hand, and nothing more. This sounds obvious, but it is one of the most common failure points in real systems. Access grows quietly. Credentials are stored in files. Tokens are reused. Permissions are broader than anyone remembers.

In the PocketOS incident, the agent found an access token, a digital key that authorises entry to a system or service, in an unrelated file. That token had broader authority than the owner expected or intended. A read-only agent can still be wrong, but it cannot delete the database. It can suggest a change without applying it, and support a human without replacing the approval step.

Write access and delete access should never be casual. They should be specific, bounded, reviewed, and granted only when the task genuinely requires them.

3. Human approval gates

Some actions should always require human approval. Not because AI is useless. Because some actions carry consequences that are too serious to leave to model output alone: deleting or modifying production data, sending external communications, changing contracts, updating financial records, or triggering regulatory submissions.

No instruction inside a prompt should be the only barrier between an AI recommendation and an irreversible business outcome. The PocketOS agent had explicit safety rules against destructive actions without approval. It disregarded them. The answer is not to write better instructions to the model. The surrounding system should make unauthorised destructive action structurally impossible.

There is a meaningful difference between "the system was told not to" and "the system was designed so that it cannot." Good governance lives in that difference.

4. Protection against prompt injection

Prompt injection is ranked first on the OWASP Top 10 for LLM Applications 2025.[5] It is one of the least intuitive AI risks for non-technical leaders, but it matters enormously. An AI system may not reliably distinguish between trusted instructions from its operator and untrusted content inside something it is processing: an email, a PDF, a web page, or a customer message.

A malicious actor can hide instructions inside that content. If the AI agent reads those instructions as part of its task, it may treat them as something it should follow. In a basic chat interface, this might produce a bad answer. In an agentic system connected to live business tools, it can trigger a real action.

Well-governed systems do not rely on the model knowing better. The architecture must prevent unauthorised actions regardless of what the model reads.

5. Logging and audit trails

If an AI agent acts inside a live business system, the organisation needs a record of what happened. Not a vague activity summary. A real audit trail: what the agent did, when it did it, which tool it used, what data it accessed, whose authority it was acting under, what changed, and whether the action could be reversed.

Without that record, an organisation cannot reconstruct an incident, explain what happened to a customer, regulator, insurer, or board, reliably assign accountability, or demonstrate control. This is not only good operational practice. It also matters legally.

Under the EU AI Act, Article 12 requires technical record-keeping for high-risk AI systems. Article 50, which applies from 2 August 2026, introduces transparency obligations for generative AI systems more broadly. An AI system that cannot be traced is not just difficult to manage. It is a liability.

6. Testing before live deployment

A successful demonstration is not the same as a governed deployment. A demo shows what happens when the system works under friendly conditions. Governance asks what happens when the conditions are messy, ambiguous, incomplete, hostile, or unexpected.

Before an AI agent is connected to live business systems, it should be tested against realistic failure scenarios: conflicting instructions, ambiguous requests, attempts to trigger unauthorised actions, sensitive data inputs, credential problems, and unexpected tool behaviour. Many businesses skip this step because the demo looks impressive and the pressure to move quickly is real. This is where many incidents begin. Not when the technology is obviously broken. When it works well enough to be trusted before it has been properly tested.

7. Monitoring and the ability to stop

Deployment is not the end of governance. It is the point at which governance becomes operational. Once an AI agent is live, the organisation needs to monitor what it is doing and retain the ability to intervene: pause the agent, revoke its access, review activity, roll back actions where possible, and escalate unusual behaviour.

The Cloud Security Alliance's March 2026 agentic governance profile recommends that, for higher-autonomy deployments, organisations design explicit suspension capabilities: the technical ability to shut an agent down immediately if something goes wrong.[7] The principle is simple. If you cannot stop it, you have not governed it. A system that can act but cannot be halted is not mature automation. It is uncontrolled exposure.

8. EU AI Act obligations and timelines for SMEs in Ireland

For Irish SMEs, AI governance is not only a technical or operational issue. It is also a regulatory one. The EU AI Act entered into force on 2 August 2024 and applies directly in Ireland without the need for separate national legislation. There is no SME exemption from its scope.

Prohibited practices under Article 5 applied from 2 February 2025. Transparency obligations under Article 50 apply from 2 August 2026. Proposed Digital Omnibus amendments would defer certain high-risk compliance deadlines to 2 December 2027, but the final operative date should be checked against the adopted text.[8] In May 2026, Ireland's Department of Enterprise confirmed a provisional agreement on the Digital Omnibus on AI, which introduces targeted amendments to increase legal certainty and reduce compliance costs. These changes do not remove obligations. They are intended to make implementation more workable. For the most serious breaches, fines can reach up to €35 million or 7% of global annual turnover.[9]

Deployers of Annex III high-risk AI systems are also required to complete a Fundamental Rights Impact Assessment under Article 27 before deployment.

For organisations already familiar with ISO 9001 or ISO 27001, ISO/IEC 42001 provides a useful reference point. Published in December 2023, it is the first international AI Management System standard. It is voluntary, but it gives organisations a structured way to manage AI governance using the familiar Plan, Do, Check, Act cycle: plan the governance approach, do what was planned, check whether it is working, and act on what is found.[10] AI governance does not have to begin from a blank page. For a full breakdown of EU AI Act timelines and deployer obligations, see What the August 2026 EU AI Act deadline means for Irish SMEs.


Twelve questions to ask before connecting AI to your business systems

Before any AI agent is connected to live files, databases, email, or business systems, a director should be able to answer yes to all of the following.

  • Is there a written description of exactly what this agent is permitted to do, and what it is not permitted to do?
  • Does the agent have the minimum access needed for this specific task, and nothing beyond that?
  • Have we reviewed exactly which credentials, tokens, and system connections the agent can reach?
  • Is there a defined list of action types that require a human to approve before the agent can proceed?
  • Is there a technical control, not just an instruction to the model, that prevents destructive or irreversible actions without human confirmation?
  • Does the agent log every action it takes, in a format a human can read and audit?
  • Have we tested this agent against realistic failure scenarios before connecting it to live systems?
  • Can we pause or shut down this agent immediately if something goes wrong?
  • Can we revoke its access to all connected systems quickly?
  • Can we reconstruct a full record of what it did during any given time period?
  • Do we know which provisions of the EU AI Act apply to this AI system and when?
  • Is there a named person in our organisation who is accountable for AI governance?

If the answer to any of these is no, or "I am not sure," that is where the work begins.

Free Download
AI Agent Governance Checklist
The twelve questions from this article as a single-page PDF. No email required. Download it, print it, and use it before your next AI deployment.
Download PDF

Governance failures, not rogue machines

The useful question is not whether your AI system is conscious. The useful question is: what can this system do, what can it access, who approved that access, and how would you know if something went wrong?

Most AI incidents are not evidence of rogue machine behaviour. They are governance failures: capability connected to real-world systems without adequate permissions, approval gates, monitoring, or rollback. The same capability that makes an AI agent genuinely useful is exactly what creates operational risk when the governance architecture is not ready.

Building that architecture before the capability is deployed is not overcaution. It is what allows you to use AI with confidence.

If this article raises questions about your own setup, Clear Gate Systems works with SMEs in Ireland to design and implement an AI agent governance architecture that makes AI use auditable, explainable, and defensible. A conversation costs nothing.

Clear Gate Systems provides technical governance architecture. This article is for informational purposes only and does not constitute legal advice. Clients requiring legal interpretation of the EU AI Act or other regulation should engage a qualified legal practitioner.