Athena AgenticAthena Agentic.
Athena Agentic Research

The Measurable Benefits of Agentic-First Cybersecurity Operations

MB
Michael Bertamini
Chief Executive Officer · Athena Agentic
Published
June 2026
Evidence sources
32 reviewed · 15 excluded
Classification
Public Research
The global cybersecurity workforce cannot scale to meet demand. This research synthesises evidence from government bodies, independent academic institutions, and professional associations to evaluate the measurable operational benefits of adopting an agentic-first approach to cybersecurity operations — one in which AI agents handle high-volume, defined tasks while human analysts supervise by exception. All vendor-sponsored research has been excluded. Evidence is classified by confidence level throughout.
Section 1

Executive Summary

4.8M
unfilled cybersecurity roles globally, growing 19% year on year
ISC2 Cybersecurity Workforce Study 2024 · High confidence
66%
of security professionals say their role is more stressful than five years ago
ISACA State of Cybersecurity 2024 · High confidence
61%
reduction in analyst alert volume achieved by automated triage in a live production SOC
arXiv 2505.09843 (2025) · Medium confidence
72%
of organisations report an increase in organisational cyber risk
WEF Global Cybersecurity Outlook 2025 · High confidence

The global cybersecurity landscape is characterised by a structural workforce deficit, escalating attack volumes, and mounting pressure on security teams operating with constrained budgets. Evidence from independent sources — including ISC2, ISACA, ENISA, the World Economic Forum, and peer-reviewed academic research — converges on a consistent finding: the current model of human-centric, reactive security operations is insufficient to meet modern threat volumes and is causing measurable harm through analyst burnout, workforce attrition, missed detections, and extended breach dwell times.

This paper evaluates the potential measurable benefits of adopting an agentic-first approach: one in which AI agents autonomously execute defined security tasks — detection, triage, investigation, containment, compliance evidence gathering — while human analysts supervise by exception.

Key Finding 1 — Workforce Gap

The global cybersecurity workforce gap reached approximately 4.8 million unfilled roles in 2024 (ISC2 2024, n=15,852 respondents), a 19.1% year-on-year increase. Ninety percent of practitioners report skills gaps on their teams. This gap cannot be closed by hiring alone — automation of analyst tasks is the only scalable near-term mechanism to expand effective security operations capacity.

Key Finding 2 — Burnout Risk

Sixty-six percent of cybersecurity professionals report their role is more stressful than five years ago (ISACA 2024). Nearly half report current burnout (ISC2 2024). Burned-out analysts miss detections, respond slowly, and leave — creating both a security and a human capital problem. Automating high-volume, low-complexity alert triage is the most tractable near-term intervention.

Key Finding 3 — Alert Automation

Academic research indicates that false positive rates in SOC environments can be extremely high. An Oxford University study (USENIX Security 2022) found practitioners characterised some tool-category rates as approaching 99%. Automated triage systems have demonstrated the ability to reduce alerts shown to analysts by up to 61% with a 1.36% false negative rate over millions of alerts (arXiv 2505.09843).

Key Finding 4 — AI Productivity

A peer-reviewed study by Brynjolfsson, Li & Raymond (NBER 2023) found generative AI assistance boosted knowledge worker productivity by 14% on average, with a 34% improvement for less-experienced workers. This finding — from a different domain (customer support) — provides directional evidence applicable to security triage and investigation tasks, but requires domain-specific validation before direct transposition.

Research Limitation

This paper is based on secondary research using publicly available sources. Vendor-sponsored research has been excluded from primary claims. Several benefit areas — particularly MSP-specific efficiency gains — lack independent peer-reviewed evidence and are treated as inference rather than established findings. All claims are classified by confidence level throughout.

Section 2

Research Scope and Methodology

Research Question

What measurable operational, economic, and security outcome benefits can organisations reasonably expect from adopting an agentic-first approach to cybersecurity and IT operations, and what is the quality of the evidence supporting those expectations?

Source Hierarchy

Evidence was collected through systematic searches targeting non-vendor-conflicted primary and secondary sources. The following hierarchy was applied:

TierCategorySourcesWeight
Tier 1Government & regulatory bodiesNIST, CISA, ENISA, ISC2, ISACA, WEF, OECD, US BLS, GAO, UK DCMS, Jobs and Skills AustraliaHighest
Tier 2Independent academic institutionsNBER (MIT/Stanford), USENIX (Oxford), ACM Computing Surveys, arXiv (methodology reviewed)High
Tier 3Industry surveys with disclosed methodologyKaseya MSP Benchmark, Sophos Active Adversary, TinesMedium
ExcludedVendor-conflicted researchIBM, Microsoft, CrowdStrike, Palo Alto Networks, SentinelOne, Fortinet, Splunk, Accenture, Deloitte, PwC, McKinsey, GartnerNot used

Evidence Classification

Throughout this paper, claims are explicitly classified as:

  • Evidence — Supported by two or more credible, non-conflicted sources
  • Single-source directional — Supported by one credible source but not independently confirmed
  • Inference — Reasoned logical extension from established evidence, not directly evidenced
  • Hypothesis — Plausible but not evidenced; requires empirical validation

32 documented evidence sources were reviewed across ten research topics. 15 sources were rejected for vendor conflict, methodology weakness, or non-independence.

Section 3

What “Agentic-First” Means in Cybersecurity Operations

Definitional Context

Traditional security operations rely on human analysts to receive alerts, manually triage them, investigate suspicious activity, and escalate for action. “AI-assisted” approaches layer machine learning outputs onto this fundamentally human workflow — analysts still perform the core cognitive and operational tasks.

An agentic-first approach inverts this model. AI agents autonomously execute defined security tasks within prescribed authority boundaries, while human analysts supervise outcomes, set policy, and make judgments at exception points. The human moves from being “in the loop” (required for every action) to “on the loop” (monitoring and correcting, intervening when genuinely needed).

Agentic vs. AI Co-pilot: A Structural Comparison

DimensionAI Co-pilot / AssistantAgentic-First
Primary executorHuman analystAI agent
Human rolePrimary responderSupervisor / exception handler
Alert throughputLimited by analyst capacityScales with compute
Coverage hoursLimited to staffed hours24/7 continuous
ConsistencyVariable (human fatigue)Consistent within policy
Audit trailPartial (analyst notes)Complete (every agent step)
Skill requirementSenior expertise to operateSenior expertise to govern

Regulatory Alignment

NIST SP 800-207 (Zero Trust Architecture) and the CISA Zero Trust Maturity Model v2.0 (April 2023) both explicitly include “Automation and Orchestration” as a cross-cutting capability. CISA describes this as capabilities that “leverage insights to support robust and streamlined operations to handle security incidents and respond to events.”

NIST AI RMF 1.0 (January 2023) provides a governance structure for AI systems through four functions: Govern, Map, Measure, and Manage. Agentic security platforms operating under explicit authority matrices with full audit logging are architecturally aligned with AI RMF governance requirements. Evidence

Section 4

Market Context

4.1 The Cybersecurity Workforce Crisis

The cybersecurity workforce deficit is the most robustly evidenced structural problem in this research domain.

4.8M
unfilled cybersecurity roles globally
ISC2 Workforce Study 2024 (n=15,852)
90%
of practitioners report skills gaps on their teams
ISC2 2024
300K
estimated EU cybersecurity workforce shortage
ENISA 2024 NIS Investments Report
+29%
projected US cybersecurity employment growth 2024–2034
US Bureau of Labor Statistics, May 2024

Evidence The ISC2 2024 Cybersecurity Workforce Study (n=15,852 practitioners) found the global workforce gap at approximately 4,763,963 people — 47% of total global need unmet. The gap grew 19.1% year on year. 58% state skills gaps put their organisation at significant risk. 37% faced budget cuts in 2024.

Evidence ENISA’s 2024 NIS Investments report, drawing on 1,080 professionals across EU 27 Member States, estimates an EU workforce shortage of approximately 300,000. The UK Government’s Cyber Security Skills in the UK Labour Market 2023 report found 50% of all UK businesses have a basic cybersecurity skills gap.

Evidence The US Bureau of Labor Statistics (May 2024) projects 29% employment growth for information security analysts 2024–2034, yet the ISC2 data shows the gap itself grew 19.1% in a single year — indicating supply cannot catch demand.

Inference Given that the workforce gap cannot be closed through hiring alone within any operational planning horizon, automation of analyst tasks represents the only scalable near-term mechanism to expand effective security operations capacity.

4.2 Analyst Burnout and Its Operational Consequences

66%
say their role is more stressful than five years ago
ISACA State of Cybersecurity 2024 (n=1,800+)
~50%
of cybersecurity professionals at all levels report current burnout
ISC2 Workforce Study 2024
46%
cite high work stress as a reason practitioners leave their organisation
ISACA 2024
55%
of organisations report difficulties retaining qualified cybersecurity staff
ISACA 2024

Evidence ISACA’s 2024 State of Cybersecurity survey (n=1,800+): 66% of professionals say their role is more stressful than five years ago; 81% cite an increasingly complex threat landscape as the primary stressor; 55% report difficulties retaining qualified candidates; 46% cite high work stress as a reason practitioners leave.

Evidence ISC2 2024 Workforce Study found that nearly half of cybersecurity professionals at all levels currently report burnout, with teams expected to do more with fewer resources in increasingly complex environments.

Inference Burnout-induced performance degradation — including slower triage, higher missed detection rates, and analyst turnover — creates a compounding security risk distinct from headcount shortage. Automating high-volume, low-complexity alert triage is the most operationally tractable intervention for this specific problem.

4.3 Alert Volume and False Positive Rates

~50%
of security alerts in typical SOC environments are false positives
ACM Computing Surveys 2025
61%
reduction in analyst alert volume, real production SOC, six months
arXiv 2505.09843 (AACT study, 2025)
1.36%
false negative rate achieved across millions of alerts in the same study
arXiv 2505.09843 (AACT study, 2025)

Evidence (academic) Alahmadi, Axon & Martinovic (USENIX Security ’22, University of Oxford), in a qualitative study of SOC practitioners, found they characterised false positive rates as extremely high — in some tool categories approaching what practitioners describe as “99% false positives” as an experiential reality. Researchers found most false positive alerts are explained by benign triggers rather than malicious activity, suggesting detection rule calibration and contextual enrichment could dramatically reduce analyst burden.

Single-source directional A research synthesis in ACM Computing Surveys (Tariq et al., 2025) indicates approximately 46% of all alerts across typical SOC environments are false positives.

Single-source directional An Automated Alert Classification and Triage system (AACT, arXiv 2505.09843, 2025) evaluated in a real production SOC over six months demonstrated a 61% reduction in alerts requiring human analyst attention, with a 1.36% false negative rate over millions of alerts. This represents a genuine measured outcome in a live environment, not a vendor claim or demo scenario.

Important caveat: Alert volume and false positive rates vary substantially by environment, tool set, and industry sector. The AACT study result derives from a single SOC environment and should not be assumed as a universal or guaranteed outcome.

4.4 AI Governance and Regulatory Context

Evidence NIST AI RMF 1.0 (January 2023) and the Generative AI Profile (NIST-AI-600-1, July 2024) provide the primary US governance framework for AI systems. The four functions — Govern, Map, Measure, and Manage — require organisations to maintain audit trails, assess risks, and implement controls.

Evidence CISA Zero Trust Maturity Model v2.0 (April 2023) includes Automation and Orchestration as a required cross-cutting capability for federal agencies pursuing zero trust architecture, with a FY2024 implementation deadline.

Evidence The WEF Global Cybersecurity Outlook 2025 found that 66% of organisations expect AI to have major impact on cybersecurity in 2025, yet only 37% have processes in place to assess AI tools before deployment — indicating a significant governance gap.

4.5 Economic Context of Cyber Incidents

Evidence WEF Global Cybersecurity Outlook 2025: 72% of respondents report an increase in organisational cyber risks; ransomware is the top concern for 45% of respondents.

Evidence ENISA Threat Landscape 2024 documented approximately 2,580 incidents across EU member states in its reporting period, with availability attacks, ransomware, and data attacks as the top three categories.

Single-source directional Average ransomware recovery costs (excluding ransom payment) were approximately $1.53 million in 2025 (Sophos Active Adversary data), down from $2.73 million in 2024, with improved recovery speed cited as a contributing factor. Note: Sophos is a security vendor; IR case selection may not represent all organisations.

Note: The most widely cited breach cost figures (IBM Cost of a Data Breach Report) derive from a vendor-commissioned study and have been excluded from this research as a primary evidence source.

Section 5

Evidence-Based Benefit Categories

5.1 Operational Efficiency

5.1.1 Alert Triage and Processing Capacity

The most tractable and evidenced efficiency gain from agentic automation is in alert triage. The AACT academic study (arXiv 2505.09843) demonstrated a 61% reduction in alerts shown to analysts with a 1.36% false negative rate over millions of alerts in a real SOC environment. Single-source academic

Inference If analysts spend a material portion of their time on alert triage (estimates range from 27%–33% from available industry surveys, though these are not peer-reviewed), a 60% reduction in triage volume would release significant analyst capacity — equivalent to one to two additional analyst-equivalents for a team of eight.

Hypothesis Whether similar triage reduction rates are achievable in agentic-first platforms operating across heterogeneous enterprise environments at scale remains to be demonstrated in peer-reviewed studies.

5.1.2 Incident Response Speed

Single-source directional Sophos Active Adversary Report (2023) indicates median attacker dwell time was eight days for all attacks and five days for ransomware in H1 2023 — the lowest recorded since systematic tracking began. This improvement is attributed to improved detection tooling, but methodology does not isolate the specific contribution of automated versus human detection.

Inference Machine-speed detection and automated containment (operating in seconds rather than hours or days) would logically reduce dwell time relative to human-only SOC operations. The relationship between reduced dwell time and reduced breach cost is established in principle; quantification requires organisation-specific baseline data.

5.1.3 24/7 Coverage Without Linear Staffing Cost

Inference Maintaining 24/7 human security coverage requires approximately four to five analysts per coverage position (accounting for shift rotation, leave, training, and turnover). Agentic automation that operates continuously without shift constraints provides 24/7 coverage at platform cost rather than a multiple of human staffing. For MSPs and MSSPs, this changes the economic model from linear (more customers requires proportionally more analysts) toward non-linear (agent capacity scales faster than analyst headcount).

5.2 Economic Impact

5.2.1 Analyst Capacity and Cost

Evidence US Bureau of Labor Statistics (May 2024) reports the median annual wage for information security analysts at $124,910, with 29% employment growth projected 2024–2034. Fully loaded cost (salary plus benefits and overhead) is typically 1.35–1.5× base salary.

Inference If agentic automation doubles an MSP analyst’s effective coverage (a conservative estimate with no specific peer-reviewed evidence), a $124,910 median-salary analyst effectively becomes the economic equivalent of two analysts for monitoring and triage functions.

5.2.2 Skills Gaps and Breach Probability

Evidence ISACA 2024 found that 22% of organisations with critical or significant skills gaps experienced material breaches, versus 17% of organisations with no skills gaps — a 5 percentage point differential. This establishes an association (though not fully demonstrated causality) between skills gaps and breach probability.

Economic Limitation

Independent quantification of per-breach average costs by sector and organisation size is limited in non-vendor peer-reviewed literature. The IBM Cost of Data Breach Report — the most widely cited source — has been excluded from this research due to vendor conflict of interest. This is a genuine evidence gap for economic impact modelling.

5.3 Security Outcome Impact

5.3.1 Detection Coverage

Evidence (academic) The USENIX Security ’22 study (Oxford University) demonstrates that high false positive rates paradoxically reduce effective detection — when analysts experience alert fatigue, genuine threats may be dismissed or deprioritised. Automated triage filtering false positives with high accuracy should improve genuine threat detection rates.

5.3.2 Consistency of Response

Inference Human analysts vary in capability, attention, and performance — particularly across shift boundaries, after overnight periods, and when fatigued. AI agents executing defined response playbooks are consistent by design. Whether this consistency translates to materially better security outcomes in practice depends on playbook quality and coverage.

5.3.3 AI-Assisted Knowledge Work Productivity

Evidence (different domain) A peer-reviewed study by Brynjolfsson, Li & Raymond (NBER Working Paper No. w31161, 2023 — MIT and Stanford researchers) conducted a randomised evaluation of AI assistance across 5,179 customer support agents at a Fortune 500 enterprise. AI assistance improved productivity by 14% on average, with a 34% improvement for less-experienced workers, reduced customer escalations, and improved retention.

This represents the most rigorous non-conflicted productivity evidence available. It supports a directional expectation that AI assistance improves knowledge worker performance, with the largest benefits for less-experienced workers — who are disproportionately represented in analyst teams experiencing skills shortages. Direct transposition to cybersecurity requires explicit domain disclosure and further validation.

5.4 Enterprise and Government Impact

Evidence GAO reports (2023–2024) identify consistent gaps in federal agency cybersecurity performance measurement and zero trust implementation. Four DOD programs had not developed zero trust architecture plans by the 2027 deadline as of 2024.

Evidence CISA Zero Trust Maturity Model v2.0 requires Automation and Orchestration capabilities, creating a regulatory driver for agentic approaches in US federal contexts.

Evidence More than 50% of Australian government agencies experience critical cybersecurity skills shortages (State of the Service Report 2023–24).

5.5 MSP / MSSP Business Model Impact

Evidence (directional) Kaseya 2023 MSP Benchmark Survey (n=1,091) indicates that only 8% of MSP executives report technicians managing more than 750 endpoints, with the gold standard cited at approximately 350 managed endpoints per technician.

Inference Agentic automation handling first-line monitoring, alert triage, and routine response could allow MSP/MSSP analysts to oversee higher customer ratios. Increasing managed endpoints per analyst from 350 to 700 would double effective revenue capacity per analyst. Whether this ratio is achievable in practice depends on the proportion of analyst time currently consumed by automatable tasks.

Section 6

MSP / MSSP Impact Model

Important Caveat

The scenarios below are modelling exercises based on inference and partial evidence. They should not be presented as guaranteed outcomes. Actual results depend on: the proportion of analyst time currently spent on automatable tasks (not independently measured), automation reliability in the specific environment, playbook quality, and customer environment complexity.

Representative MSSP Baseline

For a representative MSSP with 15 analysts managing approximately 30–50 customers, using evidence-based benchmarks:

  • Median US analyst fully loaded cost: approximately $175,000–$190,000 annually (BLS median salary $124,910 plus 40–50% benefits/overhead)
  • Managing approximately 350 endpoints per technician (Kaseya 2023 gold standard benchmark)
  • Average alert volume: estimated 5,000–8,000 alerts per analyst per month (industry survey; varies widely)

Estimated Analyst Time Distribution

Task CategoryEstimated % Analyst TimeEvidence Basis
Alert triage and first-line investigation27–33%Industry surveys (partially disclosed methodology)
Report generation and documentation10–15%Inference from knowledge work automation research
Compliance evidence collection10–20%GRC automation literature (directional)
Ticket management and escalation routing5–10%MSP benchmark data (indirect)
Genuine investigation and containment35–50%Residual estimation

These percentages are estimates based on partial evidence. No peer-reviewed study specifically quantifies MSSP analyst time distribution across task types.

Efficiency Scenarios

20%
Conservative: automation handles initial triage of clearly benign alerts. 15 analysts operate as 18 analyst-equivalents. ~$1.05M annual cost avoidance.
Conservative scenario
40%
Expected: automation handles triage, routine reporting, compliance evidence. 15 analysts operate as 25 analyst-equivalents. ~$3.5M capacity expansion.
Expected scenario
60%
Optimistic: automation handles triage, routine investigation, documentation, compliance. 2–3× customer base scaling capacity. ~$7.7M capacity.
Optimistic scenario
Section 7

Enterprise Impact Model

Representative Enterprise Baseline

For a mid-market enterprise with 5,000 endpoints, 8 security analysts, mixed on-premises/cloud infrastructure, and compliance obligations across 2–3 frameworks (NIST CSF, SOC 2, ISO 27001):

Analyst Capacity Uplift Scenarios

  • Conservative (20–25% triage burden reduction): Equivalent to 2 additional analyst-equivalents from existing 8-person team. Cost value: approximately $250,000–$375,000 per analyst-equivalent (fully loaded).
  • Expected (35–40% reduction): Equivalent to 3 additional analyst-equivalents, approximately $750,000–$1.1M in freed capacity value.

Strategic Outcomes Beyond Cost

  • Board-level visibility: Persona-tuned dashboards enable CISOs to present real-time security posture without manual aggregation
  • AI governance compliance: Shadow AI discovery addresses the WEF-evidenced gap (only 37% have AI assessment processes)
  • Regulatory readiness: Automated compliance evidence supports NIS2, SOC 2, ISO 27001, HIPAA, and other framework obligations
  • Skills gap bridging: NBER research (directional) suggests AI assistance provides the greatest uplift — up to 34% — for less-experienced workers

Inference Compliance automation may reduce compliance-related labour costs by 20–25% (directional, GRC automation literature). For an enterprise spending $300,000–$500,000 annually on compliance activities, this suggests $60,000–$125,000 in potential annual savings. These figures require organisation-specific validation.

Section 8

Government and Public Sector Impact Model

Evidence More than 50% of Australian government agencies experience critical cybersecurity skills shortages (State of the Service Report 2023–24).

Evidence US GAO 2024 identified that four major DOD programs had not developed zero trust architecture implementation plans by their 2027 deadline, and that the National Cybersecurity Strategy implementation plan lacked outcome-oriented performance measures.

Evidence ENISA 2024 documented that 220 of approximately 2,580 EU cyber incidents specifically targeted two or more EU member states simultaneously — indicating cross-border threat actors targeting public sector infrastructure.

Regulatory Drivers

  • US Federal: CISA Zero Trust Maturity Model v2.0 (April 2023) requires Automation and Orchestration as a cross-cutting capability for all federal agencies
  • EU: NIS2 Directive creates mandatory incident reporting and security measure requirements for essential and important entities, including public sector
  • NIST AI RMF: Governance framework for AI system deployment that government agencies are increasingly expected to follow
  • Australia: Jobs and Skills Australia (2024) documents sustained demand for cybersecurity skills against constrained public sector hiring capacity

Government-Specific Adoption Considerations

  • Procurement processes typically require independent evaluation and certification before deployment
  • Sovereignty and data residency requirements may constrain deployment architecture
  • Authority matrices must reflect legal constraints on automated action, particularly in defence contexts
  • AI governance requirements (NIST AI RMF compliance) should be verified and documented in procurement
Section 9

Risk, Governance, and Responsible AI

Balanced Treatment Required

Any responsible evaluation of agentic security automation must acknowledge the real risks. These are not hypothetical concerns — they are documented governance requirements and operational realities.

Risks of Agentic Automation

  • Over-reliance risk: Organisations that reduce analyst headcount based on automation capability projections take on risk if the automation underperforms, is unavailable, or is circumvented by adversaries. NIST AI RMF explicitly addresses this through the “Measure” function, requiring ongoing performance monitoring.
  • Adversarial risk: AI systems can be manipulated. Adversaries may probe agentic systems to identify policy boundaries, exploit automation gaps, or deliberately trigger containment actions to create operational disruption. This is an open research question.
  • False negative risk: No automated triage system achieves zero false negative rates. The AACT study reported 1.36% false negatives over millions of alerts — a low rate that nevertheless means genuine threats are occasionally missed. System design must account for this through escalation and periodic human review.
  • Governance and accountability: When an autonomous agent takes a containment action that turns out to be incorrect, clear accountability frameworks are needed. Reversibility, audit logging, and defined escalation paths are minimum requirements.
  • Regulatory uncertainty: Autonomous security agents are not yet specifically regulated in most jurisdictions, but this may change. EU AI Act classifications may apply to high-impact automated decision systems.
  • Skills erosion: Over time, if human analysts rely heavily on agents for triage and investigation, the skills required to supervise, govern, and replace automated systems must be actively maintained.

Responsible Deployment Framework

Evidence suggests (NIST AI RMF 1.0; CISA Zero Trust Maturity Model; WEF 2025 guidance) that responsible deployment of agentic AI in security operations requires:

  1. Explicit authority governance: Every autonomous action capability must be explicitly configured, constrained, and documented
  2. Continuous performance monitoring: Automated system outcomes must be regularly reviewed against human-verified ground truth
  3. Reversibility by design: Every agent action must have a defined undo path
  4. Human oversight at exception points: Genuine decision uncertainty must trigger human escalation, not autonomous default action
  5. Skills preservation: Human analysts must maintain sufficient skills to supervise, govern, and when necessary replace automated systems

AI Governance as a Product Feature

Evidence WEF Global Cybersecurity Outlook 2025: 66% of organisations expect AI to have major cybersecurity impact, but only 37% have processes to assess AI tools before deployment. Shadow AI (unauthorised use of AI tools within organisations) is identified as an emerging risk by both ENISA and WEF.

Agentic security platforms that include shadow AI discovery and governance features address a documented enterprise need — the ability to identify and govern AI usage within the organisation — that is distinct from the security operations efficiency benefits.

Section 10

Metrics Approved for Public Use

The following metrics are supported by evidence of sufficient quality (Tier 1 or Tier 2 sources, non-vendor-conflicted) for use in external communications, provided they include appropriate attribution and context:

MetricValueSourceConfidence
Global cybersecurity workforce gap4.8 million unfilled rolesISC2 Workforce Study 2024High
Security teams reporting skills gaps90%ISC2 2024High
Professionals reporting increased stress66%ISACA 2024High
Analysts reporting burnout~50%ISC2 2024High
Organisations with increased cyber risk72%WEF Global Cybersecurity Outlook 2025High
EU cybersecurity workforce shortage300,000ENISA 2024High
Organisations ranking ransomware #145%WEF 2025High
US cybersecurity employment growth (projected)29% by 2034US Bureau of Labor Statistics, May 2024High
Alert volume reduction with automated triage61%arXiv 2505.09843 (2025)Medium
AI knowledge worker productivity uplift14% average; 34% noviceBrynjolfsson, Li & Raymond, NBER 2023Medium (different domain)
Organisations with AI assessment processes37%WEF Global Cybersecurity Outlook 2025Medium
Median attacker dwell time (H1 2023)8 daysSophos Active Adversary Report 2023Medium (IR firm data)
Attribution Requirement

Any use of these metrics in marketing materials must include source attribution. Metrics must not be combined into derived calculations without explicit disclosure of the calculation methodology. Medium-confidence metrics must note their limitation when cited in formal contexts (investor briefings, government procurement).

Section 11

ROI Calculator Default Inputs

The following parameters may be used as evidence-based defaults in an ROI modelling tool, provided the calculator is transparent about the basis for each default and allows user override with organisation-specific data:

ParameterDefault ValueSourceNotes
US cybersecurity analyst median salary$124,910/yearBLS May 2024Adjust by geography; fully loaded = 1.35–1.5×
False positive alert rate40–60%USENIX ’22; ACM 2025Environment-specific; use as range
% analyst time on triage27–33%Industry surveys (methodology partially disclosed)Conservative end recommended for modelling
Automated triage reduction potential40–60%AACT study (arXiv 2505.09843)Use conservative end (40%) pending broader validation
Analyst productivity uplift (AI-assisted)14% averageNBER w31161, Brynjolfsson 2023Different domain; directional only; include caveat
MSP managed endpoints per technician350Kaseya 2023 MSP Benchmark SurveyIndustry benchmark; vendor-sponsored survey; user-adjustable
% analysts reporting burnout~50%ISC2 2024Turnover risk input
24/7 FTE per coverage position4–5Shift planning arithmeticInference; not peer-reviewed; standard HR planning
Section 12

Metrics Excluded from External Claims

The following metrics are not suitable for use in external marketing, investor, or procurement communications without independent third-party validation:

MetricReason for ExclusionRequired Action
Average data breach cost ($4.44M)Sourced to IBM Cost of Data Breach Report — vendor-commissioned research with material conflict of interestFind independent source or remove
AI automation savings ($1.9M per breach)Derived from excluded IBM sourceRemove; replace with non-conflicted evidence or model explicitly
Adversary breakout time (29 minutes)Sourced to CrowdStrike 2026 Global Threat Report — excluded vendor sourceReplace with Sophos/ENISA independent IR data (8-day dwell median)
“96% auto-resolution rate”Vendor-claimed; no independent verification; methodology undisclosedIndependent third-party audit required before external use
“3.4-second containment time”Demo scenario; may not represent production performanceReal-world measurement across customer environments required
“1,284 actions per day”Vendor capability claim; context and methodology unclearDefine conditions precisely; independent validation recommended
Section 13

Research Limitations

This research is subject to the following limitations, which must be acknowledged in any derivative materials:

  1. No primary data collection. All findings are based on secondary research. No surveys, interviews, or empirical measurement of Athena Agentic customer outcomes were conducted.
  2. Domain transfer problem. The most robust AI productivity evidence (Brynjolfsson et al., 2023) comes from customer support, not cybersecurity. Direct transposition of 14% productivity gains to security operations is inference, not evidence.
  3. Alert triage research specificity. The AACT study (61% alert reduction) was conducted in a specific SOC environment with specific tool sets. Generalisability to diverse enterprise or MSSP environments is not established.
  4. Vendor conflict in key metrics. Average breach costs and adversary speed statistics (IBM, CrowdStrike) are the most commonly cited industry metrics but come from excluded sources. This creates a genuine gap in economic impact quantification that is difficult to fill with fully independent data.
  5. MSSP economics gap. No independent peer-reviewed research specifically quantifies MSSP analyst time distribution across task types, making efficiency models heavily dependent on inference and industry survey data with limited methodology disclosure.
  6. Temporal relevance. Some evidence pre-dates significant AI capability advances (post-2022 GPT-era). Some older studies on alert fatigue and dwell time may not reflect current SOC environments.
  7. Geographic concentration. Much of the strongest evidence is US-centric (BLS data, ISC2 US subset, CISA guidance). Application to UK, EU, Australian, or government contexts requires geographic adjustment.
  8. Agentic AI is nascent. As of mid-2026, agentic security AI at production scale is a recent development. Long-term outcome data, independent audits of production performance, and peer-reviewed case studies are not yet available in the literature.
  9. Counterfactual gap. We cannot directly observe what security outcomes would have been achieved without agentic automation in specific environments. All benefit claims are necessarily comparative or inferential.
Section 14

Conclusion

The evidence base for adopting agentic-first cybersecurity platforms is strongest in the foundational problem areas — workforce shortage, analyst burnout, alert volume, and the need for continuous automated coverage — where government, academic, and professional association sources converge on a consistent picture of structural insufficiency in the current human-centric SOC model.

The evidence for specific operational and financial benefits of agentic automation is directional and requires customer-specific validation. The most robust transfer of evidence comes from: (a) academic automated triage research demonstrating genuine reduction in analyst burden; (b) AI productivity research demonstrating measurable uplift in knowledge work; and (c) economic modelling based on verified labour cost data and transparent assumptions.

For MSPs and MSSPs, the structural economic argument is compelling: the workforce shortage makes proportional headcount scaling economically and practically unviable, and agentic automation is architecturally suited to the multi-tenant, high-volume, continuous monitoring use case. For enterprises, the compliance automation and 24/7 coverage arguments are strongest. For government, the zero trust automation mandate and compliance burden reduction are the most robustly policy-supported benefits.

Recommended Path to Stronger Evidence

The research gap most worth closing is production performance data: deploying instruments to measure analyst time before/after automation deployment, conducting independent third-party performance audits, publishing case studies with disclosed methodology, and engaging academic institutions for peer-reviewed evaluation. These activities would be the first independent peer-reviewed studies in the space and would represent significant industry credibility.

References

Full References

All sources below are non-vendor-conflicted primary and secondary sources. Vendor-commissioned research has been excluded; see Section 12 for the excluded metrics register.

[1]
ISC2. (2024). ISC2 Cybersecurity Workforce Study 2024. ISC2. https://www.isc2.org/Insights/2024/10/ISC2-2024-Cybersecurity-Workforce-Study
[2]
ISC2. (2024, September). Growth of Cybersecurity Workforce Slows in 2024 as Economic Uncertainty Persists. ISC2. https://www.isc2.org/Insights/2024/09/ISC2-Publishes-2024-Cybersecurity-Workforce-Study-First-Look
[4]
ENISA. (2024). 2024 Report on the State of Cybersecurity in the Union. European Union Agency for Cybersecurity. https://www.enisa.europa.eu/publications/2024-report-on-the-state-of-the-cybersecurity-in-the-union
[5]
ENISA. (2024). ENISA Threat Landscape 2024. European Union Agency for Cybersecurity. https://securitydelta.nl/media/com_hsd/report/690/document/ENISA-Threat-Landscape-2024.pdf
[6]
World Economic Forum. (2025). Global Cybersecurity Outlook 2025. WEF. https://reports.weforum.org/docs/WEF_Global_Cybersecurity_Outlook_2025.pdf
[7]
NIST. (2023). AI Risk Management Framework (AI RMF 1.0). National Institute of Standards and Technology. https://www.nist.gov/itl/ai-risk-management-framework
[8]
NIST. (2024). Artificial Intelligence Risk Management Framework: Generative AI Profile (NIST-AI-600-1). National Institute of Standards and Technology. https://www.nist.gov/publications/artificial-intelligence-risk-management-framework-generative-artificial-intelligence
[9]
CISA. (2023). Zero Trust Maturity Model Version 2.0. Cybersecurity and Infrastructure Security Agency. https://www.cisa.gov/sites/default/files/2023-04/zero_trust_maturity_model_v2_508.pdf
[10]
NIST. (2020, updated 2021). Zero Trust Architecture: NIST Special Publication 800-207. National Institute of Standards and Technology. https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-207.pdf
[11]
Brynjolfsson, E., Li, D., & Raymond, L. (2023). Generative AI at Work. NBER Working Paper No. w31161. National Bureau of Economic Research. https://www.nber.org/papers/w31161
[12]
Alahmadi, B. A., Axon, L., & Martinovic, I. (2022). 99% False Positives: A Qualitative Study of SOC Analysts’ Perspectives on Security Alarms. 31st USENIX Security Symposium. University of Oxford. https://www.usenix.org/conference/usenixsecurity22/presentation/alahmadi
[13]
Tariq, M., et al. (2025). Alert Fatigue in Security Operations Centres: Research Challenges and Opportunities. ACM Computing Surveys, Vol. 57, No. 9, Article 224. https://dl.acm.org/doi/10.1145/3723158
[14]
Jalalvand, M., et al. (2024). Alert Prioritisation in Security Operations Centres: A Systematic Survey on Criteria and Methods. ACM Computing Surveys, Vol. 57, No. 2. https://dl.acm.org/doi/10.1145/3695462
[15]
US Bureau of Labor Statistics. (2024). Occupational Outlook Handbook: Information Security Analysts. US Department of Labor. https://www.bls.gov/ooh/computer-and-information-technology/information-security-analysts.htm
[16]
US Government Accountability Office. (2024). Cybersecurity: National Cyber Director Needs to Take Additional Actions (GAO-24-106916). GAO. https://www.gao.gov/products/gao-24-106916
[17]
US Government Accountability Office. (2025). IT Systems Annual Assessment: DOD Needs to Improve Performance Reporting and Cybersecurity Planning (GAO-25-107649). GAO. https://www.gao.gov/products/gao-25-107649
[19]
UK Government / DCMS. (2023). Cyber Security Skills in the UK Labour Market 2023. https://gov.uk/government/publications/cyber-security-skills-in-the-uk-labour-market-2023
[21]
Kaseya. (2023). 2023 MSP Benchmark Survey. Kaseya. https://www.kaseya.com/blog/2023-msp-benchmark-survey-findings/ (Note: vendor-sponsored survey; treat as industry benchmark, not independent research)
[22]
AACT Research. (2025). Automated Alert Classification and Triage (AACT): An Intelligent System for the Prioritisation of Cybersecurity Alerts. arXiv:2505.09843. https://arxiv.org/html/2505.09843v1
[23]
Stanford HAI. (2024). AI Index Report 2024 — Economy Chapter. Stanford Human-Centered Artificial Intelligence. https://hai.stanford.edu/ai-index/2024-ai-index-report/economy
[24]
Jobs and Skills Australia. (2024). Cyber Security Skills in Demand as Labour Market Evolves. https://www.jobsandskills.gov.au/news/cyber-security-skills-demand-labour-market-evolves
[25]
ISACA. (2024). Cybersecurity Teams Can’t Keep Up with Growing Levels of Cyberattacks. Press Release. https://www.isaca.org/about-us/newsroom/press-releases/2024/cybersecurity-teams-cant-keep-up-with-growing-levels-of-cyberattacks-new-research-reveals
[26]
OECD. (2024). The Impact of Artificial Intelligence on Productivity, Distribution and Growth. OECD. https://www.oecd.org/en/publications/2024/04/the-impact-of-artificial-intelligence-on-productivity-distribution-and-growth_d54e2842.html
[27]
Australian Government. (2024). State of the Service Report 2023–24. Australian Public Service Commission. https://www.apsc.gov.au/state-of-the-service
[28]
World Economic Forum. (2024). Global Cybersecurity Outlook 2024. WEF. https://www3.weforum.org/docs/WEF_Global_Cybersecurity_Outlook_2024.pdf
[29]
Kaseya. (2024). 2024 MSP Benchmark Survey Report. Kaseya. https://www.kaseya.com/wp-content/uploads/dlm_uploads/2024/03/Whitepaper-2024-MSP-Benchmark-Survey_Kaseya.pdf (Note: vendor-sponsored)
[30]
CSIS. (Various). Net Losses: Estimating the Global Cost of Cybercrime. Center for Strategic and International Studies. https://www.csis.org/analysis/net-losses-estimating-global-cost-cybercrime
Research Disclaimer. This paper was prepared by Michael Bertamini, CEO, Athena Agentic, June 2026. It does not constitute legal, financial, or investment advice. Statistics and findings are presented with appropriate caveats reflecting the actual quality of available evidence; they should not be extracted and used without the full context and limitations stated herein. All vendor-sponsored research has been excluded from primary claims. Medium-confidence claims are directional indicators only and require customer-specific validation before application to specific investment or procurement decisions. Athena Agentic has a commercial interest in the adoption of agentic security platforms; this interest has been managed by excluding Athena-specific performance claims and relying exclusively on independent third-party sources throughout.