ITSM/MSP Agentic Weekly

Issue #4 June 1, 2026 PagerDuty + AIOps

🟑 New Finds

Spring 2026 LaunchApril 29, 2026

SRE Agent Becomes a Virtual Responder

PagerDuty's SRE Agent evolved from an assistant into a first-line virtual responder. It now handles detection, triage, and initial diagnostics autonomously β€” escalating to humans only when it hits its confidence boundary. This is the closest any major ops platform has come to fully autonomous incident response.

  • Handles detection β†’ triage β†’ initial diagnostics without human intervention
  • Escalates with full context package when it hits a wall
  • Positioned as shift-left prevention, not just response
Agentic SREVirtual ResponderShift-Left
ArchitectureSpring 2026

Multi-Agent Ecosystem via MCP

PagerDuty opened its agent platform through Model Context Protocol (MCP), enabling third-party AI agents to plug into the Operations Cloud as coordinated responders. Think: Datadog agent detects anomaly β†’ PagerDuty agent triages β†’ Slack agent notifies β†’ human validates. All coordinated, not siloed.

  • MCP connectors for agent-to-agent handoffs across vendors
  • Operations Cloud acts as the coordination layer, not just a ticketing system
  • Partners include observability, DevOps, and coding tools
MCPMulti-AgentOpen Protocol
Workflow AutomationApril 28, 2026

5-Step Agentic Workflow Builder

PagerDuty published their framework for automating critical workflows with AI agents: Identify β†’ Design β†’ Build β†’ Test β†’ Deploy. Not just marketing β€” it's a structured onboarding path with real hooks into their platform. Targets the "islands of automation" problem where teams have tools that don't talk to each other.

  • Framework guides teams from manual chaos β†’ autonomous operations
  • Each step maps to PagerDuty product capabilities
  • Designed for non-AI-engineers to configure agents
Workflow BuilderLow-CodeOnboarding
Slack-FirstSpring 2026

Slack-Native Incident Management

Deeper Slack integration means incident response happens where engineers already are. AI agents run diagnostics, pull runbooks, and update status β€” all within Slack threads. No context-switching to a separate dashboard.

  • Full incident lifecycle inside Slack threads
  • AI agents respond directly in-channel
  • Reduces mean-time-to-engage by eliminating tool switching
ChatOpsSlackUX

πŸ” Agentic Incident Lifecycle

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                    AGENTIC INCIDENT LIFECYCLE                     β”‚
β”‚                                                                   β”‚
β”‚  TRIGGERS          AI CLASSIFY        AI TRIAGE                   β”‚
β”‚  ─────────         ───────────        ─────────                   β”‚
β”‚  β€’ Alert fires      β€’ PagerDuty       β€’ SRE Agent                 β”‚
β”‚  β€’ Metric breach      Advance reads     assesses severity         β”‚
β”‚  β€’ Log anomaly        alert payload   β€’ Pulls similar             β”‚
β”‚  β€’ User report      β€’ ML noise          past incidents            β”‚
β”‚  β€’ API webhook        reduction       β€’ Checks affected           β”‚
β”‚                      β€’ Groups related   services                  β”‚
β”‚                        alerts                                     β”‚
β”‚       β”‚                  β”‚                  β”‚                      β”‚
β”‚       β–Ό                  β–Ό                  β–Ό                      β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”       β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”       β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”               β”‚
β”‚  β”‚ INGEST  │──────▢│CLASSIFY │──────▢│  TRIAGE   β”‚               β”‚
β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜       β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜       β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜               β”‚
β”‚                                              β”‚                     β”‚
β”‚                    β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”‚
β”‚                    β”‚                         β”‚              β”‚     β”‚
β”‚                    β–Ό                         β–Ό              β–Ό     β”‚
β”‚             β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”           β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚
β”‚             β”‚AUTO-HEAL  β”‚           β”‚ ESCALATE  β”‚   β”‚SUPPRESS β”‚  β”‚
β”‚             β”‚(Agent     β”‚           β”‚ to Human  β”‚   β”‚(Noise)  β”‚  β”‚
β”‚             β”‚ executes   β”‚           β”‚ w/Context β”‚   β”‚         β”‚  β”‚
β”‚             β”‚ runbook)   β”‚           β”‚ Package   β”‚   β”‚         β”‚  β”‚
β”‚             β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜           β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚
β”‚                    β”‚                         β”‚                     β”‚
β”‚                    β–Ό                         β–Ό                     β”‚
β”‚             β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”           β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”                β”‚
β”‚             β”‚ VERIFY &  β”‚           β”‚ HUMAN     β”‚                β”‚
β”‚             β”‚ CLOSE     β”‚           β”‚ RESOLVES  β”‚                β”‚
β”‚             β”‚(Agent      β”‚           β”‚           β”‚                β”‚
β”‚             β”‚ confirms   β”‚           β”‚           β”‚                β”‚
β”‚             β”‚ fix)       β”‚           β”‚           β”‚                β”‚
β”‚             β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜           β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                β”‚
β”‚                    β”‚                         β”‚                     β”‚
β”‚                    β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                     β”‚
β”‚                              β–Ό                                     β”‚
β”‚                     β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”                              β”‚
β”‚                     β”‚ FEEDBACK LOOP β”‚                              β”‚
β”‚                     β”‚ β€’ Update KB   β”‚                              β”‚
β”‚                     β”‚ β€’ Retrain ML  β”‚                              β”‚
β”‚                     β”‚ β€’ Refine      β”‚                              β”‚
β”‚                     β”‚   runbooks    β”‚                              β”‚
β”‚                     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                              β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
    

Where AI Intervenes vs. Where Humans Take Over

StageAI AgentHumanBoundary Condition
Ingestβœ“ Noise reduction, groupingβ€”Fully automated
Classifyβœ“ ML + LLM classificationβ€”Uncertain β†’ flag for review
Triageβœ“ Severity, impact, similar pastReview if novelConfidence < threshold
Auto-Healβœ“ Runbook executionApprove if destructiveDestructive actions gated
Escalateβœ“ Context package assemblyβœ“ ResolutionHandoff with full context
Verifyβœ“ Health check, automated testsSpot-checkSampling for QA
Closeβœ“ Auto-close with summaryReview summaryPost-mortem triggers
Learnβœ“ KB update, model feedbackβœ“ Post-mortem writeupHuman writes, AI enriches

πŸ’° Pricing Intelligence

PagerDuty AIOps + Advance (as of June 2026)

ProductStarting PriceModelWhat You Get
AIOps $699/mo (annual)
$799/mo (monthly)
Consumption-based ML noise reduction, event grouping, real-time context, event-driven automation
Advance (Gen AI Add-on) $415/mo (annual only) Per-seat add-on AI Agents, generative runbooks, intelligent triage, Slack-native AI, post-incident summaries
Operations Cloud (Bundle) Custom pricing Platform + consumption Incident Mgmt + AIOps + Automation + Customer Service Ops

Key takeaway: Advance requires annual commitment and is add-on only β€” you can't buy it standalone. The AI agents live behind a paywall that assumes you're already deep in the PagerDuty ecosystem. No transparent per-resolution pricing (unlike Atlassian Rovo's $1/resolution model).

How This Compares to Last Week's Pricing

VendorAI Pricing UnitEntry PriceTransparency
Atlassian Rovo (JSM)Per resolution ($1.00)$0 + consumptionβœ… Transparent
ServiceNow Now AssistPer user/month (tiered)Custom quote❌ Opaque
PagerDuty AdvanceFlat add-on ($415/mo)$415/mo + AIOps base⚠️ Partial
Freshworks Freddy AIPer agent/month$49/agent/moβœ… Transparent

PagerDuty's flat add-on model is simpler to budget for than consumption pricing β€” but it also means you pay the same whether your AI agents resolve 10 incidents or 10,000. No incentive alignment on resolution volume.

πŸ”­ Worth Watching

Competitive Positioning: PagerDuty vs. The Field

DimensionPagerDutyServiceNowAtlassian (JSM+Rovo)
Agent Autonomy⭐⭐⭐⭐ SRE Agent as first responder⭐⭐⭐ Now Assist aids humans⭐⭐ Rovo agents, early stage
Multi-Agent Protocol⭐⭐⭐⭐ MCP β€” open, cross-vendor⭐⭐ Proprietary only⭐⭐ MCP connectors, but ecosystem-locked
ITSM Depth⭐⭐ Incident-only, no ITSM⭐⭐⭐⭐⭐ Full ITIL suite⭐⭐⭐⭐ JSM covers ITSM well
MSP Readiness⭐⭐ Multi-tenant, but no PSA⭐⭐⭐ MSP partner program⭐ No native MSP features
Pricing Clarity⭐⭐⭐ Flat add-on, partial⭐ Opaque enterprise⭐⭐⭐⭐ Per-resolution, transparent
Slack/DevOps Native⭐⭐⭐⭐⭐ Best in class⭐⭐ Separate module⭐⭐⭐ Good, not ops-first

PagerDuty's Architecture: Operations Cloud as Coordination Layer

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚                  OPERATIONS CLOUD                          β”‚
β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚
β”‚  β”‚Incident β”‚  β”‚ AIOps   β”‚  β”‚Automationβ”‚  β”‚Customer   β”‚  β”‚
β”‚  β”‚Mgmt     β”‚  β”‚         β”‚  β”‚          β”‚  β”‚Service Opsβ”‚  β”‚
β”‚  β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜  β”‚
β”‚       β”‚            β”‚            β”‚               β”‚         β”‚
β”‚       β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜         β”‚
β”‚                    β”‚            β”‚                          β”‚
β”‚            β”Œβ”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”                  β”‚
β”‚            β”‚     PAGERDUTY ADVANCE      β”‚                  β”‚
β”‚            β”‚  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”  β”‚                  β”‚
β”‚            β”‚  β”‚     AI AGENTS        β”‚  β”‚                  β”‚
β”‚            β”‚  β”‚  β€’ SRE Agent         β”‚  β”‚                  β”‚
β”‚            β”‚  β”‚  β€’ Triage Agent      β”‚  β”‚                  β”‚
β”‚            β”‚  β”‚  β€’ Diagnostics Agent β”‚  β”‚                  β”‚
β”‚            β”‚  β”‚  β€’ Runbook Agent     β”‚  β”‚                  β”‚
β”‚            β”‚  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜  β”‚                  β”‚
β”‚            β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                  β”‚
β”‚                          β”‚                                  β”‚
β”‚            β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”                  β”‚
β”‚            β”‚    MCP CONNECTOR LAYER     β”‚                  β”‚
β”‚            β”‚  β”Œβ”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”β”‚                  β”‚
β”‚            β”‚  β”‚Slack β”‚ β”‚GitHubβ”‚ β”‚Data-β”‚β”‚  ... more        β”‚
β”‚            β”‚  β”‚Agent β”‚ β”‚Agent β”‚ β”‚dog  β”‚β”‚  partners        β”‚
β”‚            β”‚  β””β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”˜β”‚                  β”‚
β”‚            β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                  β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

  THIRD-PARTY AGENTS (MCP)
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”
  β”‚Datadog β”‚  β”‚  New   β”‚  β”‚Custom  β”‚
  β”‚  AI    β”‚  β”‚ Relic  β”‚  β”‚ Agent  β”‚
  β””β”€β”€β”€β”€β”¬β”€β”€β”€β”˜  β””β”€β”€β”€β”¬β”€β”€β”€β”€β”˜  β””β”€β”€β”€β”¬β”€β”€β”€β”€β”˜
       β”‚          β”‚           β”‚
       β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                  β”‚
          [MCP Protocol]
                  β”‚
        Operations Cloud Coordination
      

The MCP layer is the big bet: PagerDuty doesn't need to build every agent. They want to be the coordination fabric that routes work between specialized agents from any vendor. If this takes off, they become the "Kubernetes of operational AI."

πŸ› οΈ MSP Relevance Assessment

Where PagerDuty Fits (and Doesn't) for MSPs

MSP NeedPagerDuty FitGap
Multi-tenant alert routingβœ… Teams + escalation policiesNo native client-separation billing
Automated NOCβœ… SRE Agent as first responderRequires runbook investment
Client-facing AI⚠️ Customer Service Ops existsNot MSP-grade yet
PSA integration❌ None nativeConnectWise/Autotask missing
RMM alert ingestionβœ… Via webhooks + 700+ integrationsRequires custom mapping
Per-client billing❌ Not designed for MSP modelFlat per-seat pricing doesn't align
Co-managed IT⚠️ Possible via teamsNo co-managed workflow templates

Bottom line: PagerDuty is ops-native, not MSP-native. The SRE Agent is genuinely impressive for internal DevOps teams. But MSPs need PSA integration, client billing, and co-managed workflows β€” none of which PagerDuty has prioritized. Not a drop-in MSP solution.

πŸ“‹ Action Items

Next Research Candidates

  • Issue #5 (June 8): ConnectWise + RMM AI β€” the MSP-native angle. How does the dominant PSA/RMM vendor approach agentic AI?
  • Issue #6 (June 15): Datadog AIOps/Watchdog β€” observability-first agentic AI, natural complement to PagerDuty
  • Backlog candidates: Atomicwork (emerging ITSM AI), NinjaOne (AI Co-Pilot for MSPs), HaloPSA (AI features)

Design Takeaways for SuperOps

  • PagerDuty's MCP-based multi-agent protocol is the standard to watch β€” if you're building an AI service desk, MCP compatibility should be on the roadmap
  • The "virtual responder" pattern (agent as first line, human as escalation) is the UX model to steal β€” it's cleaner than chatbot-in-a-widget
  • PagerDuty's flat add-on pricing ($415/mo) vs Atlassian's consumption ($1/resolution) β€” two competing models, worth tracking which wins
  • Slack-native incident management is the bar now β€” if your AI doesn't work where engineers live, it's friction