Data Template: Incident Management
Your Incident Management Data Template
- Recommended attributes to collect
- Key activities to track
- Extraction guidance for ServiceNow Problem Management
Incident Management Attributes
| Name | Description | ||
|---|---|---|---|
|
Activity Name
ActivityName
|
The name of the specific event or task that occurred at a point in time within the incident lifecycle. | ||
|
Description
The Activity Name describes a specific step or status change in the incident management process, such as 'Incident Created', 'Assigned To Agent', or 'Incident Closed'. This data is typically derived from changes in key incident fields like 'State' or 'Assignment Group', or from specific log entries. This attribute is crucial for building the process map. It defines the nodes in the process graph, allowing analysts to visualize the flow of incidents, identify common pathways, discover bottlenecks between activities, and analyze process variants. The granularity and accuracy of these activity names directly impact the quality of the process analysis.
Why it matters
It defines the steps in the process map, which is the foundation for all process mining analysis and visualization.
Where to get
This is a derived attribute, typically generated by the data transformation logic based on changes in fields like
Examples
Incident CreatedAssignment Group ChangedResolution ProposedIncident Closed
|
|||
|
Event Time
EventTime
|
The precise timestamp indicating when the activity occurred. | ||
|
Description
Event Time, often known as the timestamp, records the exact date and time that an activity was completed or a status change occurred. In ServiceNow, this is typically captured in the This attribute is essential for sequencing events correctly and for all time-based analysis. It is used to calculate cycle times, queue times, and durations between activities, which are fundamental for identifying bottlenecks, measuring performance against SLAs, and understanding process efficiency. The accuracy of these timestamps is critical for the validity of any duration-based metrics.
Why it matters
This timestamp orders all activities chronologically and enables the calculation of all duration-based metrics, like cycle times and bottlenecks.
Where to get
ServiceNow
Examples
2023-04-15T10:05:21Z2023-04-15T11:22:00Z2023-04-16T09:00:30Z
|
|||
|
Incident ID
IncidentId
|
The unique identifier for each incident record, serving as the primary key for tracking the entire lifecycle. | ||
|
Description
The Incident ID is the unique reference number assigned to every reported incident in ServiceNow. It acts as the core case identifier that links all related activities, updates, and communications from the moment an incident is created until it is closed. In process mining analysis, this ID is fundamental. It allows the tool to stitch together the sequence of events for each individual case, forming the basis for discovering process maps, analyzing variants, and calculating end-to-end durations. Without a unique Incident ID for each case, it would be impossible to trace the journey of an incident through the resolution process.
Why it matters
This is the essential Case ID that connects all events in an incident's lifecycle, making end-to-end process analysis possible.
Where to get
ServiceNow
Examples
INC0010001INC0010045INC0010239
|
|||
|
Last Data Update
LastDataUpdate
|
The timestamp indicating when the data for this record was last refreshed from the source system. | ||
|
Description
This attribute records the date and time of the most recent data extraction or update from ServiceNow. It is a metadata field that reflects the freshness of the data being analyzed, not an event in the process itself. This information is vital for understanding the timeliness of the analysis. It tells users how current the data is, which is important for operational dashboards and for making decisions based on recent events. It helps manage expectations about the data's relevance and recency.
Why it matters
It informs users about the freshness of the data, which is critical for the relevance and accuracy of the analysis.
Where to get
This timestamp is generated and populated by the data extraction tool or process at the time of the data load.
Examples
2023-10-27T02:00:00Z2023-10-28T02:00:00Z
|
|||
|
Source System
SourceSystem
|
The system from which this data was extracted. | ||
|
Description
This attribute identifies the origin of the incident data, which in this case is ServiceNow Problem Management. It is typically a static value added during the data extraction and transformation process. In environments where data from multiple systems might be combined for analysis, this field is critical for data lineage and segregation. It helps ensure that metrics and processes are analyzed within the correct context and allows analysts to compare processes across different source systems.
Why it matters
It provides crucial context about data origin, ensuring data lineage and enabling correct interpretation in multi-system environments.
Where to get
This is typically a static value added during the data extraction process.
Examples
ServiceNow Problem ManagementServiceNow
|
|||
|
Assigned To
AssignedTo
|
The individual user or agent currently assigned to work on the incident. | ||
|
Description
This attribute identifies the specific support agent responsible for the incident at a given point in time. This information is critical for understanding workload distribution, agent performance, and handoffs between individuals. In analysis, 'Assigned To' helps visualize resource allocation and identify overburdened agents. It's also used in the Handoff and Reassignment dashboard to track how many times an incident changes individual owners, which can be an indicator of inefficiency or knowledge gaps. Analyzing resolution times by agent can also highlight top performers or those needing additional training.
Why it matters
It enables analysis of agent workload, performance, and individual handoffs, which are key to understanding resource efficiency.
Where to get
ServiceNow
Examples
Beth AnglinDavid LooHoward Johnson
|
|||
|
Assignment Group
AssignmentGroup
|
The support team or group responsible for handling the incident. | ||
|
Description
The Assignment Group represents the team of agents tasked with resolving the incident. Incidents are often routed between different groups, such as from a Level 1 help desk to a specialized Level 2 network team. This attribute is essential for analyzing inter-team handoffs and identifying systemic bottlenecks. The 'Handoff and Reassignment Rate' dashboard heavily relies on this data to show which teams are frequently involved in transfers. It also allows for performance comparisons between different support groups and helps in understanding where resolution expertise lies within the organization.
Why it matters
This tracks which team is responsible, enabling analysis of team performance, workload, and handoffs between groups.
Where to get
ServiceNow
Examples
Service DeskNetwork SupportDatabase Administrators
|
|||
|
Incident State
IncidentState
|
The current status of the incident in its lifecycle. | ||
|
Description
The Incident State indicates the incident's current stage, such as 'New', 'In Progress', 'On Hold', or 'Resolved'. State changes are often the primary source for generating activities in the event log for process mining. Analyzing the time spent in each state is a powerful way to identify bottlenecks. For example, a long duration in the 'On Hold' state might indicate dependencies on external factors or users. The sequence of state changes also forms the basis of the process map, showing how incidents progress toward resolution.
Why it matters
It tracks the incident's progress and is key to analyzing time spent in different stages and identifying process bottlenecks.
Where to get
ServiceNow
Examples
NewIn ProgressAwaiting User InfoResolvedClosed
|
|||
|
Priority
Priority
|
The priority level of the incident, which dictates the required urgency of response. | ||
|
Description
Priority is a key field in ServiceNow that determines the order and speed for handling an incident. It is typically derived from the incident's impact and urgency, and it directly influences SLA targets. This attribute is fundamental for segmentation and performance analysis. The 'SLA Compliance Overview' dashboard uses Priority to assess whether high-priority incidents are being resolved within their target times. Analyzing cycle times by priority helps confirm that critical incidents are indeed being processed faster than less important ones. It is a critical dimension for almost every KPI and dashboard.
Why it matters
It allows for segmenting incidents by business importance, which is critical for SLA compliance monitoring and resource allocation.
Where to get
ServiceNow
Examples
1 - Critical2 - High3 - Moderate4 - Low
|
|||
|
Caller
CallerId
|
The user who initially reported the incident. | ||
|
Description
The Caller identifies the end-user or customer who is affected by the incident and reported it. This information provides context on who is being impacted by service disruptions. While not always central to the process flow itself, analyzing incidents by caller can reveal if specific individuals or departments are disproportionately affected by issues. This can point to training needs or localized environmental problems. It also provides a direct link to the customer for satisfaction surveys and communication.
Why it matters
It identifies the affected user, allowing for analysis by department or individual and providing context for user communication.
Where to get
ServiceNow
Examples
Abel TuterCarolina PashDon Goodliffe
|
|||
|
Category
Category
|
The high-level classification of the incident, such as Hardware, Software, or Network. | ||
|
Description
The Category provides a broad classification of an incident's nature. This, often in combination with a subcategory, helps in routing the incident to the correct support team and is used for reporting and trend analysis. For process mining, this attribute is crucial for the 'Incident Categorization Accuracy' and 'Recurring Incident Volume' dashboards. By analyzing incidents where the category is changed mid-process, organizations can identify issues with initial triage. Filtering the process map by category can also reveal if certain types of incidents follow different resolution paths or experience unique bottlenecks.
Why it matters
It enables analysis of incident types, helps measure categorization accuracy, and is vital for routing and trend analysis.
Where to get
ServiceNow
Examples
HardwareSoftwareNetworkDatabase
|
|||
|
Configuration Item
ConfigurationItem
|
The specific IT component, service, or asset affected by the incident. | ||
|
Description
The Configuration Item (CI) is the asset from the Configuration Management Database (CMDB) that is affected by the incident. This could be a server, application, laptop, or network device. Analyzing incidents by CI is extremely valuable for identifying unreliable assets or services. It helps pinpoint which parts of the IT infrastructure are generating the most incidents, which can guide investment in upgrades or replacements. In process mining, filtering by CI can reveal if incidents related to critical applications are handled differently or more efficiently than others.
Why it matters
It identifies the affected asset, helping to pinpoint problematic components in the IT infrastructure and focusing improvement efforts.
Where to get
ServiceNow
Examples
SAP ERP ProductionOracle DB Server 05Email Service
|
|||
|
Cycle Time
CycleTime
|
The total duration from the time an incident was created until it was closed. | ||
|
Description
Cycle Time is a calculated metric representing the end-to-end duration of the incident management process for a single case. It is typically calculated as the difference between the 'Closed' timestamp and the 'Created' timestamp. This is one of the most fundamental KPIs in process mining, directly supporting the 'Incident Resolution Cycle Time' dashboard. Analyzing average cycle time, as well as its distribution, helps organizations measure overall efficiency, set performance baselines, and identify the impact of process changes. Slicing this metric by attributes like priority or category reveals which types of incidents take the longest to resolve.
Why it matters
This core KPI measures the end-to-end efficiency of the process and is used to identify long-running cases and track overall performance.
Where to get
Calculated by subtracting the first event timestamp from the last event timestamp for each Incident ID.
Examples
2592008640001209600
|
|||
|
Is Reopened
IsReopened
|
A flag indicating if an incident was reopened after being resolved. | ||
|
Description
This boolean flag is set to true if an incident's state was changed back to an active one (e.g., 'In Progress') after it had previously reached a 'Resolved' or 'Closed' state. This is typically identified by looking for a 'Incident Reopened' activity in the event log. Reopened incidents are a strong indicator of incomplete or ineffective resolutions. Analyzing these cases helps to identify premature closures or recurring issues that were not properly fixed the first time. A high reopen rate can negatively impact user satisfaction and team productivity, making this a key metric for quality control.
Why it matters
This flag identifies failures in the resolution process, highlighting incidents that required additional work after being considered resolved.
Where to get
Calculated by checking the sequence of activities for each incident to see if an 'open' state follows a 'resolved' state.
Examples
truefalse
|
|||
|
Is SLA Breached
IsSlaBreached
|
A flag that indicates whether the incident resolution exceeded its SLA due date. | ||
|
Description
This is a calculated boolean attribute that flags incidents as having breached their Service Level Agreement. It is derived by comparing the actual resolution timestamp with the 'SLA Due Date'. If the resolution time is after the due date, the flag is set to true. This attribute is the foundation for the 'SLA Compliance Overview' dashboard and related KPIs. It simplifies analysis by converting a complex time comparison into a simple true/false dimension, making it easy to filter for all breached incidents and analyze their common characteristics, such as category, assignment group, or priority.
Why it matters
It simplifies SLA compliance analysis, allowing for easy filtering and deep-dive analysis of all incidents that failed to meet their targets.
Where to get
Calculated by comparing the 'Resolved At' timestamp with the 'SlaDueDate' timestamp. (Resolved At > SlaDueDate).
Examples
truefalse
|
|||
|
Problem ID
ProblemId
|
The identifier of the associated Problem record if the incident is linked to a larger issue. | ||
|
Description
The Problem ID links an incident to a corresponding record in the Problem Management module. This is done when an incident is identified as being a symptom of a larger, underlying issue that affects multiple users or services. This linkage is critical for the 'Recurring Incident Volume' dashboard and the 'Recurring Incident Rate' KPI. It allows analysts to group incidents that stem from the same root cause, measure the full impact of a problem, and track the effectiveness of problem resolution efforts. A high number of incidents linked to problems indicates a reactive support environment.
Why it matters
It links incidents to a root cause, which is essential for analyzing recurring issues and measuring the impact of problem management.
Where to get
ServiceNow
Examples
PRB0040001PRB0040015PRB0040102
|
|||
|
Reassignment Count
ReassignmentCount
|
The number of times the incident has been reassigned to a different group or agent. | ||
|
Description
This field tracks the total number of times an incident has been transferred between different assignment groups. It is a direct measure of process friction and is often used as a key performance indicator. This attribute is the primary driver for the 'Handoff and Reassignment Rate' dashboard and the 'Average Handoffs per Incident' KPI. A high reassignment count often indicates issues such as incorrect initial routing, lack of skills in a support tier, or unclear process ownership. Reducing this count is a common goal for process improvement initiatives as it typically leads to faster resolution times.
Why it matters
This directly measures process handoffs, a key indicator of inefficiency, incorrect routing, and opportunities for improvement.
Where to get
ServiceNow
Examples
0135
|
|||
|
Resolution Code
ResolutionCode
|
A code indicating how the incident was ultimately resolved. | ||
|
Description
The Resolution Code specifies the nature of the solution applied, for instance, whether it was solved by a user, resolved by a known error, or if a workaround was provided. This field is typically filled in by the agent upon closing the incident. This attribute directly supports the 'Resolution Type Effectiveness' dashboard. It allows for analysis of how many incidents are closed with permanent fixes versus temporary workarounds, which is a key indicator of service quality and long-term stability. A high rate of workarounds might suggest underlying problems are not being adequately addressed.
Why it matters
It clarifies the resolution method, enabling analysis of permanent fixes versus temporary workarounds and supporting root cause analysis.
Where to get
ServiceNow
Examples
Solved (Workaround)Solved (Permanently)Not Solved (User Canceled)Known Error
|
|||
|
Severity
Severity
|
The level of business impact caused by the incident. | ||
|
Description
Severity defines how much an incident is impacting business operations. Along with urgency, it is often used to automatically calculate the incident's priority. In analysis, severity is a key dimension for the 'SLA Compliance Overview' dashboard. It helps organizations understand if they are meeting service levels for the most disruptive incidents. It provides a business-centric view of performance, complementing the operational view provided by priority.
Why it matters
It measures the business impact of an incident, providing a crucial dimension for prioritizing efforts and analyzing performance on critical issues.
Where to get
ServiceNow
Examples
1 - High2 - Medium3 - Low
|
|||
|
SLA Due Date
SlaDueDate
|
The target date and time by which the incident is expected to be resolved according to its SLA. | ||
|
Description
The SLA Due Date is a calculated timestamp that represents the resolution deadline for an incident. This date is determined by the Service Level Agreement (SLA) associated with the incident's characteristics, such as its priority. This attribute is essential for the 'SLA Compliance Overview' dashboard and the 'Critical Incident SLA Breach Rate' KPI. It serves as the benchmark against which the actual resolution time is compared. Analyzing incidents that are close to their SLA due date can help with proactive escalation and prioritization.
Why it matters
It defines the resolution target, making it possible to measure SLA compliance and identify incidents at risk of breaching their targets.
Where to get
This value is typically found in the
Examples
2023-05-20T17:00:00Z2023-06-01T09:00:00Z
|
|||
Incident Management Activities
| Activity | Description | ||
|---|---|---|---|
|
Assignment Group Changed
|
Represents a handoff where an incident is transferred from one support group to another. This is captured by observing subsequent changes to the 'assignment_group' field after its initial population. | ||
|
Why it matters
Frequent reassignments can signify incorrect initial routing, process complexity, or knowledge gaps. This activity is critical for measuring the 'Average Handoffs per Incident' KPI.
Where to get
Inferred from the 'sys_audit' table by tracking any change to the 'assignment_group' field after the initial assignment.
Capture
Identify each timestamped change to the 'assignment_group' field in the audit log.
Event type
inferred
|
|||
|
Incident Assigned To Group
|
This activity occurs when an incident is assigned to a specific support group for handling. It is a key step in the routing process and is captured by observing changes to the assignment group field. | ||
|
Why it matters
Tracking assignments is essential for analyzing handoffs, queue times for each group, and identifying routing inefficiencies or bottlenecks.
Where to get
Inferred from the 'sys_audit' table by tracking when the 'assignment_group' field on the 'incident' table is populated or changed.
Capture
Use the timestamp from the audit log for changes to the 'assignment_group' field.
Event type
inferred
|
|||
|
Incident Closed
|
This is the final activity in the lifecycle, signifying that the incident is fully resolved, confirmed, and no further action is needed. The event is captured explicitly via the closure timestamp. | ||
|
Why it matters
As the definitive end event, this activity is essential for calculating the total incident lifecycle duration and analyzing the time taken for post-resolution processing.
Where to get
The 'closed_at' timestamp on the 'incident' table serves as the explicit event timestamp. This is typically set when the 'state' field changes to 'Closed'.
Capture
Use the 'closed_at' timestamp from the incident record.
Event type
explicit
|
|||
|
Incident Created
|
Marks the beginning of the incident lifecycle when a new incident is formally logged in ServiceNow. This event is captured explicitly using the creation timestamp of the incident record. | ||
|
Why it matters
This is the primary start event for the process. Analyzing the time from this activity to resolution is fundamental for measuring overall cycle time and SLA compliance.
Where to get
The 'sys_created_on' timestamp on the 'incident' table serves as the explicit event timestamp for this activity.
Capture
Use the 'sys_created_on' timestamp from the incident record.
Event type
explicit
|
|||
|
Resolution Proposed
|
Marks the point when a support agent has implemented a fix and moved the incident to a 'Resolved' state. This is a key milestone preceding final closure. | ||
|
Why it matters
This activity signals the end of active work and the start of the confirmation phase. The time between this and 'Incident Closed' can reveal delays in user confirmation or verification.
Where to get
Inferred from the 'sys_audit' table when the 'state' field on the 'incident' table changes to 'Resolved'. The 'resolved_at' timestamp is often populated at this time.
Capture
Use the timestamp when the 'state' field becomes 'Resolved' or the 'resolved_at' timestamp.
Event type
inferred
|
|||
|
Work Started
|
Indicates that an agent has begun actively investigating or working on the incident. This is typically inferred when the incident's state changes from 'New' or 'Assigned' to an active state like 'In Progress'. | ||
|
Why it matters
This milestone marks the end of the initial queue time and the beginning of active resolution efforts. Measuring the time until work starts is key for bottleneck analysis.
Where to get
Inferred from the 'sys_audit' table by identifying when the 'state' field on the 'incident' table changes to a value representing active work, such as 'In Progress'.
Capture
Identify the timestamp when the 'state' field changes to 'In Progress' or a similar value.
Event type
inferred
|
|||
|
Awaiting User Confirmation
|
The incident is in a pending state, waiting for the user to confirm that the proposed resolution was successful. This is typically inferred from a specific state like 'Awaiting User Info' after being resolved. | ||
|
Why it matters
This state can be a significant bottleneck if users are slow to respond. Measuring time spent in this activity helps identify communication gaps and opportunities to automate closure.
Where to get
Inferred from the 'sys_audit' table by identifying a change to a specific pending state after resolution. The state name may be custom, such as 'Awaiting Caller'.
Capture
Identify timestamp when 'state' changes to a value indicating a wait for user input.
Event type
inferred
|
|||
|
Incident Assigned To Agent
|
Represents the point at which a specific agent within a support group takes ownership of the incident. This is captured by monitoring changes to the 'assigned_to' field. | ||
|
Why it matters
This provides a granular view of agent workload and first-contact resolution. It helps determine how long incidents wait for an individual to begin work after being assigned to a group.
Where to get
Inferred from the 'sys_audit' table by tracking when the 'assigned_to' field on the 'incident' table is populated or changed.
Capture
Use the timestamp from the audit log for changes to the 'assigned_to' field.
Event type
inferred
|
|||
|
Incident Categorized
|
Represents the initial classification of the incident, where fields like Category, Subcategory, and Priority are set. This event is typically inferred from the audit trail when these fields are first populated or updated shortly after creation. | ||
|
Why it matters
Accurate categorization is crucial for correct routing and prioritization. Tracking this activity helps analyze recategorization rates and their impact on resolution time.
Where to get
Inferred from the 'sys_audit' table by identifying the first change to fields like 'category', 'subcategory', or 'priority' for a given incident.
Capture
Identify the timestamp of the first update to classification fields in the audit log.
Event type
inferred
|
|||
|
Incident Escalated
|
Occurs when the priority or severity of an incident is increased, often requiring a faster response or different resources. This is inferred by detecting an upward change in the 'priority' field's value. | ||
|
Why it matters
Escalations often indicate that an incident is more severe than initially thought or is approaching an SLA breach. Analyzing these events helps in understanding process exceptions.
Where to get
Inferred from the 'sys_audit' table by identifying a change in the 'priority' field to a higher-urgency value.
Capture
Detect when the 'priority' field value increases (e.g., from '3 - Moderate' to '2 - High').
Event type
inferred
|
|||
|
Incident Linked To Problem
|
This activity occurs when an incident is formally associated with a problem record, indicating it's part of a larger, underlying issue. This is inferred when the 'problem_id' field on the incident record is populated. | ||
|
Why it matters
Linking to a problem is a critical step in moving from reactive incident fixing to proactive root cause analysis. It supports the 'Recurring Incident Volume' dashboard.
Where to get
Inferred by detecting when the 'problem_id' reference field on the 'incident' table is populated with a value.
Capture
Identify the timestamp from the audit log when the 'problem_id' field is populated.
Event type
inferred
|
|||
|
Incident Reopened
|
Occurs when a user reports that the issue persists after it was marked as resolved. This is inferred when the incident state moves from 'Resolved' back to an active state like 'In Progress'. | ||
|
Why it matters
Reopened incidents indicate failed resolutions and represent rework. Tracking this activity is vital for measuring resolution quality and first-time fix rates.
Where to get
Inferred from the 'sys_audit' table by detecting a 'state' field transition from 'Resolved' back to an active state like 'In Progress' or 'Assigned'.
Capture
Identify timestamp when 'state' moves from 'Resolved' to an active value.
Event type
inferred
|
|||
|
Support Comment Added
|
A support agent adds a work note or a comment visible to the user. This is an explicit event logged in the incident's activity stream. | ||
|
Why it matters
Tracks communication and investigation efforts by the support team. Analyzing the frequency and timing of these comments provides insight into the investigation process.
Where to get
Captured from the 'sys_journal_field' table, which logs entries for the 'work_notes' and 'comments' fields on the 'incident' table.
Capture
Use the creation timestamp of journal entries where the element is 'work_notes' or 'comments'.
Event type
explicit
|
|||