This report is only available on Opsgenie’s Enterprise plan.
An infrastructure may consist of hundreds or thousands of services. Infrastructure health reporting conveys the health of each piece and highlights problematic components or services, so that you can focus improvement or removal efforts where they are needed. These reports give a comprehensive look at the health of a large part of the organization, especially if your organization is very service-focused. They support business decisions by showing how frequently incidents occur for a particular service, and how business-impacting or easy to solve those incidents are.
This dashboard offers a broad overview of services, their related incidents, the relevant response efforts, as well as detailed information about the incidents and the handlers who were notified or responded.
Watch this video for an overview of infrastructure health reports for all on-call teams.
Filters
You can filter the charts by service, status, priority, and date.
Charts
Incidents per service
This shows the number of incidents that occurred in each of a team's services over the specified date range.
Incidents by date
This shows the number of incidents over the specified date range. Incidents that occur close together are grouped into a single circle.
Incidents by status
The total number of incidents per service grouped by status.
Incidents by priority
The total number of incidents grouped by their priority level.
Daily mean time to resolve and close incidents
This shows the average time taken to close or resolve incidents, along with the incident count, for each day in the specified date range.
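To make the metric concrete, here is a minimal sketch of how a daily mean time to resolve could be computed from exported incident data. It is not Opsgenie's implementation: the function name `daily_mean_time_to_resolve`, the `(created_at, resolved_at)` tuple layout, and the choice to group by creation date are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime


def daily_mean_time_to_resolve(incidents):
    """Group incidents by creation date (an assumption) and return
    {date: (mean_minutes_to_resolve, incident_count)}."""
    buckets = defaultdict(list)
    for created_at, resolved_at in incidents:
        minutes = (resolved_at - created_at).total_seconds() / 60
        buckets[created_at.date()].append(minutes)
    return {
        day: (sum(values) / len(values), len(values))
        for day, values in sorted(buckets.items())
    }


# Hypothetical sample data: (created_at, resolved_at) pairs.
incidents = [
    (datetime(2024, 1, 1, 9, 0), datetime(2024, 1, 1, 9, 30)),
    (datetime(2024, 1, 1, 14, 0), datetime(2024, 1, 1, 15, 30)),
    (datetime(2024, 1, 2, 8, 0), datetime(2024, 1, 2, 8, 45)),
]

for day, (mean_minutes, count) in daily_mean_time_to_resolve(incidents).items():
    print(day, mean_minutes, count)
```

With this sample, the first day averages two incidents (30 and 90 minutes) and the second day has a single 45-minute incident. The same grouping could be applied to close times instead of resolve times.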
Responder and stakeholder notifications per service
This lists each service and the number of incident notifications sent to its stakeholders.
Number of incidents by tag
The total number of incidents grouped by tags.
Service uptime percentage
This bar graph shows the percentage of time each service was available over the period selected in the date filter. It displays up to 30 services, starting with the service that has the lowest uptime.
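As an illustration of the idea behind this chart, the sketch below derives uptime percentages from a set of outage intervals and returns the lowest-uptime services first. How Opsgenie actually measures downtime is not described here, so the function name `uptime_percentages`, the input layout, and the assumption that outage intervals for a service do not overlap are all hypothetical.

```python
from datetime import datetime


def uptime_percentages(outages, period_start, period_end, limit=30):
    """Return up to `limit` (service, uptime_percent) pairs, lowest uptime first.

    `outages` maps a service name to a list of (start, end) outage intervals.
    Intervals for one service are assumed not to overlap.
    """
    total_seconds = (period_end - period_start).total_seconds()
    rows = []
    for service, intervals in outages.items():
        down_seconds = 0.0
        for start, end in intervals:
            # Count only the part of the outage inside the reporting period.
            clipped_start = max(start, period_start)
            clipped_end = min(end, period_end)
            if clipped_end > clipped_start:
                down_seconds += (clipped_end - clipped_start).total_seconds()
        uptime = round(100 * (1 - down_seconds / total_seconds), 2)
        rows.append((service, uptime))
    rows.sort(key=lambda row: row[1])
    return rows[:limit]


# Hypothetical 10-day reporting period and outage data.
period_start = datetime(2024, 1, 1)
period_end = datetime(2024, 1, 11)
outages = {
    "web": [],
    "api": [(datetime(2024, 1, 3), datetime(2024, 1, 4))],
    "db": [(datetime(2024, 1, 5, 0, 0), datetime(2024, 1, 5, 12, 0))],
}

print(uptime_percentages(outages, period_start, period_end))
```

Sorting in ascending order before truncating mirrors the chart's behavior of showing the least available services first when there are more than 30.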
User response performance
This table lists recipients by name, along with the number of alerts each acknowledged, the number of alerts escalated (yes/no), the number of responder alerts, the number of acknowledged alerts that were escalated, and the mean time to acknowledge.
Team response statistics
This lists team members and their corresponding information: the number of incidents responded to, the number of incidents owned, the percentage of alerts acknowledged, the percentage of alerts escalated, the mean time to respond, and the number of sessions participated in.