• Products
  • Get started
  • Documentation
  • Resources

View global infrastructure health reports

This report is only available on Opsgenie’s Enterprise plan.

Infrastructure may consist of hundreds or thousands of services. Infrastructure Health Reporting conveys the health of each piece, problematic components or services, so focus can be specific for improvements or removals. These reports give a comprehensive look into the health of a huge part of the organization especially if your organization is very service-focused. They provide insight into making (business) decisions based on incident frequency and the data for how business-impacting or easy to solve incidents for a particular service is.

This dashboard offers a broad overview of services, their related incidents, the relevant response efforts, as well as detailed information about the incidents and the handlers who were notified or responded.

Watch this video for an overview of infrastructure health reports for all on-call teams.

Filters

You can filter the charts by service, status, priority, and date.

Charts

  • Incidents per service
    This shows the number of incidents occurred in each service in a team over the specified date range.

  • Incidents by date
    This shows the number of incidents over a specified date range. This clubs nearby incidents in a circle.

  • Incidents by status
    The total number of incidents per service grouped by status.

  • Incidents by priority
    The total number of incidents grouped by their priority level.

  • Daily mean time to resolve and close incidents
    This shows the average time taken to close or resolve incidents along with the incident count based over a date range.

  • Responder and stakeholder notifications per service
    This lists services and their corresponding incident notification count to the stakeholders.

  • Number of incidents by tag
    The total number of incidents grouped by tags.

  • Service uptime percentage
    This bar graph shows the available percentage time for up to 30 services, starting with the service with the lowest uptime, over the period of time selected on the date filter.

  • User response performance
    This table lists the names of recipients along with the number of alerts acknowledged by them, the number of alerts escalated (yes/no), the number of responder alerts, the number of acknowledged alerts that were escalated, and mean time to acknowledge.

  • Team response statistics

    This lists team members and their corresponding information: the number of incidents responded, the number of incidents owned, percentage of alerts acknowledged, percentage of alerts escalated, the mean time to respond, and number of sessions participated.

Additional Help