09/22/2025 | Press release | Distributed by Public on 09/23/2025 08:07
A couple of our recent blogs have shared findings from a survey administered at Cisco Live 2025 with 319 IT professionals participating. The first was about the top troubleshooting surprises we learned from the survey results, and the second shared details on how observability can help reduce mean time to resolution (MTTR).
In this blog post, we build on the survey findings with additional insights and key takeaways you can use to guide your observability strategy.
IT system disruptions, technical glitches, and network problems happen every day around the world; some we've highlighted in recent blog posts, and others are emerging now:
These examples highlight the fact that no company is immune from unplanned, unexpected network disruptions.
Our blog post regarding MTTR revealed that problem resolution for 80 percent of respondents from large companies took a few hours or more. In fact, 25 percent shared that took from a day to a week to restore service. And this was simply too long.
Mean time to knowledge (MTTK) is one of four stages in the MTTR process. So, we asked the Cisco attendees participating in our survey what percentage of the total time was spent on that stage. Not surprisingly, it was significant.
For companies with 10,000 or more employees, more than 45 percent indicated that the MTTK stage consumed between 26 and 50 percent of the overall time; about a quarter of the respondents said it took them 51 to 75 percent of the total time; and nearly 9 percent believed it took more than 75 percent of the troubleshooting time (see Figure 1).
Figure 1: Respondents in companies with 10,000 or more employees answered the question regarding what percentage of time was spent on MTTK stage of the troubleshooting process at their company.
Keep in mind, these IT professionals have already noted that troubleshooting is often lengthy, with 80 percent saying it takes their organization a few hours or more.
The increased complexity of today's infrastructure has been due, in part, to the explosion in the number of vendors involved in an enterprise network. This has compounded the challenges of problem isolation and resolution when issues occur. A long-used technique when quick resolution eludes an IT organization is to engage a war room that brings together multiple IT team members along with vendors involved in the affected service delivery.
Respondents in companies with 10,000 or more employees indicated that the majority of their organizations (73 percent) initiate their war rooms within the first day of an unresolved problem (see Figure 2). War rooms have been criticized for ineffectiveness, protracted troubleshooting, and lack of a single source of information for all participants to use for troubleshooting. Despite this, it was surprising that the survey found only 3.5 percent of participants reported their companies had discontinued use of war rooms as a troubleshooting tool.
Figure 2: The majority of respondents in companies with 10,000 or more employees (73 percent) initiate war rooms for troubleshooting within the first day.
The survey included questions on data sources in use by the participants' organizations. Metrics, events, logs, and traces (MELT) was the No. 1 data source as identified by 42.1 percent of the participants. This was followed by packet captures/pcaps (26.3 percent), flow data/e.g., NetFlow (19.3 percent), and deep packet inspection (DPI), at 9.6 percent.
Because there is a difference in what is used for troubleshooting and what IT professionals would like to use, the survey asked respondents if it was important/valuable to leverage DPI in their observability process. Analysis of the responses showed that participants in companies with 10,000 or more employees had a significantly higher perceived value, with 41.6 percent responding that DPI is very important/valuable as part of their observability process (see Figure 3).
Figure 3: The majority of respondents in companies with 10,000 or more employees (83 percent) believe DPI is important or very important in understanding issues better and solving problems faster.
The survey did not directly ask the participants why they thought DPI was very important in their observability process. However, there could be several reasons that contributed to this input.
As a result, DPI proves more valuable in larger organizations, where complex infrastructures, numerous vendors and tools, and a growing number of applications and services mean greater risks to productivity, revenue, costs, customer service, and reputation when network disruptions take longer to identify and resolve.
In fact, we highlighted recent research by Enterprise Management Associates (EMA) in its April 2025 study "Enterprise Strategies for Hybrid, Multi-Cloud Networks" that touched on what data was used and what was needed by IT teams. EMA's research revealed that 49 percent of IT professionals in hybrid, multicloud environments view network flow data as critical for monitoring, troubleshooting, and optimizing their cloud networks. Another 38 percent identified packet data as equally important. An interesting revelation in this study was that only 29 percent of the respondents reported being fully satisfied with their current monitoring tools.
Because this blog concludes the information revealed in the survey conducted at Cisco Live 2025, we wanted to close with a review of the revelations and establish some actions that might be taken in light of the implications from these findings. Some key results include the following:
Yet when asked about the role of DPI in their observability process, employees in large organizations reported it as important or very important, with 83.2 percent affirming its value. Ways to impact early warning identification (mean time to identify, or MTTI) and rapid troubleshooting with DPI (MTTK) were identified as essential actions that can dramatically reduce MTTR. Given that the current processes and outcomes leveraging legacy tools and incomplete datasets are less than optimal, isn't it time to give NETSCOUT nGenius solutions for observability a try?
To learn more about observability strategies, read "Comprehensive Observability Dramatically Improves Troubleshooting Time."