Skip to main content

 

Cisco Meraki Documentation

Dashboard Alerts - Insight

Troubleshooting Guide

This document enlists all the available alert categories under Meraki Insight’s Web Application Performance and the triggers and recommended next steps to resolve the application issue on the network.

The root cause currently covers the WAN problems detected by passive analysis of application flows on the web app health WAN problems 

Application Performance Degradation

If no root cause for an issue has been found, the alert will show as Performance Degradation.

Triggers

Web App Health goodput or response time data for a particular application and network drops below whatever threshold the user has set (either smart thresholds or manually set).


ISP Issue

Triggers

Goodput on the WAN for passively collected flows per application and network drops below whatever threshold the user has set (either smart thresholds or manually set).

Uplink usage is less than 80% of the bandwidth defined on the SD-WAN & Traffic shaping page

Goodput of actively collected ICMP ping data for the alerting uplink drops below 2.5KB

Troubleshooting steps

  • Check to see if you have configured the bandwidth on the SD-WAN & Traffic shaping page (link to the SD-WAN and traffic shaping page)

  • Check Bandwidth restrictions by referring to the Global Bandwidth limit considerations KB article 

  • Look at the event logs to notice if there are continuous VPN registry changes and refer to the Site-to-Site VPN troubleshooting doc or Meraki Auto VPN -Configuration and Troubleshooting 

  • Look at network status on the WAN Health page, or Uplink Tab on Appliance status page to check loss and latency reported from ping data

  • Look at the Meraki Insight Web App health drill down page to check passive goodput


High Latency Over VPN

Triggers

Goodput on the WAN for passively collected flows per application and network drops below whatever threshold the user has set (either smart thresholds or manually set).

The flow goes over VPN and there is latency detected on the WAN for this application only, then this alert is triggered. There is a check for other major problems/outages  considering this root cause High confidence

Troubleshooting steps

  • Check the WAN health portfolio in Meraki insight to see if the overall network is having high latency.

  • Check VPN status page to see usage information for any bandwidth throttling


Uplink Saturation

Triggers

Goodput on the WAN for passively collected flows per application and network drops below whatever threshold the user has set (either smart thresholds or manually set).

If the uplink usage is greater than 80% (as configured on the SD-WAN & Traffic shaping page) then there is High confidence that the application performance is suffering due to the uplink being saturated.

Troubleshooting steps

  • Check to see if you have configured the bandwidth on the SD-WAN & Traffic shaping page[insert SD-WAN and traffic shaping page dash link]

  • Check Bandwidth restrictions by referring to the Global Bandwidth limit considerations KB article 

  • Check WAN health Page to see if the given network shows a high usage alert on the availability graph

  • if a certain user is utilizing more than the required bandwidth, refer to Creating and Applying group Policies to add policies to restrict or limit bandwidth utilized


Dual Active MXes

Triggers

When there are unexpected numbers of VRRP transition events on the event log for primary and spare MX that can hamper the flows of the application to be interrupted or cause loss, this alert is triggered. There is also a check to see if the WAN overall is seeing a loss,

If yes, then this alert is triggered with High confidence that the reason for the issue is due to VRRP transitions and high WAN loss

If no, then this alert is triggered with medium confidence that this might only be due to the VRRP transitions 

Troubleshooting steps


Traffic Shaping Rule Saturation

Triggers

Goodput on the WAN for passively collected flows per application and network drops below whatever threshold the user has set (either smart thresholds or manually set).

The app configured on Web App Health has a traffic shaping rule with a bandwidth limit applied to it under the SD-WAN and Traffic shaping page

Usage of the app is >80% of the configured bandwidth limit as configured by traffic shaping

Troubleshooting steps

  • Check the Traffic Shaping rule section under the SD-WAN and Traffic shaping page to increase the limit on the bandwidth provided to the application

  • Refer to Traffic and bandwidth Shaping for more information on limitations that can be applied to application traffic

 


Sticky Client

Triggers

If poor performance detected between SSID and client connection due to suboptimal AP selection this alert will be triggered. Client devices choose which AP to connect to. Meraki APs cannot force a client to choose a particular AP. 

Troubleshooting Steps

  • Try to force the client to re-select a more optimal AP by having the client disassociate and reassociate.

  1. To learn more about setting to avoid sticky client issues please refer to this documentation. 


MAC Address Flap Anomaly

Overview

MAC flap event is triggered when a MAC address is learned 3 times or more on 2 or more different ports within 10 seconds. Sometimes these events can become noisy due to wireless roaming which is expected. Due to this noise, it is hard to tell which events are expected vs unexpected. Networks with APs in bridge mode are susceptible to this issue.

It is difficult to separate and identify unexpected events. To solve this problem, Cisco Meraki built a machine-learning algorithm to detect these unexpected events more accurately.

Trigger

The machine learning algorithm looks at the past 35 days of hourly event counts and creates hourly thresholds for the next 7 days. It monitors the real-time hourly event counts, and if the real-time hourly count breaches the expected threshold, it will create an alert - MAC flap anomaly detected on “SWITCHNAME” from “TIMESTAMP”.

The alert will only resolve itself if the algorithm detects the real-time hourly counts goes below the threshold, which happens at the top of the hour from the time of anomaly detection otherwise, it will persist.

Troubleshooting steps

  1. Make sure there is no loop in the network. 
  2. Follow the high-density Wi-Fi deployment guidelines to make sure the network meets the optimal network criteria. 
  3. Sometimes a bad WiFi adapter on a client device can cause frequent connection failure and cause frequent roaming.

ARP Failure

Trigger: 

Large number of clients experiencing issues with the ARP protocol/gateway, gateway timeout, excessive ARP requests

Troubleshooting Steps: 

When you see an ARP Failure alert, be sure to check that the gateway is online and reachable. Also make sure that there is no congestion in the network.

 


DFS Event Pattern

Trigger: 

A DFS event pattern has been recognized on a channel in use on the wireless network.

Troubleshooting Steps: 

No manual steps as remediation is a regulated action. APs in a network will switch to the next best channel (disconnecting all clients on DFS channels) as per regulation requirements, which would be a non-DFS channel and then a DFS channel.

 


High VoIP Jitter

Trigger:

This alert is triggered when the variance in latency between tests for a specified WAN connection is higher than configured during the specified window to the specified VoIP server

Troubleshooting Steps:

Investigate as deemed necessary

 


Low VoIP MOS

Trigger:

This alert is triggered when the MOS score to a specified VoIP server for a specified WAN connection is lower for the specified window of time

Troubleshooting Steps:

Investigate as deemed necessary

 


High VoIP Packet Loss

Trigger:

This alert is triggered when a configured WAN uplink has packet loss that surpasses the configured loss rate over the configured period of time when testing to the configured VoIP server.

Troubleshooting Steps:

Investigate as deemed necessary

 


High WAN Latency

Trigger:

This alert is triggered when a configured WAN uplink has packet latency that surpasses the configured latency over the configured period of time.

Troubleshooting Steps:

Investigate as deemed necessary

 


High WAN Packet Loss

Trigger:

This alert is triggered when a configured WAN uplink has packet loss that surpasses the configured loss rate over the configured period of time.

Troubleshooting Steps:

Investigate as deemed necessary

 


High WAN Usage

Trigger:

This alert is triggered when network usage exceeds the specified data usage over the specified period of time

Troubleshooting Steps:

Investigate the end devices that participated in this higher utilization during the period if necessary

 


WAN Status Changed

Trigger:

This alert is triggered when the configured WAN connection has been unplugged or is not marked as a valid uplink.

Troubleshooting Steps:

If this is not expected, then investigate the WAN connection/s that alerted

 


Insight Web App Alert "Name of Application"

(Header can be set dynamically based on what app is experiencing outage)

Trigger:

When a custom monitored application triggers a threshold/event that is deemed necessary to alert

Troubleshooting Steps:

Begin troubleshooting within application environment