Nutanix alert not cleared in Prism
When I visit a Nutanix customer, I always check the cluster health and if there are no alerts.
One customer had a error that wasn’t acknowledged for more than 45 day’s. And that alert repeated itself every hour resulting in more than 500 critical alerts.
After resolving the issue (rebooting the CVM) I wanted to clear the critical alerts in Prism. The procedure for this is quite simple. Just go to Alerts | mark all Alerts| click Acknowledge and Resolve.
I this case I noticed that the Alerts where not cleared. Although the where Acknowledged and Resolved. Repeating the action did not solved the problem.
The solution was to resolve the Alert through NCLI with the following script:
for alert in `ncli alert ls | grep ID | awk ‘{print $3}’`; do echo “Resolving alert $alert”; ncli alert resolve ids=$alert; sleep 2; done
You may have to run the script multiple times until all Alerts are cleared.
About Michael
Michael Wilmsen is a experienced VMware Architect with more than 20 years in the IT industry. Main focus is VMware vSphere, Horizon View and Hyper Converged with a deep interest into performance and architecture.
Michael is VCDX 210 certified, has been rewarded with the vExpert title from 2011, Nutanix Tech Champion and a Nutanix Platform Professional.