Apache Ambari Issues
Troubleshoot Apache Ambari issues for Big Data Service clusters.
Ambari Showing Already Removed DataNode as Dead in HDFS Summary and Host Level Health Alarm Opened, Causing LCM Operations to Fail
Troubleshoot a removed DataNode causing LCM operations failure in Ambari for a Big Data Service cluster.
After decommissioning and deleting a DataNode
on a host manually, Ambari shows a critical alert with the message, DataNode Health Summary [live='x' stale='0' dead='1']
. Because of the host level alarm, it's considered as unhealthy and causes LCM operations failure.
NameNode
doesn't clean up the dead DataNode
by itself. Cleanup the Dead node manually: