This is a brief post to share the CLI commands and sample output for a quick VPLEX health check. Our VPLEX had a dial home event and below are the commands that EMC ran to verify that it was healthy. Here is the dial home event that was generated:
SymptomCode: 0x8a266032
SymptomCode: 0x8a34601a
Category: Status
Severity: Error
Status: Failed
Component: CLUSTER
ComponentID: director-1-1-A
SubComponent: stdf
CallHome: Yes
FirstTime: 2014-11-14T11:20:11.008Z
LastTime: 2014-11-14T11:20:11.008Z
CDATA: Compare and Write cache transaction submit failed, status 1 [Versions:MS{D30.60.0.3.0, D30.0.0.112, D30.60.0.3}, Director{6.1.202.1.0}, ClusterWitnessServer{unknown}] RCA: The attempt to start a cache transaction for a Scsi Compare and Write command failed. Remedy: Contact EMC Customer Support.
Description: The processing of a Scsi Com pare and Write command could not complete.
ClusterID: cluster-1
Based on that error the commands below were run to make sure the cluster was healthy.
This is the general health check command:
VPlexcli:/> health-check Product Version: 5.3.0.00.00.10 Product Type: Local Hardware Type: VS2 Cluster Size: 2 engines Cluster TLA: cluster-1: FNM00141800023 Clusters: --------- Cluster Cluster Oper Health Connected Expelled Local-com Name ID State State --------- ------- ----- ------ --------- -------- --------- cluster-1 1 ok ok True False ok Meta Data: ---------- Cluster Volume Volume Oper Health Active Name Name Type State State --------- ------------------------------- ----------- ----- ------ ------ cluster-1 c1_meta_backup_2014Nov21_100107 meta-volume ok ok False cluster-1 c1_meta_backup_2014Nov20_100107 meta-volume ok ok False cluster-1 c1_meta meta-volume ok ok True Director Firmware Uptime: ------------------------- Director Firmware Uptime -------------- ------------------------------------------ director-1-1-A 147 days, 16 hours, 15 minutes, 29 seconds director-1-1-B 147 days, 15 hours, 58 minutes, 3 seconds director-1-2-A 147 days, 15 hours, 52 minutes, 15 seconds director-1-2-B 147 days, 15 hours, 53 minutes, 37 seconds Director OS Uptime: ------------------- Director OS Uptime -------------- --------------------------- director-1-1-A 12:49pm up 147 days 16:09 director-1-1-B 12:49pm up 147 days 16:09 director-1-2-A 12:49pm up 147 days 16:09 director-1-2-B 12:49pm up 147 days 16:09 Inter-director Management Connectivity: --------------------------------------- Director Checking Connectivity Enabled -------------- -------- ------------ director-1-1-A Yes Healthy director-1-1-B Yes Healthy director-1-2-A Yes Healthy director-1-2-B Yes Healthy Front End: ---------- Cluster Total Unhealthy Total Total Total Total Name Storage Storage Registered Ports Exported ITLs Views Views Initiators Volumes --------- ------- --------- ---------- ----- -------- ----- cluster-1 56 0 299 16 353 9802 Storage: -------- Cluster Total Unhealthy Total Unhealthy Total Unhealthy No Not visible With Name Storage Storage Virtual Virtual Dist Dist Dual from Unsupported Volumes Volumes Volumes Volumes Devs Devs Paths All Dirs # of Paths --------- ------- --------- ------- --------- ----- --------- ----- ----------- ----------- cluster-1 203 0 199 0 0 0 0 0 0 Consistency Groups: ------------------- Cluster Total Unhealthy Total Unhealthy Name Synchronous Synchronous Asynchronous Asynchronous Groups Groups Groups Groups --------- ----------- ----------- ------------ ------------ cluster-1 0 0 0 0 Cluster Witness: ---------------- Cluster Witness is not configured
This command checks the status of the cluster:
VPlexcli:/> cluster status Cluster cluster-1 operational-status: ok transitioning-indications: transitioning-progress: health-state: ok health-indications: local-com: ok
This command checks the state of the storage volumes:
VPlexcli:/> storage-volume summary Storage-Volume Summary (no tier) ---------------------- -------------------- Health out-of-date 0 storage-volumes 203 unhealthy 0 Vendor DGC 203 Use meta-data 4 used 199 Capacity total 310T