Monitoring Physical Disks

The S.M.A.R.T. status of physical disks is monitored by the smartctl tool installed along with Virtuozzo. The tool is run every 5 minutes. The smartctl tool polls all physical disks attached to Hardware Nodes in the cluster, including caching and journaling SSDs, and reports the results to the MDS server.

For the tool to work, enable the S.M.A.R.T. functionality in Node’s BIOS.

You can view disk poll results for the last 10 minutes in the output of the vstorage top command. For example:

image

If the SMART warning message is shown in the main table, one of the physical disks is in pre-failure condition according to S.M.A.R.T. Press d to switch to the disks table to see more details. For example:

image

The disks table shows the following parameters:

ParameterDescription
DISKDisk name assigned by operating system.
SMART

Disk's S.M.A.R.T. status:

  • OK: The disk is healthy.
  • Warn: The disk is in pre-failure condition.

Pre-failure condition means that at least one of these S.M.A.R.T. counters is nonzero:

  • Reallocated Sector Count
  • Reallocated Event Count
  • Current Pending Sector Count
  • Offline Uncorrectable
TEMPDisk temperature in Celsius.
CAPACITYDisk capacity.
SERIALDisk serial number.
MODELDisk model.
HOSTDisk's host address.

To disable S.M.A.R.T. disk monitoring, stop and disable the corresponding systemd units: vstorage-send-diskinfo.service and vstorage-send-diskinfo.timer.