关于通过ipmi命令查询硬盘故障灯的问题

当前在调用ipmi命令点亮nvme盘故障灯时,出现了物理场景实际点亮,但是通过GetPhysicalDriveLedStatus的ipmi命令查询时,发现查询到的故障灯状态为0(熄灭)。
反复尝试后发现通过ipmi命令和web方式点亮定位灯后,都可以正常查询定位灯,且状态为1,但是点亮故障灯后,查询结果始终为0。目前推测是sr中故障灯寄存器地址配置可能存在问题,导致无法获取正确的故障灯状态,想要求助确认寄存器地址以及下一步的定位手段。
当前环境硬盘背板vendor/openUBMC/14100665_00000001030302023938.sr

"Drive_4": {
      "Id": 3,
      "Name": "Disk3",
      "PhysicalLocation": "HDD Plane",
      "NodeId": "HDDPlaneDisk3",
      "Presence": "<=/Scanner_Drive3PresentAccessor.Status;<=/Scanner_Drive3PresentAccessor.Value |> expr($1 != 0 ? 255 : $2)",
      "LocateLed": "<=/Scanner_Drive3LocateAccessor.Value",
      "FaultLed": "<=/Scanner_Drive3FalutAccessor.Value",
      "ActivationLed": "<=/Scanner_Drive3ActivationAccessor.Value",
      "SetLocateLed": "#/Accessor_Drive3LocateAccessor.Value",
      "SetFaultLed": "#/Accessor_Drive3FaultAccessor.Value",
      "TemperatureCelsius": 255,
      "Missing": 0,
      "Health": "<=/Component_Drive3.Health",
      "RebuildState": 0,
      "FirmwareStatus": 255,
      "PredictiveFailure": 0,
      "InAFailedArray": 0,
      "FirmwareStatusError": false,
      "SerialNumber": "",
      "RelativeSlot": 3,
      "@Default": {
        "PredictedMediaLifeLeftPercent": 255
      },
      "PassThrough": "<=/Scanner_Drive4PCIePresence.Value;<=/Scanner_Drive4PCIeType.Value|> expr(($1 == 1) && ($2 == 1))",
      "HddBackplaneStartSlot": "<=/HddBackplane_1.StartSlot",
      "Type": "SATA/SAS",
      "IODeteriorationHealthCode": 0
    },
    "DiscreteSensor_Disk3": {
      "AssertMask": 199,
      "DeassertMask": 199,
      "DiscreteMask": 199
    },
...
    "Scanner_Drive4LocateAccessor": {
      "Chip": "#/Smc_EnclSMC",
      "Offset": 335545605,
      "Size": 2,
      "Mask": 3072,
      "Type": 0,
      "Period": 2000,
      "Debounce": "None",
      "Value": 0
    },
    "Scanner_Drive4FalutAccessor": {
      "Chip": "#/Smc_EnclSMC",
      "Offset": 335545605,
      "Size": 2,
      "Mask": 768,
      "Type": 0,
      "Period": 2000,
      "Debounce": "None",
      "Value": 0
    },

问题排除,当前定位原因为硬盘背板的CPLD存在问题,解决方式为升级新的硬盘背板的固件版本。
升级版本并且生效后可以通过ipmi正常查询故障灯状态。

1 个赞