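Problem description
We are adapting the SF400T NIC in the Component_drivers component. With the package first built from Component_drivers version 1.1.42, the NIC temperature cannot be read, fan speed control enters abnormal speed regulation, and a temperature acquisition failure alarm is raised. app.log is flooded with the following messages: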
2026-05-20 13:53:21.003617 devmon ERROR: ncsi_over_mctp_wx.cpp(105): Error: failed to send NCSI get MAC address command [repeated 242 times in 301s from 2026-05-20 13:48:20.934307 to 2026-05-20 13:53:21.003617]
2026-05-20 13:53:21.003908 devmon ERROR: ncsi_over_mctp_wx.cpp(227): Error: failed to send NCSI get LLDP status command [repeated 242 times in 301s from 2026-05-20 13:48:20.934788 to 2026-05-20 13:53:21.003908]
2026-05-20 13:54:21.066996 devmon ERROR: ncsi_over_mctp_wx.cpp(72): Error: failed to send NCSI get temperature command [repeated 61 times in 305s from 2026-05-20 13:49:16.005904 to 2026-05-20 13:54:21.066996]
2026-05-20 13:54:21.067378 devmon ERROR: ncsi_over_mctp_wx.cpp(131): Error: failed to send NCSI get serial number command [repeated 61 times in 305s from 2026-05-20 13:49:16.006238 to 2026-05-20 13:54:21.067378]
2026-05-20 13:58:26.061286 devmon ERROR: ncsi_over_mctp_wx.cpp(105): Error: failed to send NCSI get MAC address command [repeated 242 times in 305s from 2026-05-20 13:53:21.003617 to 2026-05-20 13:58:26.061286]
2026-05-20 13:58:26.061768 devmon ERROR: ncsi_over_mctp_wx.cpp(227): Error: failed to send NCSI get LLDP status command [repeated 242 times in 305s from 2026-05-20 13:53:21.003908 to 2026-05-20 13:58:26.061768]
2026-05-20 13:59:26.129548 devmon ERROR: ncsi_over_mctp_wx.cpp(72): Error: failed to send NCSI get temperature command [repeated 61 times in 305s from 2026-05-20 13:54:21.066996 to 2026-05-20 13:59:26.129548]
2026-05-20 13:59:26.129957 devmon ERROR: ncsi_over_mctp_wx.cpp(131): Error: failed to send NCSI get serial number command [repeated 61 times in 305s from 2026-05-20 13:54:21.067378 to 2026-05-20 13:59:26.129957]
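With a package built from the latest Component_drivers version 1.2.147 and the matching libmcpp and devmon components, the NIC temperature still cannot be read. TemperatureStatus is now 2, so neither abnormal speed regulation nor the alarm is triggered. At this point the SMBIOS status reads 3, yet cooling.cpp still logs "smbios not ready".
The log is flooded with: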
2026-05-21 17:21:17.665134 devmon ERROR: mctp.cpp(52): =================mctp::request create_endpoint failed, false [repeated 1000 times in 280s from 2026-05-21 17:16:37.629747 to 2026-05-21 17:21:17.665134]
2026-05-21 17:25:52.764908 devmon ERROR: mctp.cpp(52): =================mctp::request create_endpoint failed, false [repeated 1000 times in 275s from 2026-05-21 17:21:17.665134 to 2026-05-21 17:25:52.764908]
2026-05-21 17:30:32.777440 devmon ERROR: mctp.cpp(52): =================mctp::request create_endpoint failed, false [repeated 1000 times in 280s from 2026-05-21 17:25:52.764908 to 2026-05-21 17:30:32.777440]
2026-05-21 17:32:47.944528 devmon INFO: pcie_function.cpp(44): Error: Failed to update version id
2026-05-21 17:32:52.852998 devmon INFO: network_adapter.cpp(35): Error: Failed to update version id
2026-05-21 17:32:52.943570 devmon INFO: ncsi_over_mctp_wangxun.cpp(132): Error: failed to send NCSI get serial number command
2026-05-21 17:32:52.943750 devmon INFO: pcie_device.cpp(61): Error: Failed to update serial number
2026-05-21 17:32:52.944167 devmon INFO: ncsi_over_mctp_wangxun.cpp(311): Error: failed to send NCSI get_vpd_sn command
2026-05-21 17:32:52.944338 devmon INFO: pcie_device.cpp(78): Error: Failed to get vpd_sn
2026-05-21 17:32:52.944666 devmon INFO: ncsi_over_mctp_wangxun.cpp(343): Error: failed to send NCSI get_vpd_pn command
2026-05-21 17:32:52.944833 devmon INFO: pcie_device.cpp(93): Error: Failed to get_vpd_pn
2026-05-21 17:32:52.945356 devmon INFO: ncsi_over_mctp_wangxun.cpp(73): Error: failed to send NCSI get temperature command
2026-05-21 17:32:52.945508 devmon INFO: cooling.cpp(26): Error: Failed to update chip temp
2026-05-21 17:32:52.945657 devmon INFO: cooling.cpp(29): smbios not ready
2026-05-21 17:32:52.946137 devmon INFO: pcie_function.cpp(44): Error: Failed to update version id
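To compare how often each failure occurs between the two builds, the aggregated "[repeated N times in Ts ...]" suffixes can be totalled per message. Below is a minimal Python sketch that assumes the line layout seen in the excerpts above; the regular expression and the `summarize` helper are illustrative names of my own, not part of devmon or any openUBMC tool.

```python
import re
from collections import defaultdict

# Assumed layout, taken from the excerpts above:
# "<date> <time> <process> <LEVEL>: <file>(<line>): <message> [repeated N times in Ts from ... to ...]"
# The "[repeated ...]" suffix is optional.
LINE_RE = re.compile(
    r"^(?P<ts>\S+ \S+) (?P<proc>\S+) (?P<level>[A-Z]+): "
    r"(?P<src>[\w.]+)\((?P<line>\d+)\): (?P<msg>.*?)"
    r"(?: \[repeated (?P<count>\d+) times in (?P<secs>\d+)s .*\])?$"
)


def summarize(lines):
    """Return {(source file, message): [total occurrences, total seconds covered]}."""
    totals = defaultdict(lambda: [0, 0])
    for raw in lines:
        m = LINE_RE.match(raw.strip())
        if not m:
            continue  # skip lines that do not follow the assumed layout
        key = (m["src"], m["msg"])
        # A line without the suffix counts as a single occurrence.
        totals[key][0] += int(m["count"] or 1)
        totals[key][1] += int(m["secs"] or 0)
    return dict(totals)


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): python summarize_log.py < app.log
    for (src, msg), (count, secs) in summarize(sys.stdin).items():
        rate = f"{count / secs:.2f}/s" if secs else "n/a"
        print(f"{src}: {msg} -> {count} occurrences, rate {rate}")
```

For the mctp.cpp(52) entries above, 1000 repeats in 280 s works out to roughly 3.6 create_endpoint failures per second, which suggests the request is being retried continuously rather than failing once at startup.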
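Question 1: Why can the SF400T temperature not be read? (According to the information provided for our device, the SF400T NIC does not support MCTP (over SMBus) or NCSI.)
Question 2: How do we configure ambient-temperature-based speed regulation in Component_drivers?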