我现在正在配置PCIE
os下查看四元组如下,系统识别到Dou卡了
但是使用dmidecode从SMBIOS中获取PCIe信息,是获取不到的
一键收集来看也没有四元组的内容
这代表什么,是bios没有上报四元组信息吗
我现在正在配置PCIE
os下查看四元组如下,系统识别到Dou卡了
但是使用dmidecode从SMBIOS中获取PCIe信息,是获取不到的
一键收集来看也没有四元组的内容
这代表什么,是bios没有上报四元组信息吗
请提供下app.log
可以在一键收集的日志中解压 dump_info/OSDump/systemcom.tar 压缩包。观察systemcom.dat是否有厂商id,设备id和bdf信息返回给到bmc:
xkl@SRD-xiaokaili:~/log/dump_info/OSDump/systemcom$ grep "slotid" -in systemcom.dat
2977:SlotId 1 VID 9005,DID 28F
2979:SlotId 3 VID 19E5,DID D802
2981:SlotId 5 VID 19E5,DID D802
2983:SlotId 7 VID 19E5,DID D802
2985:SlotId 9 VID 19E5,DID D802
3081:SlotId 11 VID 19E5,DID D802
3083:SlotId 13 VID 19E5,DID D802
3085:SlotId 15 VID 19E5,DID D802
3087:SlotId 17 VID 19E5,DID D802
3091:[738L]CPU 0 SlotId 1 VID 9005,DID 28F Bus F: Dev 0.Func 0
3092:[738L]CPU 0 SlotId 3 VID 19E5,DID D802 Bus 12: Dev 0.Func 0
3093:[738L]CPU 0 SlotId 5 VID 19E5,DID D802 Bus 13: Dev 0.Func 0
3094:[738L]CPU 0 SlotId 7 VID 19E5,DID D802 Bus C: Dev 0.Func 0
3095:[738L]CPU 0 SlotId 9 VID 19E5,DID D802 Bus 9: Dev 0.Func 0
3096:[738L]CPU 1 SlotId 11 VID 19E5,DID D802 Bus 8E: Dev 0.Func 0
3097:[738L]CPU 1 SlotId 13 VID 19E5,DID D802 Bus 8F: Dev 0.Func 0
3098:[738L]CPU 1 SlotId 15 VID 19E5,DID D802 Bus 88: Dev 0.Func 0
3099:[738L]CPU 1 SlotId 17 VID 19E5,DID D802 Bus 85: Dev 0.Func 0
匹配到二进制文件 systemcom.dat
再到app.log搜索PMU关键词查看四元组信息:
(base) xkl@SRD-xiaokaili:~/log/S74AK3_4116056106ALC41082_20260121-1124/dump_info/LogDump$ grep "PMU" -ain app.log
2092:2026-01-20 20:15:59.926563 mctpd NOTICE: mctp_mdb_mgmt.lua(267): mctp_mdb_mgmt: pmu status change to 1, OS Power ON
4917:2026-01-21 09:20:10.656460 mctpd NOTICE: mctp_mdb_mgmt.lua(267): mctp_mdb_mgmt: pmu status change to 1, OS Power ON
6908:2026-01-21 10:09:44.010076 compute NOTICE: pmu_service.lua(143): check_uptree pmu system_id 1, slot 0
6912:2026-01-21 10:09:44.056159 compute NOTICE: pmu_service.lua(143): check_uptree pmu system_id 1, slot 0
7313:2026-01-21 10:09:54.214258 compute NOTICE: pmu_object.lua(364): [imu_collect] init_imu_collect_handle soc=Hi1630V100, soc_id=3, host_flag=true, sys_id=1
8675:2026-01-21 10:10:38.229182 mctpd NOTICE: mctp_mdb_mgmt.lua(267): mctp_mdb_mgmt: pmu status change to 1, OS Power ON
9922:2026-01-21 10:14:06.431555 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:5555, did:4119, sub_vid:6629, sub_did:53557
9930:2026-01-21 10:14:06.678520 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9942:2026-01-21 10:14:06.902388 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9950:2026-01-21 10:14:07.019603 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9962:2026-01-21 10:14:07.223479 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9970:2026-01-21 10:14:07.427510 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9978:2026-01-21 10:14:07.549494 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9988:2026-01-21 10:14:07.737983 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
9998:2026-01-21 10:14:07.859360 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:6629, did:55298, sub_vid:6629, sub_did:16384
10007:2026-01-21 10:14:07.957271 pcie_device NOTICE: device_loader.lua(130): [BizTopoLoader] Get id from PMU, vid:32902, did:5490, sub_vid:32902, sub_did:0
12131:2026-01-21 10:39:11.186090 mctpd NOTICE: mctp_mdb_mgmt.lua(267): mctp_mdb_mgmt: pmu status change to 1, OS Power ON
这个dmidecode指令查到的一键收集是依赖bmc上报丝印的。进一步定位需要确定您这边的使用场景,具体是卡插在Riser卡上还是PCIe Switch,或者是什么别的情况,以及相关的丝印配置,可能的话麻烦提供一下一键收集日志。
这边CPIE全是CPU出的,收集日志目前还在审核请稍等
PCIeCard PCIeAddrInfo: segment=0, port_id=3, socket_id=1, slot=3
PCIeCard PCIeAddrInfo: segment=0, port_id=3, socket_id=1, slot=3
这两个重复了,看下一键收集的丝印文件
丝印文件具体是哪个文件
一键收集bios目录的silk文件
一键收集中没有单独的silk文件,只有silkconfig
如下:
{“CpuSilk”:[{“LogicalSocketId”:0,“DeviceLocator”:“”,“Silk”:“CPU1”,“PhysicalSocketId”:1},{“LogicalSocketId”:1,“DeviceLocator”:“”,“Silk”:“CPU2”,“PhysicalSocketId”:2}],“PCIeSilk”:[{“RootPortDeviceId”:3,“DeviceType”:“PCIe”,“Silk”:“”,“SocketId”:1,“Segment”:0,“SlotId”:3},{“RootPortDeviceId”:3,“DeviceType”:“PCIe”,“Silk”:“”,“SocketId”:1,“Segment”:0,“SlotId”:3},{“RootPortDeviceId”:1,“DeviceType”:“PCIe”,“Silk”:“”,“SocketId”:1,“Segment”:0,“SlotId”:1},{“RootPortDeviceId”:2,“DeviceType”:“PCIe”,“Silk”:“”,“SocketId”:1,“Segment”:0,“SlotId”:2}],“MemSilk”:[{“LogicalChannelId”:0,“DimmId”:0,“Silk”:“DIMM130”,“PhysicalChannelId”:3,“SocketId”:1},{“LogicalChannelId”:2,“DimmId”:1,“Silk”:“DIMM011”,“PhysicalChannelId”:1,“SocketId”:0},{“LogicalChannelId”:3,“DimmId”:0,“Silk”:“DIMM000”,“PhysicalChannelId”:0,“SocketId”:0},{“LogicalChannelId”:2,“DimmId”:0,“Silk”:“DIMM110”,“PhysicalChannelId”:1,“SocketId”:1},{“LogicalChannelId”:1,“DimmId”:0,“Silk”:“DIMM020”,“PhysicalChannelId”:2,“SocketId”:0},{“LogicalChannelId”:1,“DimmId”:1,“Silk”:“DIMM021”,“PhysicalChannelId”:2,“SocketId”:0},{“LogicalChannelId”:0,“DimmId”:0,“Silk”:“DIMM030”,“PhysicalChannelId”:3,“SocketId”:0},{“LogicalChannelId”:0,“DimmId”:1,“Silk”:“DIMM031”,“PhysicalChannelId”:3,“SocketId”:0},{“LogicalChannelId”:3,“DimmId”:0,“Silk”:“DIMM100”,“PhysicalChannelId”:0,“SocketId”:1},{“LogicalChannelId”:3,“DimmId”:1,“Silk”:“DIMM101”,“PhysicalChannelId”:0,“SocketId”:1},{“LogicalChannelId”:3,“DimmId”:1,“Silk”:“DIMM001”,“PhysicalChannelId”:0,“SocketId”:0},{“LogicalChannelId”:2,“DimmId”:1,“Silk”:“DIMM111”,“PhysicalChannelId”:1,“SocketId”:1},{“LogicalChannelId”:1,“DimmId”:0,“Silk”:“DIMM120”,“PhysicalChannelId”:2,“SocketId”:1},{“LogicalChannelId”:2,“DimmId”:0,“Silk”:“DIMM010”,“PhysicalChannelId”:1,“SocketId”:0},{“LogicalChannelId”:0,“DimmId”:1,“Silk”:“DIMM131”,“PhysicalChannelId”:3,“SocketId”:1},{“LogicalChannelId”:1,“DimmId”:1,“Silk”:“DIMM121”,“PhysicalChannelId”:2,“SocketId”:1}],“NICSilk”:,“DiskSilk”:[{“RootBDF”:“0000:06:00.0”,“PhyId”:1,“ControlId”:1,“SocketId”:0,“SlotId”:50},{“RootBDF”:“0000:38:05.0”,“PhyId”:0,“ControlId”:1,“SocketId”:0,“SlotId”:51}],“Properties”:{“Version”:1,“Type”:“BIOS SILK CFG”}}
提供一份完整的一键收集
方便提供一下邮箱吗,网页上传不了压缩包
邮件已发送,请查收,感谢!
看你的丝印文件中 ,RootPortDeviceId是不是配置的有一些问题?为什么“SlotId”:3配置了两次。从你的system.dat来看bios是没有返回bdf信息给bmc的。建议你和bios对齐一下丝印文件。
好的我这边先和bios沟通下!
槽位3出现两个一模一样的丝印,排查下csr配置(riser、psr、基础板)的pcie配置是否存在重复
SlotTableInit Status = Not Found
ReportPcieCardBDFInfoToBMC Status Success
SsdSlotTableInit Status = Not Found
SsdSlotTableInit Status = Not Found
SsdSlotTableInit Status = Not Found
ReportSSDCardBDFInfoToBMC Status Success
OcpSlotTableInit Status = Not Found
从这里看,带内没有上报卡的bdf到bmc。
应该是丝印信息不对,槽位3出现两个一模一样的丝印,排查下csr配置(riser、psr、基础板)的pcie配置是否存在重复。
丝印信息会影响到带内上报到BMC吗,因为我只是加载了官方的sr,EXU→BCU→IEU,并没有更改其中PCIE的配置
会的 从丝印信息可以看出来应该与实际不一样,已出现重复的两个配置