Mellanox/mstflint

eth card doesn't work in unraid 6.9.2

stephenzhou30 opened this issue · 3 comments

issues

uname -a
Linux Tower 5.10.28-Unraid #1 SMP Wed Apr 7 08:23:18 PDT 2021 x86_64 AMD Opteron(tm) X3421 APU AuthenticAMD GNU/Linux

cat /etc/issue
unRAID Server OS version: 6.9.2
IPv4 address: 10.100.10.3
IPv6 address: not set

cat /proc/bus/pci/devices
0000 10221576 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0002 10221577 19 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0008 10029874 ff e000000c 0 f000000c 0 f001 feb00000 c0002 10000000 0 800000 0 100 40000 20000
0009 10029840 ff feb64004 0 0 0 0 0 0 4000 0 0 0 0 0 0
0010 1022157b 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0012 1022157c 1b 0 0 0 0 0 0 0 0 0 0 0 0 0 0 pcieport
0014 1022157c 1c 0 0 0 0 0 0 0 0 0 0 0 0 0 0 pcieport
0015 1022157c 1d 0 0 0 0 0 0 0 0 0 0 0 0 0 0 pcieport
0018 1022157b 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0019 1022157c 1e 0 0 0 0 0 0 0 0 0 0 0 0 0 0 pcieport
0040 10221578 ff f090000c 0 fe500000 feb6f000 0 feb6a000 0 20000 0 100000 1000 0 2000 0
0048 1022157d 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0080 10227914 12 feb68004 0 0 0 0 0 0 2000 0 0 0 0 0 0 xhci_hcd
0088 10227901 13 f141 f131 f121 f111 f101 feb6d000 0 8 4 8 4 10 400 0 ahci
0090 10227908 12 feb6c000 0 0 0 0 0 0 100 0 0 0 0 0 0 ehci-pci
00a0 1022790b 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 piix4_smbus
00a3 1022790e 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
00c0 10221570 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
00c1 10221571 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
00c2 10221572 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
00c3 10221573 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 k10temp
00c4 10221574 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 fam15h_power
00c5 10221575 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
0100 1b4b9230 25 e051 e041 e031 e021 e001 fea40000 fea00000 8 4 8 4 20 800 40000 ahci
0200 c0a92263 28 fe900004 0 0 0 0 0 0 4000 0 0 0 0 0 0 nvme
0300 14e4165f 26 f085000c 0 f084000c 0 f083000c 0 fe840000 10000 0 10000 0 10000 0 40000 tg3
0301 14e4165f 27 f082000c 0 f081000c 0 f080000c 0 fe800000 10000 0 10000 0 10000 0 40000 tg3
0400 15b31015 2e 42000000c 0 0 0 0 0 fe700000 2000000 0 0 0 0 0 100000
0401 15b31015 24 42200000c 0 0 0 0 0 fe600000 2000000 0 0 0 0 0 100000

mstflint -vv
Invalid command: -vv
NAME
MstFlint - Flash Interface

SYNOPSIS
    MstFlint [OPTIONS] <command> [Parameters]
        [-d|--device <device>] [-i|--image <image>] [--latest_fw] [--ir] [-h|--help] [--hh]
        [-y|--yes] [--no] [--guid <GUID>] [--guids <GUIDS...>] [--mac <MAC>]
        [--macs <MACs...>] [--uid <UID>] [--blank_guids] [--clear_semaphore] [--qq]
        [--low_cpu] [--flashed_version] [--nofs] [--allow_rom_change]
        [--override_cache_replacement] [--no_flash_verify] [--use_fw] [-s|--silent]
        [--vsd <string>] [--use_image_ps] [--use_image_guids] [--use_image_rom]
        [--use_dev_rom] [--ignore_dev_data] [--no_fw_ctrl] [--dual_image] [--striped_image]
        [--banks <bank>] [--log <log_file>]
        [--flash_params <type, log2size, num_of_flashes>] [-v|--version]
        [--private_key <key_file>] [--public_key <key_file>] [--key_uuid <uuid_file>]
        [--private_key2 <key_file>] [--hmac_key <hmac_key>] [--key_uuid2 <uuid_file>]
        [--psid <PSID>] [--cc <Congestion_Control>] [--linkx]
        [--downstream_device_id_start_index <downstream_device_id_start_index>]
        [--num_of_downstream_devices <num_of_downstream_devices>] [--linkx_auto_update]
        [--activate] [--download_transfer] [--downstream_device_ids <list of ports>]
        [--public_key_label <string>] [--private_key_label <string>]
        [--output_file <string>] [--user_password <string>]

lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) I/O Memory Management Unit
00:01.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Wani [Radeon R5/R6/R7 Graphics] (rev 84)
00:01.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Kabini HDMI/DP Audio
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Host Bridge
00:02.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:02.4 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:02.5 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:03.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Host Bridge
00:03.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:08.0 Encryption controller: Advanced Micro Devices, Inc. [AMD] Carrizo Platform Security Processor
00:09.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Carrizo Audio Dummy Host Bridge
00:10.0 USB controller: Advanced Micro Devices, Inc. [AMD] FCH USB XHCI Controller (rev 20)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 49)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD] FCH USB EHCI Controller (rev 49)
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 4a)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 11)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 5
01:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9230 PCIe 2.0 x2 4-port SATA 6 Gb/s RAID Controller (rev 11)
02:00.0 Non-Volatile memory controller: Micron/Crucial Technology P1 NVMe PCIe SSD (rev 03)
03:00.0 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 Gigabit Ethernet PCIe
03:00.1 Ethernet controller: Broadcom Inc. and subsidiaries NetXtreme BCM5720 Gigabit Ethernet PCIe
04:00.0 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]
04:00.1 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]

mstflint -d 04:00.0 v

FS3 failsafe image

     /0x00000038-0x00001d97 (0x001d60)/ (BOOT2) - OK
     /0x00002000-0x0000201f (0x000020)/ (ITOC_HEADER) - OK
     /0x00004000-0x00013dd3 (0x00fdd4)/ (IRON_PREP_CODE) - OK
     /0x00014000-0x000140ff (0x000100)/ (FS3_RESET_INFO) - OK
     /0x00015000-0x00015bff (0x000c00)/ (FW_MAIN_CFG) - OK
     /0x00016000-0x0001647f (0x000480)/ (FW_BOOT_CFG) - OK
     /0x00017000-0x000185ff (0x001600)/ (HW_MAIN_CFG) - OK
     /0x00019000-0x0001913f (0x000140)/ (HW_BOOT_CFG) - OK
     /0x0001a000-0x0001cc7f (0x002c80)/ (PHY_UC_CONSTS) - OK
     /0x0001d000-0x0001d13f (0x000140)/ (IMAGE_SIGNATURE_256) - OK
     /0x0001e000-0x0001e8ff (0x000900)/ (PUBLIC_KEYS_2048) - OK
     /0x0001f000-0x0001f08f (0x000090)/ (FORBIDDEN_VERSIONS) - OK
     /0x00020000-0x0002023f (0x000240)/ (IMAGE_SIGNATURE_512) - OK
     /0x00021000-0x000220ff (0x001100)/ (PUBLIC_KEYS_4096) - OK
     /0x00023000-0x00072fff (0x050000)/ (FS4_PART_TYPE_PROGRAMMABLE_HW_FW) - OK
     /0x00073000-0x00123827 (0x0b0828)/ (ROM_CODE) - OK
     /0x00124000-0x00133fff (0x010000)/ (CRDUMP_MASK_DATA) - OK
     /0x00134000-0x001349ff (0x000a00)/ (PHY_UC_CMD) - OK
     /0x00135000-0x0013e91f (0x009920)/ (PHY_UC_CODE) - OK
     /0x0013f000-0x0016a4e7 (0x02b4e8)/ (PCI_CODE) - OK
     /0x0016b000-0x004ab11f (0x340120)/ (MAIN_CODE) - OK
     /0x004ac000-0x004b9c5f (0x00dc60)/ (PCIE_LINK_CODE) - OK
     /0x004ba000-0x004bae3f (0x000e40)/ (POST_IRON_BOOT_CODE) - OK
     /0x004bb000-0x004bcbfb (0x001bfc)/ (UPGRADE_CODE) - OK
     /0x004bd000-0x004bd3ff (0x000400)/ (IMAGE_INFO) - OK
     /0x004bd400-0x004bdb6f (0x000770)/ (DBG_FW_INI) - OK
     /0x004bdb70-0x004bdb77 (0x000008)/ (DBG_FW_PARAMS) - OK
     /0x00fa0000-0x00faffff (0x010000)/ (NV_DATA) - CRC IGNORED
     /0x00fb0000-0x00fbffff (0x010000)/ (NV_DATA) - CRC IGNORED
     /0x00fc0000-0x00fcffff (0x010000)/ (FW_NV_LOG) - CRC IGNORED
     /0x00fee000-0x00fee1ff (0x000200)/ (DEV_INFO) - OK
     /0x00ff8000-0x00ff813f (0x000140)/ (MFG_INFO) - OK
     /0x00ff8140-0x00ff81b7 (0x000078)/ (VPD_R0) - OK

-I- FW image verification succeeded. Image is bootable.

mstflint -d 04:00.0 q
Image type: FS3
FW Version: 14.31.1014
FW Release Date: 30.6.2021
Product Version: 14.31.1014
Rom Info: type=UEFI version=14.24.13 cpu=AMD64,AARCH64
type=PXE version=3.6.403 cpu=AMD64
Description: UID GuidsNumber
Base GUID: b8599f0300e3a70e 4
Base MAC: b8599fe3a70e 4
Image VSD: N/A
Device VSD: N/A
PSID: MT_2420110004
Security Attributes: N/A

mstvpd 04:00.0
ID: CX4121A - ConnectX-4 LX SFP28
PN: MCX4121A-XCAT
EC: AG
SN: MT1940J06497
V0: PCIeGen3 x8

lspci -v -s 04:00.1 | more
04:00.1 Ethernet controller: Mellanox Technologies MT27710 Family [ConnectX-4 Lx]
Subsystem: Mellanox Technologies ConnectX-4 Lx Stand-up dual-port 10GbE MCX4121A-XCAT
Flags: fast devsel, IRQ 36, IOMMU group 2
Memory at 422000000 (64-bit, prefetchable) [size=32M]
Expansion ROM at fe600000 [disabled] [size=1M]
Capabilities: [60] Express Endpoint, MSI 00
Capabilities: [48] Vital Product Data
Capabilities: [9c] MSI-X: Enable- Count=64 Masked-
Capabilities: [c0] Vendor Specific Information: Len=18 <?>
Capabilities: [40] Power Management version 3
Capabilities: [100] Advanced Error Reporting
Capabilities: [150] Alternative Routing-ID Interpretation (ARI)
Capabilities: [180] Single Root I/O Virtualization (SR-IOV)
Capabilities: [230] Access Control Services
Kernel modules: mlx5_core

dmesg | more | grep 04:00
[ 0.681114] pci 0000:04:00.0: [15b3:1015] type 00 class 0x020000
[ 0.681262] pci 0000:04:00.0: reg 0x10: [mem 0xffe000000-0xfffffffff 64bit pref]
[ 0.681547] pci 0000:04:00.0: reg 0x30: [mem 0xfe700000-0xfe7fffff pref]
[ 0.682236] pci 0000:04:00.0: PME# supported from D3cold
[ 0.682503] pci 0000:04:00.0: reg 0x1a4: [mem 0x00000000-0x000fffff 64bit pref]
[ 0.682511] pci 0000:04:00.0: VF(n) BAR0 space: [mem 0x00000000-0x00ffffff 64bit pref] (contains BAR0 for 16 VFs)
[ 0.684013] pci 0000:04:00.1: [15b3:1015] type 00 class 0x020000
[ 0.684160] pci 0000:04:00.1: reg 0x10: [mem 0xffc000000-0xffdffffff 64bit pref]
[ 0.684455] pci 0000:04:00.1: reg 0x30: [mem 0xfe600000-0xfe6fffff pref]
[ 0.685097] pci 0000:04:00.1: PME# supported from D3cold
[ 0.685355] pci 0000:04:00.1: reg 0x1a4: [mem 0x00000000-0x000fffff 64bit pref]
[ 0.685363] pci 0000:04:00.1: VF(n) BAR0 space: [mem 0x00000000-0x00ffffff 64bit pref] (contains BAR0 for 16 VFs)
[ 0.710024] pci 0000:04:00.0: BAR 7: no space for [mem size 0x01000000 64bit pref]
[ 0.710033] pci 0000:04:00.0: BAR 7: failed to assign [mem size 0x01000000 64bit pref]
[ 0.710046] pci 0000:04:00.1: BAR 7: no space for [mem size 0x01000000 64bit pref]
[ 0.710054] pci 0000:04:00.1: BAR 7: failed to assign [mem size 0x01000000 64bit pref]
[ 0.710257] pci 0000:04:00.0: BAR 0: assigned [mem 0x420000000-0x421ffffff 64bit pref]
[ 0.710323] pci 0000:04:00.1: BAR 0: assigned [mem 0x422000000-0x423ffffff 64bit pref]
[ 0.710389] pci 0000:04:00.0: BAR 7: assigned [mem 0x424000000-0x424ffffff 64bit pref]
[ 0.710433] pci 0000:04:00.1: BAR 7: assigned [mem 0x425000000-0x425ffffff 64bit pref]
[ 0.711841] pci 0000:04:00.0: ConnectX-4: FW 65535.65535.65535 doesn't support INTx masking, disabling. Please upgrade FW to 14.14.1100 and up for INTx support
[ 0.712020] pci 0000:04:00.1: ConnectX-4: FW 65535.65535.65535 doesn't support INTx masking, disabling. Please upgrade FW to 14.14.1100 and up for INTx support
[ 36.199823] pci 0000:04:00.0: Adding to iommu group 2
[ 36.199837] pci 0000:04:00.1: Adding to iommu group 2
[ 43.172556] mlx5_core 0000:04:00.0: firmware version: 65535.65535.65535
[ 43.172594] mlx5_core 0000:04:00.0: 63.008 Gb/s available PCIe bandwidth (8.0 GT/s PCIe x8 link)
[ 63.175444] mlx5_core 0000:04:00.0: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 100s
[ 83.179443] mlx5_core 0000:04:00.0: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 79s
[ 103.183443] mlx5_core 0000:04:00.0: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 59s
[ 123.187441] mlx5_core 0000:04:00.0: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 39s
[ 143.191442] mlx5_core 0000:04:00.0: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 19s
[ 163.175439] mlx5_core 0000:04:00.0: mlx5_function_setup:961:(pid 1256): Firmware over 120000 MS in pre-initializing state, aborting
[ 163.175459] mlx5_core 0000:04:00.0: init_one:1371:(pid 1256): mlx5_load_one failed with error code -16
[ 163.175907] mlx5_core: probe of 0000:04:00.0 failed with error -16
[ 163.176400] mlx5_core 0000:04:00.1: firmware version: 65535.65535.65535
[ 163.176446] mlx5_core 0000:04:00.1: 63.008 Gb/s available PCIe bandwidth (8.0 GT/s PCIe x8 link)
[ 183.180440] mlx5_core 0000:04:00.1: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 100s
[ 203.183441] mlx5_core 0000:04:00.1: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 79s
[ 223.186422] mlx5_core 0000:04:00.1: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 59s
[ 243.189360] mlx5_core 0000:04:00.1: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 39s
[ 263.193456] mlx5_core 0000:04:00.1: wait_fw_init:202:(pid 1256): Waiting for FW initialization, timeout abort in 19s
[ 283.178454] mlx5_core 0000:04:00.1: mlx5_function_setup:961:(pid 1256): Firmware over 120000 MS in pre-initializing state, aborting
[ 283.178461] mlx5_core 0000:04:00.1: init_one:1371:(pid 1256): mlx5_load_one failed with error code -16
[ 283.178775] mlx5_core: probe of 0000:04:00.1 failed with error -16

Please just tell me what to do or just waiting for official update,THX !