librenms/librenms-agent

SNMP/smart issue

sippe2 opened this issue · 0 comments

I have an NVMe drive, "/dev/nvme0n1", and the smart script outputs only null and 0 values for it. smartctl itself works fine on the same device. Is there an easy way to fix this? And yes, the device name is pretty strange...

Smart configuration:
root@pve3:/etc/snmp# cat smart.config
useSN=0
smartctl=/sbin/smartctl
cache=/var/cache/smart
sda /dev/sda
nvme /dev/nvme0n1

Some output data:
root@pve3:/etc/snmp# ./smart
sda,0,null,518,null,0,0,0,0,46,null,0,null,0,0,null,null,0,0,0,0,0,0,0,0,8179
nvm,null,null,null,null,null,null,null,null,null,null,null,null,null,null,null,null,0,0,0,0,0,0,0,0,null

root@pve3:/etc/snmp# smartctl -a /dev/nvme0n1
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.3.18-3-pve] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Number: SAMSUNG MZVLW256HEHP-000L7
Serial Number: S35ENX0K728166
Firmware Version: 5L7QCXB7
PCI Vendor/Subsystem ID: 0x144d
IEEE OUI Identifier: 0x002538
Total NVM Capacity: 256,060,514,304 [256 GB]
Unallocated NVM Capacity: 0
Controller ID: 2
Number of Namespaces: 1
Namespace 1 Size/Capacity: 256,060,514,304 [256 GB]
Namespace 1 Utilization: 71,985,352,704 [71.9 GB]
Namespace 1 Formatted LBA Size: 512
Namespace 1 IEEE EUI-64: 002538 b781b233eb
Local Time is: Tue Apr 7 03:53:48 2020 EEST
Firmware Updates (0x16): 3 Slots, no Reset required
Optional Admin Commands (0x0017): Security Format Frmw_DL Self_Test
Optional NVM Commands (0x001f): Comp Wr_Unc DS_Mngmt Wr_Zero Sav/Sel_Feat
Warning Comp. Temp. Threshold: 69 Celsius
Critical Comp. Temp. Threshold: 72 Celsius

Supported Power States
St Op Max Active Idle RL RT WL WT Ent_Lat Ex_Lat
0 + 7.60W - - 0 0 0 0 0 0
1 + 6.00W - - 1 1 1 1 0 0
2 + 5.10W - - 2 2 2 2 0 0
3 - 0.0400W - - 3 3 3 3 210 1500
4 - 0.0050W - - 4 4 4 4 2200 6000

Supported LBA Sizes (NSID 0x1)
Id Fmt Data Metadt Rel_Perf
0 + 512 0 0

=== START OF SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

SMART/Health Information (NVMe Log 0x02)
Critical Warning: 0x00
Temperature: 52 Celsius
Available Spare: 100%
Available Spare Threshold: 10%
Percentage Used: 1%
Data Units Read: 91,621 [46.9 GB]
Data Units Written: 10,167,006 [5.20 TB]
Host Read Commands: 1,287,383
Host Write Commands: 118,108,141
Controller Busy Time: 1,039
Power Cycles: 48
Power On Hours: 1,656
Unsafe Shutdowns: 33
Media and Data Integrity Errors: 0
Error Information Log Entries: 37
Warning Comp. Temperature Time: 0
Critical Comp. Temperature Time: 0
Temperature Sensor 1: 52 Celsius
Temperature Sensor 2: 60 Celsius

Error Information (NVMe Log 0x01, max 64 entries)
Num ErrCount SQId CmdId Status PELoc LBA NSID VS
0 37 0 0x0016 0x4004 0x02c 0 0 -
1 36 0 0x0015 0x4004 0x02c 0 0 -
...