[Bug 1931886] Re: show-regs can cause some samsung controllers to go offline
Bug Watch Updater
1931886 at bugs.launchpad.net
Sat Jul 3 19:29:03 UTC 2021
** Changed in: nvme-cli (Debian)
Status: Confirmed => Fix Released
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to nvme-cli in Ubuntu.
https://bugs.launchpad.net/bugs/1931886
Title:
show-regs can cause some samsung controllers to go offline
Status in nvme-cli package in Ubuntu:
In Progress
Status in nvme-cli source package in Groovy:
New
Status in nvme-cli source package in Hirsute:
New
Status in nvme-cli source package in Impish:
In Progress
Status in nvme-cli package in Debian:
Fix Released
Bug description:
[Impact]
nvme show-regs has been found to cause certain Samsung controllers
(MZ1L21T9HCLS in particular) to go offline.
[Test Case]
Run `nvme show-regs` on an effected controller device. Messages similar to this will appear in dmesg:
[963314.311332] nvme nvme2: controller is down; will reset: CSTS=0x3, PCI_STATUS=0x10
[963334.951328] nvme nvme2: Device not ready; aborting reset
[963334.963114] nvme nvme2: Removing after probe failure status: -19
[963334.999600] blk_update_request: I/O error, dev nvme2n1, sector 1050640 op 0x1:(WRITE) flags 0x800 phys_seg 1 prio class 0
[963335.023410] md: super_written gets error=10
[963335.033842] md/raid1:md0: Disk failure on nvme2n1p2, disabling device.
md/raid1:md0: Operation continuing on 1 devices.
[ +0.009599] XFS (md127): log I/O error -5
[ +0.015136] XFS (md127): xfs_do_force_shutdown(0x2) called from line 1250 of file fs/xfs/xfs_log.c. Return address = 00000000d0ea8129
[ +0.000001] XFS (md127): Log I/O Error Detected. Shutting down filesystem
[ +0.009290] XFS (md127): Please unmount the filesystem and rectify the problem(s)
[Fix]
This has been fixed upstream with the following commits:
https://github.com/linux-nvme/nvme-cli/commit/33e60ff64a043b189d2661543b417b21b6f3667b
https://github.com/linux-nvme/nvme-cli/commit/d43d545a68cc6cea5ac78fda4edeedf3b5198847
[What Could Go Wrong]
Because the register prmsc is now split into prmscl/prmscu as the specification requires, the displayed registers will be different in showregs output. This might surprise any code that is trying to parse this output. Also upstream made a formatting change here that adds additional whitespace to a field when running w/ -H (human-readable mode):
This:
Controller Base Address (CBA) : 0
Became:
Controller Base Address (CBA): 0
It is human-readable mode which at least I interpret as "not for
scripting", but it's possible that there is a user expecting that
specific format. We could carry an additional patch to restore this
whitespace if the SRU team is so inclined.
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/nvme-cli/+bug/1931886/+subscriptions
More information about the foundations-bugs
mailing list