[Bug 1877617] [NEW] Automatic scans cause instability for cloud use cases

Ben Swartzlander ben at swartzlander.org
Fri May 8 15:55:57 UTC 2020


Public bug reported:

When using iSCSI storage underneath cloud applications such as OpenStack
or Kubernetes, the automatic bus scan on login causes problems, because
it results in SCSI disks being registered in the kernel that will never
get cleaned up, and when those disks are eventually deleted off the
server, I/O errors begin to accumulate, eventually slowing down the
whole SCSI subsystem, spamming the kernel log, and causing timeouts at
higher levels such that users are forced to reboot the node to get back
to a usable state.

RedHat discovered this problem more than 3 years ago and fixed it
upstream.

https://bugzilla.redhat.com/show_bug.cgi?id=1422941

I had hoped that Debian would eventually pick up the version in which it
was fixed, but another LTS has gone by without picking up the newer
upstream version, and this is a critical problem, so I propose
backporting the fixes.

The 2 patches that need porting are:
https://github.com/open-iscsi/open-iscsi/pull/40
https://github.com/open-iscsi/open-iscsi/pull/49

These changes have been proven safe by 3 years of soak time in the
RedHat ecosystem, so I don't see much risk to taking them into Ubuntu.
They apply cleanly to the most recent versions of focal, bionic, and
xenial.

** Affects: open-iscsi (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to open-iscsi in Ubuntu.
https://bugs.launchpad.net/bugs/1877617

Title:
  Automatic scans cause instability for cloud use cases

Status in open-iscsi package in Ubuntu:
  New

Bug description:
  When using iSCSI storage underneath cloud applications such as
  OpenStack or Kubernetes, the automatic bus scan on login causes
  problems, because it results in SCSI disks being registered in the
  kernel that will never get cleaned up, and when those disks are
  eventually deleted off the server, I/O errors begin to accumulate,
  eventually slowing down the whole SCSI subsystem, spamming the kernel
  log, and causing timeouts at higher levels such that users are forced
  to reboot the node to get back to a usable state.

  RedHat discovered this problem more than 3 years ago and fixed it
  upstream.

  https://bugzilla.redhat.com/show_bug.cgi?id=1422941

  I had hoped that Debian would eventually pick up the version in which
  it was fixed, but another LTS has gone by without picking up the newer
  upstream version, and this is a critical problem, so I propose
  backporting the fixes.

  The 2 patches that need porting are:
  https://github.com/open-iscsi/open-iscsi/pull/40
  https://github.com/open-iscsi/open-iscsi/pull/49

  These changes have been proven safe by 3 years of soak time in the
  RedHat ecosystem, so I don't see much risk to taking them into Ubuntu.
  They apply cleanly to the most recent versions of focal, bionic, and
  xenial.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/open-iscsi/+bug/1877617/+subscriptions



More information about the foundations-bugs mailing list