[Bug 1955345] Re: Active ceph-mgr crashes on receiving report from a non-active mgr

Ponnuvel Palaniyappan 1955345 at bugs.launchpad.net
Sun Dec 19 16:59:02 UTC 2021


Attaching debdiff for Focal.

** Patch added: "focal1955345.patch"
   https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/1955345/+attachment/5548581/+files/focal1955345.patch

-- 
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to ceph in Ubuntu.
https://bugs.launchpad.net/bugs/1955345

Title:
  Active ceph-mgr crashes on receiving report from a non-active mgr

Status in ceph package in Ubuntu:
  In Progress
Status in ceph source package in Focal:
  In Progress

Bug description:
  [Impact]
  An active ceph-mgr crashes, and another ceph-mgr takes over and becomes
  the active mgr. The new active mgr can then hit the same issue and crash,
  and the cycle can continue indefinitely (each previously crashed ceph-mgr
  gets restarted by systemd).

  This could affect the cluster stability/usability as ceph mgr handles
  a number of essential operations (modules that control/change Ceph
  cluster behaviour, metrics, etc).

  [Test Plan]
  Deploy and operate a Ceph cluster normally.
  Increase the log level of mgr to 20.
  Observe that MMgrReport messages sent from non-active mgrs are ignored
  (no crash).
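
  The log-level and observation steps above could be scripted roughly as
  follows. This is only a sketch: the helper names are hypothetical, it
  assumes the ceph CLI and an admin keyring are available on the host, and
  the "MMgrReport" substring is an assumed marker, since the exact log
  wording depends on the Ceph build.

```python
import subprocess


def mgr_debug_command(level=20):
    """Return the ceph CLI arguments that set debug_mgr for all mgr daemons."""
    # `ceph config set mgr debug_mgr <level>` targets the whole mgr section.
    return ["ceph", "config", "set", "mgr", "debug_mgr", str(level)]


def raise_mgr_debug(level=20):
    """Run the command above; fails loudly if the ceph CLI returns an error."""
    subprocess.run(mgr_debug_command(level), check=True)


def lines_mentioning(log_lines, needle="MMgrReport"):
    """Keep log lines containing `needle` (an assumed marker, adjust as needed)."""
    return [line.rstrip("\n") for line in log_lines if needle in line]
```

  After calling raise_mgr_debug(), the active mgr's log file can be opened
  and passed to lines_mentioning() to review how reports were handled.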

  [Where problems could occur]
  The fix may not actually resolve the issue, and the mgr may continue to
  crash as before.
  The fix might also incorrectly ignore reports from active mgrs.

  [Other Info]
  Upstream main bug: https://tracker.ceph.com/issues/48022
  Octopus backport PR: https://github.com/ceph/ceph/pull/43861
  Octopus backport bug: https://tracker.ceph.com/issues/53198

  This has already been fixed and is available in Pacific,
  so a backport is needed only for Octopus.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/1955345/+subscriptions



