[Bug 2003530] Re: Rook mgr module crashes due to missing mgr.nfs

Alan Baghumian 2003530 at bugs.launchpad.net
Thu Jul 6 17:22:26 UTC 2023


Hello There!

I have a cluster running the proposed packages since they have been
released:

$ dpkg -l | grep 17.2.6 | awk '{print $2"\t\t"$3"\t\t"$4}'
ceph		17.2.6-0ubuntu0.22.04.1		amd64
ceph-base		17.2.6-0ubuntu0.22.04.1		amd64
ceph-common		17.2.6-0ubuntu0.22.04.1		amd64
ceph-mds		17.2.6-0ubuntu0.22.04.1		amd64
ceph-mgr		17.2.6-0ubuntu0.22.04.1		amd64
ceph-mgr-modules-core		17.2.6-0ubuntu0.22.04.1		all
ceph-mon		17.2.6-0ubuntu0.22.04.1		amd64
ceph-osd		17.2.6-0ubuntu0.22.04.1		amd64
ceph-volume		17.2.6-0ubuntu0.22.04.1		all
libcephfs2		17.2.6-0ubuntu0.22.04.1		amd64
librados2		17.2.6-0ubuntu0.22.04.1		amd64
libradosstriper1		17.2.6-0ubuntu0.22.04.1		amd64
librbd1		17.2.6-0ubuntu0.22.04.1		amd64
librgw2		17.2.6-0ubuntu0.22.04.1		amd64
libsqlite3-mod-ceph		17.2.6-0ubuntu0.22.04.1		amd64
python3-ceph-argparse		17.2.6-0ubuntu0.22.04.1		amd64
python3-ceph-common		17.2.6-0ubuntu0.22.04.1		all
python3-cephfs		17.2.6-0ubuntu0.22.04.1		amd64
python3-rados		17.2.6-0ubuntu0.22.04.1		amd64
python3-rbd		17.2.6-0ubuntu0.22.04.1		amd64
radosgw		17.2.6-0ubuntu0.22.04.1		amd64


$ sudo ceph mgr module ls
MODULE                           
balancer           on (always on)
crash              on (always on)
devicehealth       on (always on)
orchestrator       on (always on)
pg_autoscaler      on (always on)
progress           on (always on)
rbd_support        on (always on)
status             on (always on)
telemetry          on (always on)
volumes            on (always on)
iostat             on            
nfs                on            
restful            on            
alerts             -             
influx             -             
insights           -             
localpool          -             
mirroring          -             
osd_perf_query     -             
osd_support        -             
prometheus         -             
selftest           -             
snap_schedule      -             
stats              -             
telegraf           -             
test_orchestrator  -             
zabbix             -

$ sudo ceph -s
  cluster:
    id:     6c2efd86-7423-11ed-97ec-2f3ef93079f7
    health: HEALTH_OK
 
  services:
    mon: 3 daemons, quorum juju-b096f0-88-lxd-0,juju-b096f0-90-lxd-0,juju-b096f0-92-lxd-0 (age 10h)
    mgr: juju-b096f0-88-lxd-0(active, since 4d), standbys: juju-b096f0-92-lxd-0, juju-b096f0-90-lxd-0
    osd: 8 osds: 8 up (since 2d), 8 in (since 2w)
 
  data:
    pools:   3 pools, 289 pgs
    objects: 169.40k objects, 492 GiB
    usage:   1.5 TiB used, 892 GiB / 2.3 TiB avail
    pgs:     289 active+clean


Installed ceph-mgr-rook on all Mon units:

$ juju run -a ceph-mon-ssd 'sudo apt-get -y install ceph-mgr-rook'


Check cluster Status:

$ sudo ceph -s
  cluster:
    id:     6c2efd86-7423-11ed-97ec-2f3ef93079f7
    health: HEALTH_OK
 
  services:
    mon: 3 daemons, quorum juju-b096f0-88-lxd-0,juju-b096f0-90-lxd-0,juju-b096f0-92-lxd-0 (age 10h)
    mgr: juju-b096f0-88-lxd-0(active, since 2m), standbys: juju-b096f0-90-lxd-0, juju-b096f0-92-lxd-0
    osd: 8 osds: 8 up (since 2d), 8 in (since 2w)
 
  data:
    pools:   3 pools, 289 pgs
    objects: 169.40k objects, 492 GiB
    usage:   1.5 TiB used, 892 GiB / 2.3 TiB avail
    pgs:     289 active+clean
 
$ sudo ceph mgr module ls
MODULE                           
balancer           on (always on)
crash              on (always on)
devicehealth       on (always on)
orchestrator       on (always on)
pg_autoscaler      on (always on)
progress           on (always on)
rbd_support        on (always on)
status             on (always on)
telemetry          on (always on)
volumes            on (always on)
iostat             on            
nfs                on            
restful            on            
alerts             -             
influx             -             
insights           -             
localpool          -             
mirroring          -             
osd_perf_query     -             
osd_support        -             
prometheus         -             
rook               -             
selftest           -             
snap_schedule      -             
stats              -             
telegraf           -             
test_orchestrator  -             
zabbix             -             


Please let me know if there is anything else you'd like me to test.

Best,
Alan

-- 
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to ceph in Ubuntu.
https://bugs.launchpad.net/bugs/2003530

Title:
  Rook mgr module crashes due to missing mgr.nfs

Status in ceph package in Ubuntu:
  Fix Released
Status in ceph source package in Jammy:
  Fix Committed
Status in ceph source package in Kinetic:
  Fix Committed
Status in ceph source package in Lunar:
  Fix Released

Bug description:
  [Impact]

  The rook mgr service crashes on installing the ceph-mgr-rook package
  (see below traceback from /var/log/syslog). This is due to a missing
  ceph mgr package "nfs" which the rook mgr module depends upon.

  This makes the rook mgr module unusable which is required for
  integrating Ceph with the Rook storage orchestrator.

  The proposed patch fixes this by including the nfs mgr package into
  the ceph-mgr-modules-core .deb. This is similar as upstream packages
  nfs for the ceph mgr system.

  
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: debug 2023-01-17T16:39:18.008+0000 7f930419fdc0 -1 mgr[py] Module not found: 'rook'
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: debug 2023-01-17T16:39:18.008+0000 7f930419fdc0 -1 mgr[py] Traceback (most recent call last):
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: File "/usr/share/ceph/mgr/rook/__init__.py", line 5, in <module>
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: from .module import RookOrchestrator
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: File "/usr/share/ceph/mgr/rook/module.py", line 41, in <module>
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: from .rook_cluster import RookCluster
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: File "/usr/share/ceph/mgr/rook/rook_cluster.py", line 29, in <module>
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: from nfs.cluster import create_ganesha_pool
  Jan 17 16:39:18 devcontainer-269785 bash[247610]: ModuleNotFoundError: No module named 'nfs'

  [Test plan]

  The test requires a Ceph cluster. SSH to a system with a running ceph-
  mon service.

  $ sudo ceph mgr module ls  # verify: no rook mgr module
  $ sudo apt-get -q install ceph-mgr-rook
  $ sudo ceph -s  # verify: no crashed modules
  $ sudo ceph mgr module ls  # verify: rook mgr module present and enabled

  
  [Where problems could occur]

  The proposed patch only includes an additional Python package, and
  regression potential should be low.

  Issues could occur due to packaging bugs, such as missing dependencies
  for the nfs mgr package. As the nfs package is currently missing,
  there should not be any additional impact due to this.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/ceph/+bug/2003530/+subscriptions




More information about the Ubuntu-openstack-bugs mailing list