[Bug 2019221] Re: System fails to boot due to early OpenVSwitch Forwarding Unit service start

Frode Nordahl 2019221 at bugs.launchpad.net
Thu Jun 29 05:04:31 UTC 2023


The referenced kernel bug 2004262 does indeed look relevant to the
description of the problem, would you be able to try the proposed kernel
fix and see if that solves your problem?

** Changed in: openvswitch (Ubuntu)
       Status: New => Incomplete

-- 
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to openvswitch in Ubuntu.
https://bugs.launchpad.net/bugs/2019221

Title:
  System fails to boot due to early OpenVSwitch Forwarding Unit service
  start

Status in openvswitch package in Ubuntu:
  Incomplete

Bug description:
  I am experiencing an issue with Ubuntu 22.04 LTS after upgrading from kernel version 5.15.0-65-generic to 5.15.0-69-generic. The system fails to boot, hanging at the OpenVSwitch Forwarding Unit service.
  Aftre reverting to 5.15.0-65-generic all fine.

  OVS version 2.17.5

  The system has a bond configured with LACP (802.3ad) mode. The
  OpenVSwitch bridge (br-ex) is bound to this bond. Through
  investigation, I found that the issue seems to be caused by the bond
  not starting before the OpenVSwitch Forwarding Unit service starts. As
  a result, the OpenVSwitch Forwarding Unit service fails to start,
  which subsequently causes network services to fail to start as well.

  If I boot into recovery mode, wait for the bond to start, and then
  manually start the OpenVSwitch Forwarding Unit service and apply the
  netplan configuration (netplan apply), all network services start
  correctly. This suggests that the bonding module is being loaded after
  the OpenVSwitch Forwarding Unit service starts.

  To rectify this, I attempted to modify the OpenVSwitch service to
  explicitly wait for the bond interface to be initialized by adding the
  following lines to /lib/systemd/system/openvswitch-switch.service:

  makefile
  Copy code
  Wants=sys-subsystem-net-devices-bond0.device
  After=sys-subsystem-net-devices-bond0.device
  Unfortunately, this did not resolve the issue.

  I also tried changing the bond mode to active-backup, but the issue
  persists.

  Logs indicate the error message: "Timed out waiting for device
  /sys/subsystem/net/devices/bond0" during startup, even though the bond
  interface appears to be available (as confirmed by checking
  /proc/net/bonding/bond0).

  I suspect that the OpenVSwitch service might be starting before
  systemd-networkd fully initializes the bond interface. I've also
  considered the possibility that the issue might be related to netplan
  configuration or recent changes in its operation.

  I can't find any error logs in ovs or netplan that clearly point to a
  problem. I only find errors with timeouts.

  Could you please provide guidance on how to ensure that the bond
  interface fully initializes before the OpenVSwitch service starts?
  Alternatively, could this be a bug that needs to be addressed in a
  future update?P erhaps this is due to Bonding or Netplan logic or
  Bonding driver(which is sewn into the kernel, if I'm not mistaken)?

  P.S. If I bind an interface (ens3f1, for example), not a bond, to the
  bridge of OVS (br-ex), the system boots normally.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/openvswitch/+bug/2019221/+subscriptions




More information about the Ubuntu-openstack-bugs mailing list