[Bug 1931244] Re: ovn sriov broken from ussuri onwards
Hemanth Nakkina
1931244 at bugs.launchpad.net
Fri Jun 11 13:36:54 UTC 2021
Verified on focal-proposed; the test case is successful.
$ juju run -a neutron-api -- sudo apt-cache policy neutron-common
neutron-common:
  Installed: 2:16.3.2-0ubuntu3
  Candidate: 2:16.3.2-0ubuntu3
  Version table:
 *** 2:16.3.2-0ubuntu3 500
        500 http://archive.ubuntu.com/ubuntu focal-proposed/main amd64 Packages
        100 /var/lib/dpkg/status
     2:16.3.2-0ubuntu2 500
        500 http://archive.ubuntu.com/ubuntu focal-updates/main amd64 Packages
     2:16.0.0~b3~git2020041516.5f42488a9a-0ubuntu2 500
        500 http://archive.ubuntu.com/ubuntu focal/main amd64 Packages
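When repeating this check across several units, the Installed and Candidate fields can be compared programmatically. A minimal sketch, assuming the `apt-cache policy` text has already been captured as a string; the helper name `installed_and_candidate` is hypothetical and not part of any apt or juju tooling:

```python
def installed_and_candidate(policy_output):
    """Return (installed, candidate) versions parsed from `apt-cache policy` text."""
    fields = {}
    for raw in policy_output.splitlines():
        line = raw.strip()  # tolerate whatever indentation the capture kept
        for key in ("Installed", "Candidate"):
            prefix = key + ":"
            if line.startswith(prefix):
                fields[key] = line[len(prefix):].strip()
    return fields.get("Installed"), fields.get("Candidate")


# Sample input copied from the output above (version table shortened).
sample = """neutron-common:
  Installed: 2:16.3.2-0ubuntu3
  Candidate: 2:16.3.2-0ubuntu3
  Version table:
 *** 2:16.3.2-0ubuntu3 500
        500 http://archive.ubuntu.com/ubuntu focal-proposed/main amd64 Packages
"""

installed, candidate = installed_and_candidate(sample)
print(installed, candidate, installed == candidate)
```

If the two versions match and equal the -proposed version, the unit is running the package under verification.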
Created a VM with 2 SR-IOV ports:
$ openstack server list --long
+--------------------------------------+-------------+--------+------------+-------------+----------------------------------------+------------+--------------------------------------+-------------+-----------+-------------------+---------------------+------------+
| ID | Name | Status | Task State | Power State | Networks | Image Name | Image ID | Flavor Name | Flavor ID | Availability Zone | Host | Properties |
+--------------------------------------+-------------+--------+------------+-------------+----------------------------------------+------------+--------------------------------------+-------------+-----------+-------------------+---------------------+------------+
| 0f0c5104-cda8-4b84-95b0-8a713e8a1db6 | sriov-test1 | ACTIVE | None | Running | sriov_net=10.230.58.157, 10.230.58.133 | bionic | 17cca127-b912-444d-bc9a-5e4cf48156b3 | m1.medium | 3 | nova | test.test.test | |
+--------------------------------------+-------------+--------+------------+-------------+----------------------------------------+------------+--------------------------------------+-------------+-----------+-------------------+---------------------+------------+
** Tags removed: verification-needed-focal
** Tags added: verification-done-focal
--
You received this bug notification because you are a member of Ubuntu
OpenStack, which is subscribed to Ubuntu Cloud Archive.
https://bugs.launchpad.net/bugs/1931244
Title:
ovn sriov broken from ussuri onwards
Status in Ubuntu Cloud Archive:
Fix Committed
Status in Ubuntu Cloud Archive ussuri series:
Fix Committed
Status in Ubuntu Cloud Archive victoria series:
Invalid
Status in Ubuntu Cloud Archive wallaby series:
Fix Committed
Status in Ubuntu Cloud Archive xena series:
Fix Committed
Status in neutron:
In Progress
Status in neutron package in Ubuntu:
Triaged
Status in neutron source package in Focal:
Fix Committed
Status in neutron source package in Hirsute:
Fix Committed
Status in neutron source package in Impish:
Triaged
Bug description:
I have an OpenStack Ussuri 16.3.2 deployment using OVN. When I create
a VM with one or more SR-IOV ports, it fails with:
2021-06-08 11:38:31.939 526862 WARNING nova.virt.libvirt.driver [req-c4be797e-7d7e-4e73-8406-f74ae51db192 696c98b722a44d229e16b6d6474a27d4 0b9102977dcc4d4ab662b48494bb3110 - 2e0bf6ec95c047d986a61f7570222149 2e0bf6ec95c047d986a61f7570222149] [instance: 7ab9b374-51eb-4e94-8055-c69e8a7d76b3] Timeout waiting for [('network-vif-plugged', 'c2b7c68d-c465-4ca2-869a-59bc73b2b595'), ('network-vif-plugged', 'a50de16a-29ac-4dca-9cb6-0247a932fbf3')] for instance with vm_state building and task_state spawning.: eventlet.timeout.Timeout: 300 seconds
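For triage, the ports that nova-compute was still waiting on can be pulled out of a warning like the one above. A minimal sketch, assuming the log message has been unwrapped onto a single line; the function name `pending_vif_ports` is hypothetical and not part of nova:

```python
import re

# Matches each ('network-vif-plugged', '<port-uuid>') tuple in the warning.
_EVENT_RE = re.compile(r"\('network-vif-plugged',\s*'([0-9a-f-]{36})'\)")


def pending_vif_ports(log_line):
    """Return the port UUIDs listed in a 'Timeout waiting for [...]' warning."""
    return _EVENT_RE.findall(log_line)


# Message portion of the warning quoted above.
line = (
    "Timeout waiting for [('network-vif-plugged', "
    "'c2b7c68d-c465-4ca2-869a-59bc73b2b595'), ('network-vif-plugged', "
    "'a50de16a-29ac-4dca-9cb6-0247a932fbf3')] for instance with vm_state "
    "building and task_state spawning."
)

for port_id in pending_vif_ports(line):
    print(port_id)
```

The resulting UUIDs can then be looked up with `openstack port show` to confirm they are the SR-IOV ports.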
A bit of analysis shows that nova-compute did its thing and sits there
waiting on network-vif-plugged. The sriov-agent then notices new VFs
configured and sends a get_devices_details_list() rpc call to neutron
and neutron never responds. Reverting to 16.3.1 fixes the issue.
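The failure mode described above (a waiter that blocks until an event arrives or a deadline passes) can be illustrated in isolation. This is only a sketch of the pattern, not nova's code: nova uses eventlet timeouts, whereas this stand-in uses `threading.Event` from the standard library, and the names `VifPlugTimeout` and `wait_for_vif_plugged` are hypothetical.

```python
import threading
import time


class VifPlugTimeout(Exception):
    """Raised when an expected event does not arrive in time (hypothetical name)."""


def wait_for_vif_plugged(events, timeout):
    """Block until every event is set, or raise once `timeout` seconds have passed."""
    deadline = time.monotonic() + timeout
    for port_id, event in events.items():
        remaining = max(0.0, deadline - time.monotonic())
        if not event.wait(remaining):
            raise VifPlugTimeout(port_id)


# One event per SR-IOV port. Nothing ever sets them, mirroring the case where
# neutron never answers the agent and so never notifies nova.
events = {
    "c2b7c68d-c465-4ca2-869a-59bc73b2b595": threading.Event(),
    "a50de16a-29ac-4dca-9cb6-0247a932fbf3": threading.Event(),
}

try:
    # The real timeout in the log is 300 seconds; a short value keeps the demo quick.
    wait_for_vif_plugged(events, timeout=0.05)
except VifPlugTimeout as exc:
    print("timed out waiting for network-vif-plugged on port", exc)
```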
Taking a closer look at 16.3.2 by reverting patches led to [1] as the
culprit.
[1]
https://github.com/openstack/neutron/commit/7cf9597570f288d27768dc5ff7be04824d09f8bc
=== Ubuntu SRU details ===
[Impact]
[Test Case]
See above.
I think for testing we can run standard regression testing with OVN/neutron deployments plus tempest testing.
For now we are planning to revert the commit as a stop-gap to prevent
further upgrades from being regressed.
[Regression Potential]
There is regression potential in that the patch being reverted
contributes partial fixes to the following related bugs:
https://bugs.launchpad.net/neutron/+bug/1894117
https://bugs.launchpad.net/neutron/+bug/1903008
Considering that most OpenStack users are on Ussuri at this point and
that 16.3.2 has not been available for very long, the revert we are
proposing would seem to have the least regression potential.
To manage notifications about this bug go to:
https://bugs.launchpad.net/cloud-archive/+bug/1931244/+subscriptions