NACK - [SRU][J][PATCH 0/2] rcu stalls with many storage key guests (LP: 1975582)
Frank Heimes
frank.heimes at canonical.com
Fri Jun 10 12:50:07 UTC 2022
Please NACK, since this is not only for jammy, but also for impish and
focal (need to adjust the Subjects) on a v2
On Fri, Jun 10, 2022 at 2:46 PM <frank.heimes at canonical.com> wrote:
> BugLink: https://bugs.launchpad.net/bugs/1975582
>
> SRU Justification:
>
> [Impact]
>
> * Ubuntu on s390x KVM environments with lots of large guests with storage
> keys can be affected by rcu stalls.
>
> * These rcu stalls can cause the system to crash/dump.
>
> [Fix]
>
> * 3ae11dbcfac9 3ae11dbcfac906a8c3a480e98660a823130dc16a "s390/mm: use
> non-quiescing sske for KVM switch to keyed guest"
>
> * 6d5946274df1 6d5946274df1fff539a7eece458a43be733d1db8 "s390/gmap:
> voluntarily schedule during key setting"
>
> [Test Plan]
>
> * There is no trigger or direct test or re-creation of the
> problem situation possible, but...
>
> * and IBM z13 or LinuxONE (or never) LPAR is needed that
> runs Ubuntu Server 20.04 LTS or 18.04 LTS with HWE kernel
> and acts as KVM host with again several large guests running
> on top with storage groups.
>
> * Let such a system running for days under significant load
> and watch the logs for rcu issues.
>
> * Prior to the submission of this SRU patched test kernels
> for focal 5.4 and bionic hwe-5.4 were created and tested.
> They ran for days at a staging environemnt at IBM
> without further issues.
>
> * The modifications are all limited to s390x.
>
> * A test kernel was build (see below) that ran in a test environment
> at IBM under appropriate load for several days.
>
> [Where problems could occur]
>
> * Due to the change for the KVM switch to keyed guest
> from classic sske to non-quiescing sske
> the KVM behaviour might have changed and the storage keys harmed.
>
> * The now more generous scheduling while setting keys
> has an impact on the guest memory management and mapping
> which will lead to a different performance.
>
> * This, with the introduction of __s390_enable_skey_pmd and
> cond_resched, might increase the overhead in certain situations,
> but eventually improves the responsiveness over time,
> hence avoid rcu stalls.
>
> [Other Info]
>
> * Since the patches are upstream in 5.19-rc1,
> they will be included in the kernel that is planned for kinetic (5.19).
>
> * Hence this is an SRU to jammy, impish and focal.
>
> Christian Borntraeger (2):
> s390/gmap: voluntarily schedule during key setting
> s390/mm: use non-quiescing sske for KVM switch to keyed guest
>
> arch/s390/mm/gmap.c | 14 ++++++++++++++
> arch/s390/mm/pgtable.c | 2 +-
> 2 files changed, 15 insertions(+), 1 deletion(-)
>
> --
> 2.25.1
>
> --
> kernel-team mailing list
> kernel-team at lists.ubuntu.com
> https://lists.ubuntu.com/mailman/listinfo/kernel-team
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.ubuntu.com/archives/kernel-team/attachments/20220610/f63179d1/attachment.html>
More information about the kernel-team
mailing list