ACK/Cmnt: [SRU][F/aws][PATCH v3 0/6] aws: proper fix for c5.18xlarge hibernation issues

Andrea Righi andrea.righi at canonical.com
Wed May 19 19:33:13 UTC 2021


On Wed, May 19, 2021 at 12:23:22PM -0600, Tim Gardner wrote:
> Acked-by: Tim Gardner <tim.gardner at canonical.com>
> 
> pr_info() exists in focal/linux-aws. I'm curious why you didn't preserve it
> in patch 2/6 ?

Good point, I could have used pr_info(), but the original patch was
changing a pr_info() to another pr_info() and the original code has a
printk(), so I thought it was more consistent to keep the printk() and
change only the text like the original patch does...

-Andrea

> 
> On 5/19/21 9:15 AM, Andrea Righi wrote:
> > BugLink: https://bugs.launchpad.net/bugs/1920944
> > 
> > [Impact]
> > 
> > In LP: #1918694 we applied a fix and a workaround to solve the
> > hibernation issues on c5.18xlarge. The workaround was in the form of a
> > SAUCE patch:
> > 
> >    "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
> > 
> > It looks like we can replace this workaround with a proper fix, by
> > applying this patch:
> > 
> > http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
> > 
> > [Test plan]
> > 
> > Create a c5.18xlarge instance, run the memory stress test script (the
> > same test script that we are using to stress test hibernation), trigger
> > the hibernate event, trigger the resume event. Repeat a couple of times
> > and the problem is very likely to happen.
> > 
> > [Fix]
> > 
> > Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
> > with:
> > 
> > http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
> > 
> > The fix has been tested extensively in the AWS infrastructure with
> > positive results.
> > 
> > [Where problems could occur]
> > 
> > This new code introduced by the fix can be executed also when a CPU is
> > put offline, so we may see potential regressions in the KVM CPU
> > hotplugging.
> > 
> > ----------------------------------------------------------------
> > Changelog (v2 -> v3):
> >   - updated backported / signed-off lines with the right upstream info
> >     (thanks Guilherme!)
> > 
> > NOTE: backport activity was minimal, it only required some context
> > adjustments to properly apply the changes.
> > 
> > Andrea Righi (1):
> >        Revert "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
> > 
> > Vitaly Kuznetsov (5):
> >        x86/kvm: Fix pr_info() for async PF setup/teardown
> >        x86/kvm: Teardown PV features on boot CPU as well
> >        x86/kvm: Disable kvmclock on all CPUs on shutdown
> >        x86/kvm: Disable all PV features on crash
> >        x86/kvm: Unify kvm_pv_guest_cpu_reboot() with kvm_guest_cpu_offline()
> > 
> >   arch/x86/include/asm/kvm_para.h |   9 ++----
> >   arch/x86/kernel/kvm.c           | 113 ++++++++++++++++++++++++++++++++++++++++++++----------------------
> >   arch/x86/kernel/kvmclock.c      |  28 ++---------------
> >   3 files changed, 79 insertions(+), 71 deletions(-)
> > 
> > 
> 
> -- 
> -----------
> Tim Gardner
> Canonical, Inc



More information about the kernel-team mailing list