[Bug 1904585] Re: opal-prd: Have a worker process handle page offlining (Fixes "PlatServices: dyndealloc memory_error() failed" is getting reported in error log (opal-prd))

Launchpad Bug Tracker 1904585 at bugs.launchpad.net
Tue Dec 15 18:09:54 UTC 2020


This bug was fixed in the package skiboot - 6.6.2-1ubuntu1

---------------
skiboot (6.6.2-1ubuntu1) hirsute; urgency=medium

  * opal-prd: mmap(range:ibm,hbrt-code-image,...) fails with EPERM
     (LP: #1905393) opal-prd fails to start
     d/patches/0005-fix-opal-prd-fail-with-EPERM.patch
  * opal-prd: Have a worker process handle page offlining
     (LP: #1904585) Have a worker process handle page offlining
     d/patches/0006-fix-opal-prd-have-worker-process-handle-page-offlining.patch

 -- Matthieu Clemenceau <matthieu.clemenceau at canonical.com>  Mon, 07 Dec 2020 17:50:16 -0600

** Changed in: skiboot (Ubuntu Hirsute)
       Status: In Progress => Fix Released

-- 
You received this bug notification because you are a member of Ubuntu
Sponsors Team, which is subscribed to the bug report.
https://bugs.launchpad.net/bugs/1904585

Title:
  opal-prd: Have a worker process handle page offlining (Fixes
  "PlatServices: dyndealloc memory_error() failed" is getting reported
  in error log (opal-prd))

Status in The Ubuntu-power-systems project:
  In Progress
Status in skiboot package in Ubuntu:
  Fix Released
Status in skiboot source package in Xenial:
  In Progress
Status in skiboot source package in Bionic:
  In Progress
Status in skiboot source package in Focal:
  In Progress
Status in skiboot source package in Groovy:
  In Progress
Status in skiboot source package in Hirsute:
  Fix Released

Bug description:
  ---Problem Description---

  https://github.com/open-power/skiboot/commit/8cbd0de88d162e387f11569eee1bdecef8fad2e3

  opal-prd: Have a worker process handle page offlining

  The memory_error() hservice interface expects the memory_error() call to
  just accept the offline request and return without actually offlining the
  memory. Currently we will attempt to offline the marked pages before
  returning to HBRT which can result in an excessively long time spent in the
  memory_error() hservice call which blocks HBRT from processing other
  errors. Fix this by adding a worker process which performs the page
  offlining via the sysfs memory error interfaces.

  Reviewed-by: Vasant Hegde <hegdevasant at linux.vnet.ibm.com>
  Signed-off-by: Oliver O'Halloran <oohall at gmail.com>
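
  The pattern the commit message describes can be sketched roughly as
  follows. This is an illustration only, not the opal-prd code (which is
  written in C): the function names, the request format on the pipe, and
  the choice of the soft_offline_page sysfs file are all assumptions made
  for the example. The point is that memory_error() only queues the
  request and returns, while a forked worker does the slow sysfs writes.

```python
import os
import struct

# Assumption for illustration: Linux exposes this file when memory-failure
# handling is enabled; writing a physical address asks the kernel to
# soft-offline the page containing it.
SOFT_OFFLINE = "/sys/devices/system/memory/soft_offline_page"

# Hypothetical wire format for one request: (start address, end address).
REQ = struct.Struct("=QQ")


def soft_offline_page(addr):
    """Ask the kernel to offline one page (potentially slow)."""
    with open(SOFT_OFFLINE, "w") as f:
        f.write("0x%x\n" % addr)


def worker_loop(rfd, offline_page, page_size):
    """Worker side: read queued ranges and offline them page by page."""
    with os.fdopen(rfd, "rb") as pipe:
        while True:
            data = pipe.read(REQ.size)
            if len(data) < REQ.size:
                break  # parent closed its end of the pipe
            start, end = REQ.unpack(data)
            for addr in range(start, end + 1, page_size):
                try:
                    offline_page(addr)
                except OSError:
                    pass  # a real daemon would log the failure


def start_worker(offline_page=soft_offline_page, page_size=None):
    """Fork the worker; return (worker pid, write end of the pipe)."""
    if page_size is None:
        page_size = os.sysconf("SC_PAGE_SIZE")
    rfd, wfd = os.pipe()
    pid = os.fork()
    if pid == 0:
        os.close(wfd)
        try:
            worker_loop(rfd, offline_page, page_size)
        finally:
            os._exit(0)
    os.close(rfd)
    return pid, wfd


def memory_error(wfd, start, end):
    """Caller side: queue the request and return without offlining."""
    try:
        os.write(wfd, REQ.pack(start, end))
    except OSError:
        return -1
    return 0
```

  In this sketch the caller would run start_worker() once at startup and
  call memory_error(wfd, start, end) for each request. The request is 16
  bytes, well below PIPE_BUF, so each write to the pipe is atomic and the
  caller is not held up by how long the kernel takes to offline the pages.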

  Thanks in advance for your support.
   
  Machine Type = Power8 and Power9 OPAL systems 
   
  ---Steps to Reproduce---
  * Inject memory error (UE) 
  * Observe that opal-prd does not return to the platform right away after the memory offlining request; it blocks until the pages have been offlined
   
  Userspace tool common name: opal-prd 
   
  We need this fix for 16.04.x and 18.04.x LTS releases.

  The fix is also needed for 20.04 and 20.10.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-power-systems/+bug/1904585/+subscriptions


