[Bug 2054391] Re: Fix CPU thermal sensors enumeration

Christian Ehrhardt  2054391 at bugs.launchpad.net
Fri Jun 21 07:24:48 UTC 2024


Local test build has been happy as well.

Your test description is not wrong, but it is too incomplete for what an SRU should have.
"Check the output of thermald" could use some "how exactly" and examples of the good and bad cases.

I got a system in testflinger with a modern CPU, hoping it would expose
this issue, so that I can test and try to improve the description to be
more acceptable.

-- 
You received this bug notification because you are a member of Ubuntu
Sponsors, which is subscribed to the bug report.
https://bugs.launchpad.net/bugs/2054391

Title:
  Fix CPU thermal sensors enumeration

Status in HWE Next:
  New
Status in thermald package in Ubuntu:
  Fix Released
Status in thermald source package in Jammy:
  Confirmed

Bug description:
  [Impact]
  Some CPU sensors are not enumerated, which can make thermald deviate from the correct behavior for the CPU TDP.

  [Fix]
  Traverse all sensors under the hwmon sysfs directory to make sure everything is enumerated.
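
  To see which temperature sensors the kernel exposes on a test machine, the hwmon directory can be walked by hand. The following is a minimal sketch and not thermald's implementation; the function name list_hwmon_temps is made up for illustration, and it assumes the usual sysfs layout of /sys/class/hwmon/hwmonN/ containing a name file and tempN_input files.

```shell
# list_hwmon_temps [ROOT]
# Print "<chip name> <path>" for every temp*_input file found in the
# hwmon* directories below ROOT (default: /sys/class/hwmon).
# Hypothetical helper for manual inspection, not part of thermald.
list_hwmon_temps() {
    root="${1:-/sys/class/hwmon}"
    for dir in "$root"/hwmon*; do
        # Skip the unexpanded glob when no hwmon device exists.
        if [ ! -d "$dir" ]; then
            continue
        fi
        # The chip name is optional; fall back to a placeholder.
        name="unknown"
        if [ -r "$dir/name" ]; then
            name="$(cat "$dir/name")"
        fi
        for input in "$dir"/temp*_input; do
            if [ -e "$input" ]; then
                printf '%s %s\n' "$name" "$input"
            fi
        done
    done
}
```

  Running list_hwmon_temps without arguments lists the sensors of the running system, which gives a rough idea of how many sensors thermald could be expected to find.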

  [Test]
  Check the output of thermald. Once the fix is in place, thermal zones that were previously omitted now show up, for example:
  [INFO]Zone 1: AMBF, Active:1 Bind:1 Sensor_cnt:1
  To do so:
  0. get a large machine, which will have more thermal zones
  1. stop the potentially auto-running service
     $ systemctl stop thermald
  2. run the daemon in the foreground with a log level that shows what is going on.
     On many modern systems (i.e. the large ones) thermald might not know the CPU ID;
     to bypass that for the test you can ask it to ignore the check
     $ sudo thermald --no-daemon --loglevel=info --ignore-cpuid-check
  3. check the output
     On init the system will be probed and that will show something like:

  ...
   ZONE DUMP BEGIN
  [1718954645][INFO]Zone 2: cpu, Active:1 Bind:0 Sensor_cnt:1
  ...
  [1718954645][INFO]Zone 3: cpu, Active:1 Bind:0 Sensor_cnt:1
  ...
   ZONE DUMP END

  On systems with many thermal zones, one would see only a few zones
  in this dump before the fix, and more zones with the fix.
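
  To make the before/after comparison less error-prone, the zone lines can be counted instead of read by eye. This is a rough sketch that assumes the log format shown above (a "ZONE DUMP BEGIN" / "ZONE DUMP END" pair around "[INFO]Zone N:" lines) and that the foreground output was saved to a file; the function name count_zones is made up for illustration.

```shell
# count_zones
# Read thermald log text on stdin and print how many "[INFO]Zone N:"
# lines appear between "ZONE DUMP BEGIN" and "ZONE DUMP END".
# If the log contains several dumps, the lines of all of them are summed.
count_zones() {
    awk '
        /ZONE DUMP BEGIN/ { in_dump = 1; next }
        /ZONE DUMP END/   { in_dump = 0 }
        in_dump && /\[INFO\]Zone [0-9]+:/ { n++ }
        END { print n + 0 }
    '
}
```

  With the output of the unfixed and the fixed package saved as, for example, before.log and after.log (placeholder names), "count_zones < before.log" and "count_zones < after.log" can be compared; on an affected system the second number is expected to be larger.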


  [Where problems could occur]
  Since the new logic traverses the whole hwmon sysfs directory, startup can take slightly longer.

To manage notifications about this bug go to:
https://bugs.launchpad.net/hwe-next/+bug/2054391/+subscriptions
