ACK: [SRU][F][PATCH 0/3] Fix nouveau error storm and unresponsive display after desktop idle timeout
Thibault Ferrante
thibault.ferrante at canonical.com
Tue Jan 7 13:56:35 UTC 2025
Acked-by: Thibault Ferrante <thibault.ferrante at canonical.com>
On 10-12-2024 23:03, Jacob Martin wrote:
> BugLink: https://bugs.launchpad.net/bugs/2078011
>
> SRU Justification
>
> [Impact]
>
> On a system with a GV100 GPU using the nouveau driver, the display becomes
> unresponsive and a storm of "nouveau 0000:07:00.0: disp: ctrl 00000080"
> messages are continuously printed to dmesg once the desktop environment reaches
> its idle timeout. This is interfering with certification testing for the DGX
> Station desktop system, as the system eventually will become unresponsive
> during testing.
>
> [Fix]
>
> This only affects Focal.
>
> Backporting the following patches from K5.6 resolves the issue:
> 58ae5284f6 ("drm/nouveau/disp/gv100-: halt NV_PDISP_FE_RM_INTR_STAT_CTRL_DISP_ERROR storms")
> 5bb88d0794 ("drm/nouveau/kms/gv100-: move window ownership setup into modesetting path")
> 137c4ba716 ("drm/nouveau/kms/gv100-: avoid sending a core update until the first modeset")
>
> [Test Case]
>
> 1. Install desktop environment
> $ sudo apt install ubuntu-desktop
>
> 2. Configure GDM
> $ sudo vim /etc/gdm3/custom.conf
> => Uncomment WaylandEnable=false
> => Configure automatic login for the `ubuntu` user by setting
> AutomaticLoginEnable = true
> AutomaticLogin = ubuntu
>
> 3. Disable display timeout
> $ gsettings set org.gnome.desktop.session idle-delay 0
>
> 4. Set graphical as the default target
> $ sudo systemctl set-default graphical.target
>
> 5. Reboot the system
>
> 6. Enable 1 second display timeout and wait ~10 seconds
> $ gsettings set org.gnome.desktop.session idle-delay 1
>
> 7. Observe that after applying these patches, the display can wake up from idle
> and the system continues to be usable without a storm of "nouveau 0000:07:00.0:
> disp: ctrl 00000080" messages in dmesg.
>
> [Where things could go wrong]
>
> These changes affect only the nouveau driver. Issues would appear as
> misbehavior of the nouveau driver, mostly likely for Volta NVIDIA GPUs.
>
> Ben Skeggs (3):
> drm/nouveau/disp/gv100-: halt NV_PDISP_FE_RM_INTR_STAT_CTRL_DISP_ERROR
> storms
> drm/nouveau/kms/gv100-: move window ownership setup into modesetting
> path
> drm/nouveau/kms/gv100-: avoid sending a core update until the first
> modeset
>
> drivers/gpu/drm/nouveau/dispnv50/core.h | 6 +++++
> drivers/gpu/drm/nouveau/dispnv50/corec37d.c | 23 +++++++++++++++----
> drivers/gpu/drm/nouveau/dispnv50/corec57d.c | 9 ++++----
> drivers/gpu/drm/nouveau/dispnv50/disp.c | 16 +++++++++++++
> .../gpu/drm/nouveau/nvkm/engine/disp/gv100.c | 6 +++++
> 5 files changed, 50 insertions(+), 10 deletions(-)
>
More information about the kernel-team
mailing list