[Bug 1696970] Re: softlockup DoS causes systemd-journald.service to abort with SIGABORT
Balint Reczey
balint.reczey at canonical.com
Thu Feb 15 13:29:43 UTC 2018
@colin-king In this case the journal was not broken. Do you have many
examples of journals that really were broken?
If so, do you have a solution in mind that would work better in the
scenarios you care about without causing regressions?
Proper infrastructure planning, with (CPU) quotas in place where it
matters, seems to be better practice than running highly loaded
servers.
The default 3 min watchdog timeout for systemd-journald seems
appropriate, or even generous.
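For reference, the limit comes from the unit's WatchdogSec= setting, which
can be overridden with a drop-in. This is a minimal sketch, assuming the
standard drop-in directory /etc/systemd/system/systemd-journald.service.d/.
The helper name write_watchdog_dropin, the file name watchdog.conf and the
10min value are illustrative assumptions, not a recommendation:

```shell
# Hypothetical helper: write a drop-in that overrides WatchdogSec= for a unit.
# $1 = drop-in directory, $2 = new limit as a systemd time span (e.g. 10min)
write_watchdog_dropin() {
    mkdir -p "$1" &&
    printf '[Service]\nWatchdogSec=%s\n' "$2" > "$1/watchdog.conf"
}

# Example (as root), then reload so systemd re-reads the unit files:
#   write_watchdog_dropin /etc/systemd/system/systemd-journald.service.d 10min
#   systemctl daemon-reload
```

Raising the limit only hides the symptom on a starved machine, so this is
something to reach for only where the load itself cannot be avoided.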
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to systemd in Ubuntu.
https://bugs.launchpad.net/bugs/1696970
Title:
softlockup DoS causes systemd-journald.service to abort with SIGABORT
Status in systemd package in Ubuntu:
Opinion
Status in systemd source package in Artful:
Opinion
Bug description:
I was running the new stress-ng softlockup stressor and observed that
systemd-journald gets killed with an abort and this corrupts the
systemd journal.
How to reproduce:
git clone git://kernel.ubuntu.com/cking/stress-ng
cd stress-ng
make clean; make
sudo ./stress-ng --softlockup 0 -t 360 -v
..and wait for 360 seconds. dmesg shows the following, 100%
reproducible:
[ 875.310331] systemd[1]: systemd-timesyncd.service: Watchdog timeout (limit 3min)!
[ 875.310740] systemd[1]: systemd-timesyncd.service: Killing process 574 (systemd-timesyn) with signal SIGABRT.
[ 875.327289] systemd[1]: systemd-timesyncd.service: Main process exited, code=killed, status=6/ABRT
[ 875.327666] systemd[1]: systemd-timesyncd.service: Unit entered failed state.
[ 875.327686] systemd[1]: systemd-timesyncd.service: Failed with result 'watchdog'.
[ 875.327917] systemd[1]: systemd-timesyncd.service: Service has no hold-off time, scheduling restart.
[ 875.327954] systemd[1]: Stopped Network Time Synchronization.
[ 875.328845] systemd[1]: Starting Network Time Synchronization...
[ 875.525071] systemd[1]: Started Network Time Synchronization.
[ 875.539619] systemd[1]: systemd-journald.service: Main process exited, code=dumped, status=6/ABRT
[ 875.544257] systemd-journald[5214]: File /run/log/journal/440e485e550040e3b93b66b2faae8525/system.journal corrupted or uncleanly shut down, renaming and replacing.
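The affected units can be picked out of output like the above with a small
filter. This is a minimal sketch that assumes the message format shown in
this log. The function name affected_units is hypothetical:

```shell
# Hypothetical helper: read kernel-log lines on stdin and print each unit that
# hit its watchdog limit or whose main process exited with status=6/ABRT.
affected_units() {
    grep -E 'Watchdog timeout|status=6/ABRT' |
    sed -E 's/^\[[^]]*\] systemd\[1\]: ([^:]+): .*/\1/' |
    sort -u
}

# Example: dmesg | affected_units
```

Run against the lines above, it lists systemd-journald.service and
systemd-timesyncd.service.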
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/systemd/+bug/1696970/+subscriptions