[Bug 116815] linux-image 2.6.15-28.55 regression from 2.6.15-28.53, crashes under network load

Mattias Wadenstein maswan at acc.umu.se
Fri May 25 11:15:37 UTC 2007


Public bug reported:

Binary package hint: linux-source-2.6.15

After doing the security upgrade to 2.6.15-28.55 our server started to
crash within minutes of boot. Backing down to 28.53 put is back in
stable operation again.

The lockups look like this:
[42950993.170000] BUG: soft lockup detected on CPU#0!
[42950993.170000]
[42950993.170000] Pid: 25106, comm:           downloader
[42950993.170000] EIP: 0060:[<c0189a0c>] CPU: 0
[42950993.170000] EIP is at posix_locks_deadlock+0x5c/0xc0
[42950993.170000]  EFLAGS: 00000202    Tainted: P       (2.6.15-28-server)
[42950993.170000] EAX: dfb67e40 EBX: cf67934c ECX: ffffffff EDX: da95f460
[42950993.170000] ESI: da95fb30 EDI: da95f1d8 EBP: cf67917c DS: 007b ES: 007b
[42950993.170000] CR0: 8005003b CR2: b5d25000 CR3: 1fa9f340 CR4: 000006f0
[42950993.170000]  [<c0189c12>] __posix_lock_file+0x82/0x5f0
[42950993.170000]  [<c0171684>] nameidata_to_filp+0x44/0x50
[42950993.170000]  [<c018b4b0>] fcntl_setlk+0x2d0/0x370
[42950993.170000]  [<c013c130>] autoremove_wake_function+0x0/0x60
[42950993.170000]  [<c0186b38>] sys_fcntl64+0xb8/0xe0
[42950993.170000]  [<c0103313>] sysenter_past_esp+0x54/0x75


And there are serious page allocation failures, like:

ingrid-h.hpc2n.umu.se login: [42949719.420000] downloader: page allocation failure. order:1, mode:0x20
[42949719.420000]  [<c0154217>] __alloc_pages+0x217/0x320
[42949719.420000]  [<c014d674>] handle_IRQ_event+0x64/0x70
[42949719.420000]  [<c0157cb9>] kmem_getpages+0x49/0xe0
[42949719.420000]  [<c0158a67>] alloc_slabmgmt+0x57/0x60
[42949719.420000]  [<c0158c48>] cache_grow+0xa8/0x1b0
[42949719.420000]  [<c0158f54>] cache_alloc_refill+0x204/0x240
[42949719.420000]  [<c015928e>] __kmalloc+0x7e/0x80
[42949719.420000]  [<c028c5df>] __alloc_skb+0x5f/0x180
[42949719.420000]  [<c02c9ed5>] tcp_collapse+0x125/0x350
[42949719.420000]  [<c02ca233>] tcp_prune_queue+0x83/0x210
[42949719.420000]  [<c02c9671>] tcp_data_queue+0x561/0xca0

[.... full dump on http://www.acc.umu.se/~maswan/ubuntu/page-
alloc-28.55]

Running 28.53, we would occasionally get an OoM on a process, but not
total crashes like on 28.55.

/Mattias Wadenstein

** Affects: linux-source-2.6.15 (Ubuntu)
     Importance: Undecided
         Status: Unconfirmed

-- 
linux-image 2.6.15-28.55 regression from 2.6.15-28.53, crashes under network load
https://bugs.launchpad.net/bugs/116815
You received this bug notification because you are a member of Kernel
Bugs, which is a bug contact for linux-source-2.6.15 in ubuntu.




More information about the kernel-bugs mailing list