[Acked/cmt] [SRU][Trusty][PATCH 0/2] fix "hw csum failure" errors

Jay Vosburgh jay.vosburgh at canonical.com
Fri Jan 16 17:09:39 UTC 2015


Andy Whitcroft <apw at canonical.com> wrote:

>On Thu, Jan 15, 2015 at 06:04:06PM -0800, Jay Vosburgh wrote:
>> 
>> BugLink: https://bugs.launchpad.net/bugs/1409123
>> 
>> SRU Justification
>> 
>> Impact: Certain network topologies using VXLAN with certain network
>> cards will experience "hw csum failure" errors in the kernel log
>> 
>> 	"Certain network toplogies" is VXLAN tunneled network between
>> two containers or VMs on separate hosts.
>> 
>> 	"Certain network cards" are at least enic and sky2.  This
>> probably includes any other device that performs CHECKSUM_COMPLETE
>> receive checksum offload.
>> 
>> Fix: backport the following two commits from current net-next
>
>Actually these are both in mainline now, as of v3.19-rc3.
>
>> commit 17e96834fd35997ca7cdfbf15413bcd5a36ad448
>> Author: Govindarajulu Varadarajan <_govind at gmx.com>
>> Date:   Thu Dec 18 15:58:42 2014 +0530
>> 
>>     enic: fix rx skb checksum
>> 
>> commit 2c26d34bbcc0b3f30385d5587aa232289e2eed8e
>> Author: Jay Vosburgh <jay.vosburgh at canonical.com>
>> Date:   Fri Dec 19 15:32:00 2014 -0800
>> 
>>     net/core: Handle csum for CHECKSUM_COMPLETE VXLAN forwarding
>
>Both seems to be trivial backports from those originals, and as such
>seem acceptable.  Depending on the severity of the symtoms, it is not
>clear to me if these are just annoying traces or data is being lost at
>the same time, we could take these in advance of them going to -stable.
>Also these probabally should go to -stable :).  I think davem handles
>this differently to everyone else of course, so they may be in plan for
>-stable already.

	The ultimate impact is that (a) traffic to a VM or container
ends up being checksummed in software, and (b) the kernel dmesg fills up
with big scary-looking stack traces.

	Yes, I'll be submitting a -stable request to netdev.  I just got
verification from the customer that the patches fix the problem and I
did the SRU first to get that rolling.

>Overall:
>
>Acked-by: Andy Whitcroft <apw at canonical.com>
>
>-apw

[... combining replies ...]
>And while I am thinking about it, if this is needed on Trusty, I assume
>that Utopic and the lts-backport thereof are also affected?  I assume we
>should be patching there too?  Again hopefully this will come out via
>stable.  I've nominated the bug to the various packages and series that
>seem relevant, and marked it up for commit tracking (so if they come in
>via stable it will also get closed out).

	Yes, the later releases should also need the patches, but there
is no rush there.

	-J

---
	-Jay Vosburgh, jay.vosburgh at canonical.com




More information about the kernel-team mailing list