[Bug 1659195] Re: resolver regression on ubuntu-core from #1636912

Michael Vogt michael.vogt at canonical.com
Fri Apr 21 20:27:54 UTC 2017


Unfortunately there is no easy way to reproduce without building a new
core. I build a core snap with the "networkd-allow-networkd-to-start-in-
early-boot.patch" included.

With that our tests fail all over the place, e.g.:
https://s3.amazonaws.com/archive.travis-ci.org/jobs/224387435/log.txt -
search for "dial tcp: lookup".

I created an image with the problematic core (it is revision 1754) and
ran the testsuite:

With an image generated with core r1754:
$ kvm -m 1500 -redir tcp:10022::22 ./ubuntu-core-16-amd64.img/pc.img -snapshot
$ export SPREAD_EXTERNAL_ADDRESS=localhost:10022
$ ./tests/lib/external/prepare-ssh.sh localhost 10022
$ spread -v -reuse external:ubuntu-core-16-64

but on qemu it is fine. However when I try this on linode I get:
...
error: cannot install "jq": Get https://search.apps.ubuntu.com/api/v1/snaps/details/jq?channel=stable&fields=anon_download_url%2Carchitecture%2Cchannel%2Cdownload_sha3_384%2Csummary%2Cdescription%2Cdeltas%2Cbinary_filesize%2Cdownload_url%2Cepoch%2Cicon_url%2Clast_updated%2Cpackage_name%2Cprices%2Cpublisher%2Cratings_average%2Crevision%2Cscreenshot_urls%2Csnap_id%2Csupport_url%2Ccontact%2Ctitle%2Ccontent%2Cversion%2Corigin%2Cdeveloper_id%2Cprivate%2Cconfinement: dial tcp: lookup search.apps.ubuntu.com on [::1]:53: read udp [::1]:54730->[::1]:53: read: connection refused
...
2017/04/21 20:54:24 Failed tasks: 6
    - linode:ubuntu-core-16-64:tests/main/auto-aliases
    - linode:ubuntu-core-16-64:tests/main/searching
    - linode:ubuntu-core-16-64:tests/main/snap-connect
    - linode:ubuntu-core-16-64:tests/main/snap-download
    - linode:ubuntu-core-16-64:tests/main/ubuntu-core-classic
    - linode:ubuntu-core-16-64:tests/main/ubuntu-core-create-user
2017/04/21 20:54:24 Failed task prepare: 10
    - linode:ubuntu-core-16-64:tests/main/auto-refresh
    - linode:ubuntu-core-16-64:tests/main/interfaces-content
    - linode:ubuntu-core-16-64:tests/main/interfaces-content-empty-content-attr
    - linode:ubuntu-core-16-64:tests/main/interfaces-fuse_support
    - linode:ubuntu-core-16-64:tests/main/interfaces-kernel-module-control
    - linode:ubuntu-core-16-64:tests/main/interfaces-snapd-control
    - linode:ubuntu-core-16-64:tests/main/refresh:strict_remote
    - linode:ubuntu-core-16-64:tests/main/revert-devmode:remote
    - linode:ubuntu-core-16-64:tests/main/revert:remote
    - linode:ubuntu-core-16-64:tests/main/snap-auto-mount
2017/04/21 20:54:24 Failed task restore: 1
    - linode:ubuntu-core-16-64:tests/main/snap-auto-mount

so ~10% of the tests on linode fail. Unfortunately I don't know yet why
this is fine in qemu but not on linode.

-- 
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to systemd in Ubuntu.
https://bugs.launchpad.net/bugs/1659195

Title:
  resolver regression on ubuntu-core from #1636912

Status in systemd package in Ubuntu:
  Confirmed

Bug description:
  We had a bunch of test failures in our automatic tests for snapd since this ~Saturday (21.01.2017). All on core, all with errors like:
  """
  error: cannot install "test-snapd-tools": Get https://search.apps.ubuntu.com/api/v1/snaps/details/test-snapd-tools?channel=stable&fields=anon_download_url%2Carchitecture%2Cchannel%2Cdownload_sha3_384%2Csummary%2Cdescription%2Cdeltas%2Cbinary_filesize%2Cdownload_url%2Cepoch%2Cicon_url%2Clast_updated%2Cpackage_name%2Cprices%2Cpublisher%2Cratings_average%2Crevision%2Cscreenshot_urls%2Csnap_id%2Csupport_url%2Ctitle%2Ccontent%2Cversion%2Corigin%2Cdeveloper_id%2Cprivate%2Cconfinement: dial tcp: lookup search.apps.ubuntu.com on [::1]:53: read udp [::1]:41766->[::1]:53: read: connection refused
  """

  The key here is the: "dial tcp: lookup search.apps.ubuntu.com on
  [::1]:53: read udp [::1]:41766->[::1]:53: read: connection refused".
  It was random but in the ~150 integration tests we run for the core
  image we had about 5 of those everytime. Further debugging yielded
  that the /etc/resolv.conf is empty when the errors happen.

  It looks like the fix for
  https://bugs.launchpad.net/ubuntu/+source/systemd/+bug/1636912 is
  causing this behaviour for us. Reverting this change gave us stable
  tests again. I also tried to update to the latest resolvconf (the one
  that is reference in #1649931 and also in the followup bug #1649931).
  Using this resolvconf improved the situation dramatically. However we
  still got these errors from time to time with just updating resolvconf
  to the version in xenial-proposed. With the revert of #1649931 we have
  no prolbems so far in our test.

  If there is anything I can try in our image PPA to help with a fix I
  will be happy to do that.

  Thanks,
   Michael

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/systemd/+bug/1659195/+subscriptions



More information about the foundations-bugs mailing list