[Bug 1078926] Re: raring instance failed to find EC2 datasource
Steve Langasek
steve.langasek at canonical.com
Mon Dec 3 23:58:05 UTC 2012
** Description changed:
+ [Impact]
+ The previous SRU, while fixing the problem it was intended to fix, partially reintroduced the problem from before 2.41 where some filesystem events would end up blocking on one another when they shouldn't. This is particularly noticeable for filesystems that have been mounted by the initramfs and there are jobs started on the 'mounted' event for one of these. In the particular case of cloud-init, the jobs that start on mounted MOUNTPOINT=/ block waiting for the network to come up, which needs the 'virtual-filesystems' event which isn't happening because the 'mounted MOUNTPOINT=/run' event hasn't finished processing - it's behind the 'mounted /' in the event queue. This intermittently causes cloud boot times in excess of 5 minutes.
+
+ [Test case]
+ 1. Boot the current quantal daily cloud images.
+ 2. Confirm that at least sometimes, the images take 5 minutes to boot.
+ 3. Upgrade mountall to the quantal-proposed version.
+ 4. Confirm that the images now boot without hitting the "wait for network" timeout.
+
+ [Regression potential]
+ Minimal, as this is correcting a regression from the previous SRU.
+
This seems sporadic failure at best. I had seen it on openstack fail similarly.
Today I launched 7 instances of us-east-1 t1.micro from each of :
ami-be70f5d7 ebs/ubuntu-raring-daily-amd64-server-20121109
ami-de27a2b7 ebs/ubuntu-raring-daily-amd64-server-20121110
ami-f6c94d9f ebs/ubuntu-raring-daily-amd64-server-20121111
ami-cc8307a5 ebs/ubuntu-raring-daily-amd64-server-20121112
ami-5c43c735 ebs/ubuntu-raring-daily-amd64-server-20121113
and then another one of 20121109 and 20121113.
One of the 2 20121109 failed to boot, and I will attach its console log
and cloud-init.log. The log was obtained from stopping the instance,
attaching to another system, and getting it. Then I restarted the
instance, and it booted fine.
Things that were of interest:
* from cloud-init's perspective, the thing that failed was finding the ec2 metadata service.
* In the failure case, there were no messages to the console log after initramfs, while
cloud-init's log shows it WARNing, which should go to console.
Related bugs:
* mountall: bug 1059471 2.41 fails to mount root partition
- * plymouth: bug 1086072 some output to /dev/console does not reach /dev/console
+ * plymouth: bug 1086072 some output to /dev/console does not reach /dev/console
ProblemType: Bug
DistroRelease: Ubuntu 13.04
Package: cloud-init 0.7.0-0ubuntu2
ProcVersionSignature: User Name 3.5.0-17.28-generic 3.5.5
Uname: Linux 3.5.0-17-generic x86_64
ApportVersion: 2.6.2-0ubuntu3
Architecture: amd64
Date: Wed Nov 14 21:30:05 2012
Ec2AMI: ami-be70f5d7
Ec2AMIManifest: (unknown)
Ec2AvailabilityZone: us-east-1b
Ec2InstanceType: t1.micro
Ec2Kernel: aki-825ea7eb
Ec2Ramdisk: unavailable
MarkForUpload: True
PackageArchitecture: all
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
SourcePackage: cloud-init
UpgradeStatus: No upgrade log present (probably fresh install)
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to mountall in Ubuntu.
https://bugs.launchpad.net/bugs/1078926
Title:
raring instance failed to find EC2 datasource
Status in “mountall” package in Ubuntu:
In Progress
Status in “mountall” source package in Quantal:
In Progress
Status in “mountall” source package in Raring:
In Progress
Bug description:
[Impact]
The previous SRU, while fixing the problem it was intended to fix, partially reintroduced the problem from before 2.41 where some filesystem events would end up blocking on one another when they shouldn't. This is particularly noticeable for filesystems that have been mounted by the initramfs and there are jobs started on the 'mounted' event for one of these. In the particular case of cloud-init, the jobs that start on mounted MOUNTPOINT=/ block waiting for the network to come up, which needs the 'virtual-filesystems' event which isn't happening because the 'mounted MOUNTPOINT=/run' event hasn't finished processing - it's behind the 'mounted /' in the event queue. This intermittently causes cloud boot times in excess of 5 minutes.
[Test case]
1. Boot the current quantal daily cloud images.
2. Confirm that at least sometimes, the images take 5 minutes to boot.
3. Upgrade mountall to the quantal-proposed version.
4. Confirm that the images now boot without hitting the "wait for network" timeout.
[Regression potential]
Minimal, as this is correcting a regression from the previous SRU.
This seems sporadic failure at best. I had seen it on openstack fail similarly.
Today I launched 7 instances of us-east-1 t1.micro from each of :
ami-be70f5d7 ebs/ubuntu-raring-daily-amd64-server-20121109
ami-de27a2b7 ebs/ubuntu-raring-daily-amd64-server-20121110
ami-f6c94d9f ebs/ubuntu-raring-daily-amd64-server-20121111
ami-cc8307a5 ebs/ubuntu-raring-daily-amd64-server-20121112
ami-5c43c735 ebs/ubuntu-raring-daily-amd64-server-20121113
and then another one of 20121109 and 20121113.
One of the 2 20121109 failed to boot, and I will attach its console
log and cloud-init.log. The log was obtained from stopping the
instance, attaching to another system, and getting it. Then I
restarted the instance, and it booted fine.
Things that were of interest:
* from cloud-init's perspective, the thing that failed was finding the ec2 metadata service.
* In the failure case, there were no messages to the console log after initramfs, while
cloud-init's log shows it WARNing, which should go to console.
Related bugs:
* mountall: bug 1059471 2.41 fails to mount root partition
* plymouth: bug 1086072 some output to /dev/console does not reach /dev/console
ProblemType: Bug
DistroRelease: Ubuntu 13.04
Package: cloud-init 0.7.0-0ubuntu2
ProcVersionSignature: User Name 3.5.0-17.28-generic 3.5.5
Uname: Linux 3.5.0-17-generic x86_64
ApportVersion: 2.6.2-0ubuntu3
Architecture: amd64
Date: Wed Nov 14 21:30:05 2012
Ec2AMI: ami-be70f5d7
Ec2AMIManifest: (unknown)
Ec2AvailabilityZone: us-east-1b
Ec2InstanceType: t1.micro
Ec2Kernel: aki-825ea7eb
Ec2Ramdisk: unavailable
MarkForUpload: True
PackageArchitecture: all
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
SourcePackage: cloud-init
UpgradeStatus: No upgrade log present (probably fresh install)
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mountall/+bug/1078926/+subscriptions
More information about the foundations-bugs
mailing list