Closed
Bug 1198317
Opened 9 years ago
Closed 9 years ago
reduce the number of available b-2008-ix instances in TRY in order to force y-2008-spot instantiation
Categories
(Infrastructure & Operations :: RelOps: Puppet, task)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: grenade, Assigned: grenade)
References
Details
(Whiteboard: [windows][aws])
Attachments
(1 file)
(deleted),
image/png
|
Details |
disabled instances:
b-2008-ix-0036
b-2008-ix-0039
b-2008-ix-0019
b-2008-ix-0038
b-2008-ix-0025
b-2008-ix-0054
b-2008-ix-0022
b-2008-ix-0058
b-2008-ix-0026
b-2008-ix-0059
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0036
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0039
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0019
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0038
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0022
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0025
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0026
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0059
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0054
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0058
Assignee | ||
Comment 1•9 years ago
|
||
disabled instances extended to include:
b-2008-ix-0030
b-2008-ix-0174
b-2008-ix-0057
b-2008-ix-0023
b-2008-ix-0047
b-2008-ix-0046
b-2008-ix-0041
b-2008-ix-0044
b-2008-ix-0061
b-2008-ix-0055
b-2008-ix-0043
b-2008-ix-0049
b-2008-ix-0035
b-2008-ix-0029
b-2008-ix-0031
b-2008-ix-0062
b-2008-ix-0045
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0174
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0061
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0044
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0030
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0057
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0041
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0043
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0023
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0049
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0031
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0062
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0045
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0029
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0035
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0046
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0047
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0055
Assignee | ||
Comment 2•9 years ago
|
||
all machines returned to pool.
will disable more tomorrow.
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0040
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0024
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0020
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0184
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0060
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0032
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0183
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0064
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0027
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0033
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0021
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0051
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0037
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0028
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0048
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0042
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0063
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0181
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0182
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0050
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0034
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0056
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0052
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0018
Assignee | ||
Updated•9 years ago
|
Blocks: b-2008-ix-0173
Assignee | ||
Comment 3•9 years ago
|
||
progress:
- reduced ix capacity to a single instance (b-2008-ix-0043)
- pushed win32, win64 m-c build to try (https://treeherder.mozilla.org/#/jobs?repo=try&revision=272cab1322fc)
- observed messages in watch pending log indicating our max bid price (0.4) would not be successful
- updated max bid price for y-2008 to 0.5 (https://github.com/mozilla/build-cloud-tools/pull/109)
- observed successful spot requests in ec2 console (3 for use1, 3 for usw2, as expected/configured in slavealloc)
- observed spot instances starting, successfully running userdata, naming themselves and mailing logs
- now awaiting build output at https://ftp-ssl.mozilla.org/pub/mozilla.org/firefox/try-builds/rthijssen@mozilla.com-272cab1322fc
Assignee | ||
Comment 4•9 years ago
|
||
us-east-1:
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-001
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-002
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-003
us-west-2:
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-101
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-102
https://secure.pub.build.mozilla.org/builddata/reports/slave_health/slave.html?class=try&name=y-2008-spot-103
Assignee | ||
Comment 5•9 years ago
|
||
the us-east-1 instances appear to have hung mid build. rdp'ing to the instances (001 - 003) as cltbld shows this running but apparently going nowhere cmd prompt.
Attachment #8653370 -
Flags: feedback?(mcornmesser)
Assignee | ||
Comment 6•9 years ago
|
||
the us-west-2 instances have all terminated. I cannot find any evidence that they did any work before terminating (slave_health/treeherder). The PaperTrail logs end like this:
Aug 27 02:09:48 y-2008-spot-101.try.releng.usw2.mozilla.com USER32: The process c:\windows\SysWOW64\shutdown.exe (Y-2008-SPOT-101) has initiated the shutdown of computer Y-2008-SPOT-101 on behalf of user Y-2008-SPOT-101\cltbld for the following reason: No title for this reason could be found Reason Code: 0x800000ff Shutdown Type: shutdown Comment: #015
Assignee | ||
Comment 7•9 years ago
|
||
I think we've demonstrated that the spinning up and terminating processes work. We obviously have work to do to get mozilla-build's undies untwisted, but that's another bug...
Status: NEW → RESOLVED
Closed: 9 years ago
Resolution: --- → FIXED
Comment 8•9 years ago
|
||
There are alerts in #buildduty that indicate there's a buildbot misconfiguration/missing configuration:
[sns alert] Thu 06:08:03 PDT buildbot-master78.bb.releng.usw2.mozilla.com watch_twistd_log.py: Count: 675 | First instance: 2015-08-27 05:28:27-0700 | Most recent instance: 2015-08-27 06:00:02-0700 | Twistd exception: twisted.cred.error.UnauthorizedLogin - unknown 10.132.67.67
[sns alert] Thu 06:08:03 PDT buildbot-master78.bb.releng.usw2.mozilla.com watch_twistd_log.py: Count: 681 | First instance: 2015-08-27 05:28:27-0700 | Most recent instance: 2015-08-27 06:00:01-0700 | Twistd exception: twisted.cred.error.UnauthorizedLogin - unknown 10.132.67.101
I've verified that those are windows spot instances
10.132.67.67 (y-2008-spot-103) and 10.132.67.101 (y-2008-spot-102)
Comment 9•9 years ago
|
||
All of the alerts were for use1 IPs, I didn't see any for usw2.
Assignee | ||
Updated•9 years ago
|
Attachment #8653370 -
Flags: feedback?(mcornmesser)
You need to log in
before you can comment on or make changes to this bug.
Description
•