Closed
Bug 728535
(t-snow-r4-0007)
Opened 13 years ago
Closed 10 years ago
t-snow-r4-0007 problem tracking
Categories
(Infrastructure & Operations Graveyard :: CIDuty, task, P3)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: philor, Unassigned)
References
()
Details
(Whiteboard: [badslave?][buildduty][capacity])
Sadly, I didn't say how it was broken in bug 716326, so I don't know whether that was also like https://tbpl.mozilla.org/php/getParsedLog.php?id=9433872&tree=Mozilla-Inbound, where rm -rf tools and then rm -rf build both say "rm: tools/.hg/00changelog.i: Invalid argument" about every file, and then it craps out trying to download the build with "Cannot write to `firefox-13.0a1.en-US.mac.dmg' (Invalid argument)." I sort of think it was, though, so it probably needs its disk looked at rather than just another reimage. Maybe.
Anyway, it's once again chewing up jobs like crazy because it only takes a couple of seconds to fail to rm and then fail to save a downloaded build.
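For anyone who wants to catch this failure mode before a slave takes a job, here is a minimal sketch of a write probe. It is not part of buildbot or slavealloc; the function name `probe_writable` and the idea of running it before accepting work are assumptions made for illustration. It only checks that a small file can be created, written, synced, and removed in a given directory.

```python
import errno
import os
import tempfile


def probe_writable(directory):
    """Try to create, write, sync and delete a small file in `directory`.

    Returns None on success, or the errno name (for example 'EINVAL')
    of the first OS-level failure.
    """
    try:
        fd, path = tempfile.mkstemp(prefix="probe-", dir=directory)
        try:
            os.write(fd, b"ok")
            os.fsync(fd)  # force the data to the device, not just the cache
        finally:
            os.close(fd)
        os.remove(path)
    except OSError as exc:
        # Map the numeric errno to its symbolic name where one is known.
        return errno.errorcode.get(exc.errno, str(exc.errno))
    return None
```

A caller could refuse new jobs whenever the probe returns anything other than None, which would stop a broken disk from burning through the queue a couple of seconds at a time.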
Comment 1•13 years ago
Disabled in slavealloc and on the buildbot master. It's refusing ssh access, so it will need some hands-on help.
Updated•13 years ago
Priority: -- → P3
Comment 2•13 years ago
Putting it back in the pool. Will monitor.
Assignee: nobody → coop
Status: NEW → ASSIGNED
Priority: P3 → P2
Comment 4•13 years ago
I disabled it per Philor's request in IRC and didn't realize it already had a bug (teach me to not check slavealloc *before* filing bug...)
Comment 5•13 years ago
decommission?
Comment 6•13 years ago
(In reply to Armen Zambrano G. [:armenzg] - Release Engineer from comment #5)
> decommission?
rev4 machines are (essentially) brand-new, so I certainly hope not. Please file a server ops bug to get it fixed.
Comment 7•13 years ago
Handing back to buildduty now that hands-on has been requested.
Alias: talos-r4-snow-007
Assignee: coop → nobody
Severity: major → normal
Status: ASSIGNED → NEW
Priority: P2 → P3
Summary: talos-r4-snow-007 is broken → talos-r4-snow-007
Whiteboard: [badslave?][buildduty] → [badslave?][buildduty][capacity]
Updated•13 years ago
Assignee: nobody → bhearsum
Updated•13 years ago
Summary: talos-r4-snow-007 → talos-r4-snow-007 problem tracking
Updated•13 years ago
Assignee: bhearsum → nobody
Component: Release Engineering → Release Engineering: Machine Management
QA Contact: release → armenzg
Updated•13 years ago
Status: NEW → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Comment 8•13 years ago
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156650&tree=Mozilla-Inbound
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156669&tree=Mozilla-Inbound
rm: tools/.hg: Device not configured
rm: tools/.hgignore: Invalid argument
rm: tools/.hgtags: Invalid argument
rm: tools/.pylintrc: Invalid argument
rm: tools/breakpad: Device not configured
rm: tools/buildbot-helpers: Device not configured
rm: tools/buildfarm: Device not configured
rm: tools/cdmaker: Device not configured
rm: tools/clobberer: Device not configured
rm: tools/graphserver_webapp: Device not configured
rm: tools/lib: Device not configured
rm: tools/MANIFEST.in: Invalid argument
rm: tools/misc: Device not configured
rm: tools/release: Device not configured
rm: tools/scripts: Device not configured
rm: tools/setup.py: Invalid argument
rm: tools/stage: Device not configured
rm: tools/sut_tools: Device not configured
rm: tools/trychooser: Device not configured
rm: tools: Directory not empty
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Comment 9•13 years ago
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156809&tree=Birch
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156805&tree=Birch
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156825&tree=Birch
rm: tools/.hg: Device not configured
rm: tools/.hgignore: Invalid argument
rm: tools/.hgtags: Invalid argument
rm: tools/.pylintrc: Invalid argument
rm: tools/breakpad: Device not configured
rm: tools/buildbot-helpers: Device not configured
rm: tools/buildfarm: Device not configured
rm: tools/cdmaker: Device not configured
rm: tools/clobberer: Device not configured
rm: tools/graphserver_webapp: Device not configured
rm: tools/lib: Device not configured
rm: tools/MANIFEST.in: Invalid argument
rm: tools/misc: Device not configured
rm: tools/release: Device not configured
rm: tools/scripts: Device not configured
rm: tools/setup.py: Invalid argument
rm: tools/stage: Device not configured
rm: tools/sut_tools: Device not configured
rm: tools/trychooser: Device not configured
rm: tools: Directory not empty
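To make the pattern in these rm logs easier to see, here is a small sketch that tallies the failures by error. The message-to-errno mapping uses the standard BSD/macOS strings (ENXIO prints as "Device not configured" there); the function name `summarize_rm_errors` is hypothetical, and the parser assumes every failure line has the form "rm: <path>: <message>".

```python
from collections import Counter

# Messages as printed by rm in the logs above, mapped to the errno names
# that produce those strings on BSD/macOS.
MESSAGE_TO_ERRNO = {
    "Device not configured": "ENXIO",
    "Invalid argument": "EINVAL",
    "Directory not empty": "ENOTEMPTY",
}


def summarize_rm_errors(log_text):
    """Count rm failures per errno name; unknown messages are kept verbatim."""
    counts = Counter()
    for line in log_text.splitlines():
        line = line.strip()
        if not line.startswith("rm: "):
            continue  # ignore anything that is not an rm error line
        # "rm: <path>: <message>" -> take the text after the last ": "
        _, _, message = line.rpartition(": ")
        counts[MESSAGE_TO_ERRNO.get(message, message)] += 1
    return counts
```

In the logs above, the directories appear to fail with "Device not configured" and the regular files with "Invalid argument", which looks more like a failing device than a permissions or path problem.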
Comment 10•13 years ago
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156872&tree=Birch
https://tbpl.mozilla.org/php/getParsedLog.php?id=11156764&tree=Birch
Connecting to build.mozilla.org|10.2.74.128|:80... connected.
HTTP request sent, awaiting response... 200 OK
Length: 1,169 (1.1K) [application/x-sh]
installdmg.sh: Invalid argument
Cannot write to `installdmg.sh' (Invalid argument).
program finished with exit code 1
Comment 17•13 years ago
Disabled in slavealloc; ssh'ing in to make sure it's offline.
Comment 18•13 years ago
Something is wrong with the hard drive on this host. The activity above points to a bad or full drive, and df -h shows that it is not full:
talos-r4-snow-007:~ cltbld$ df -h
Filesystem      Size   Used  Avail Capacity  Mounted on
/dev/disk0s2   298Gi  8.3Gi  289Gi     3%    /
devfs          106Ki  106Ki    0Bi   100%    /dev
map -hosts       0Bi    0Bi    0Bi   100%    /net
map auto_home    0Bi    0Bi    0Bi   100%    /home
Please pull this unit and check its hard drive.
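The "is it full?" half of that check can be sketched in a few lines. This is an illustration only: `parse_df` and `real_disks_over` are hypothetical names, the 90% threshold is an arbitrary assumption, and the parser assumes mount points contain no spaces (filesystem names such as "map -hosts" may).

```python
def parse_df(output):
    """Parse `df -h` text into dicts, reading columns from the right because
    filesystem names such as 'map -hosts' contain spaces."""
    rows = []
    for line in output.strip().splitlines()[1:]:  # skip the header row
        parts = line.split()
        if len(parts) < 6:
            continue  # not a data row
        size, used, avail, capacity, mount = parts[-5:]
        rows.append({
            "filesystem": " ".join(parts[:-5]),
            "size": size,
            "used": used,
            "avail": avail,
            "capacity_pct": int(capacity.rstrip("%")),
            "mount": mount,
        })
    return rows


def real_disks_over(rows, threshold_pct=90):
    """Return mounts on /dev/ block devices at or above the threshold.

    Pseudo filesystems (devfs, automount maps) always report 100% and
    are ignored.
    """
    return [r["mount"] for r in rows
            if r["filesystem"].startswith("/dev/")
            and r["capacity_pct"] >= threshold_pct]
```

Fed the df -h output above, the only real block device is /dev/disk0s2 at 3%, so a full disk is ruled out and a failing drive remains the likely cause.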
Comment 19•13 years ago
This slave has returned from repairs with a new HDD. Re-imaged and back in SCL1.
Comment 20•13 years ago
Back in production
Status: REOPENED → RESOLVED
Closed: 13 years ago → 13 years ago
Resolution: --- → FIXED
Comment 21•12 years ago
Needs a reboot, and to be added back to nagios.
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
Updated•12 years ago
Status: REOPENED → RESOLVED
Closed: 13 years ago → 12 years ago
Resolution: --- → FIXED
Comment 22•12 years ago
Rebooted via PDU after nagios reported it down.
Updated•11 years ago
Product: mozilla.org → Release Engineering
Updated•11 years ago
Alias: talos-r4-snow-007 → t-snow-r4-0007
Updated•11 years ago
Summary: talos-r4-snow-007 problem tracking → t-snow-r4-0007 problem tracking
Comment 23•10 years ago
Attempting SSH reboot...Failed.
Attempting PDU reboot...Failed.
Filed IT bug for reboot (bug 1037831)
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
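The escalation in comment 23 (SSH reboot, then PDU reboot, then an IT bug) can be sketched as an ordered list of attempts. This is not the tool that produced that comment; `reboot_with_escalation` and the callables passed to it are hypothetical stand-ins, each assumed to take a hostname and return True on success.

```python
def reboot_with_escalation(host, steps):
    """Try each (name, action) pair in order and stop at the first success.

    Returns (name_of_successful_step_or_None, list_of_log_lines).
    """
    log = []
    for name, action in steps:
        try:
            ok = bool(action(host))
        except Exception as exc:  # a crashing step must not stop escalation
            ok = False
            log.append(f"{name} raised {exc!r}")
        log.append(f"Attempting {name}...{'OK' if ok else 'Failed'}.")
        if ok:
            return name, log
    return None, log
```

A caller would pass something like `[("SSH reboot", ssh_reboot), ("PDU reboot", pdu_reboot), ("IT bug filing", file_it_bug)]`, where all three functions are placeholders for whatever the local tooling provides. Keeping the steps as data makes it easy to reorder them or add a new fallback.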
Updated•10 years ago
Status: REOPENED → RESOLVED
Closed: 12 years ago → 10 years ago
Resolution: --- → FIXED
Updated•7 years ago
Product: Release Engineering → Infrastructure & Operations
Updated•5 years ago
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard