Closed
Bug 673972
Opened 13 years ago
Closed 13 years ago
returned iX hardware repairs
Categories
(Release Engineering :: General, defect)
Tracking
(Not tracked)
RESOLVED
FIXED
People
(Reporter: mlarrain, Unassigned)
References
()
Details
Per bug 666411 I tried to get it booted and found that the HDD is dead.
Needs to go back with iX next time they are onsite.
Reporter | ||
Updated•13 years ago
|
OS: Mac OS X → Windows Server 2003
Comment 1•13 years ago
|
||
zandr, can you contact iX about getting this serviced?
Assignee: server-ops → zandr
Reporter | ||
Comment 2•13 years ago
|
||
We have a few machines that need to go back to iX now: w64-ix-slave02 (has had issues beforehand with powering off after trying to image), w64-ix-slave41 (possible bad HDD, unable to be imaged), and w32-ix-slave06 (bad HDD).
Summary: w32-ix-slave06 DOA → Dead iX machines
Updated•13 years ago
|
Assignee: zandr → server-ops-releng
Component: Server Operations → Server Operations: RelEng
QA Contact: mrz → zandr
Updated•13 years ago
|
Assignee: server-ops-releng → zandr
colo-trip: --- → scl1
Reporter | ||
Comment 3•13 years ago
|
||
The list of machines with issues thus far is:
linux64-ix-slave14, w64-ix-slave02, w64-ix-slave41 and w32-ix-slave06
Updated•13 years ago
|
Alias: iX-repairs
Summary: Dead iX machines → iX hardware repairs
Comment 4•13 years ago
|
||
I added a note to all those hosts on slavealloc.
Reporter | ||
Comment 5•13 years ago
|
||
armen, I was able to fix w64-ix-slave41, and it can now go back into the pool.
Comment 6•13 years ago
|
||
(In reply to Matthew Larrain[:digipengi] from comment #5)
> armen I was able to fix w64-ix-slave41 and it can now go back into the pool.
Thanks Matt. I will set it up in bug 684019.
Comment 7•13 years ago
|
||
linux64-ix-slave37 - SMART detects bad disk at boot SATA Port0 ST3250318AS
linux64-ix-slave16 - WARNING: These memory DIMMs are NOT supported!!!
Updated•13 years ago
|
Assignee: zandr → mlarrain
Comment 8•13 years ago
|
||
These are part of the pile of hardware on Matt's desk that needs to be repaired. Zandr is introducing him to the iX rep today.
Comment 10•13 years ago
|
||
Matt, here are the details for these machines.
linux64-ix-slave14 A1-16186 4787 Machine Check Exception, CPU 2 (see bug 678907)
linux64-ix-slave16 A1-16188 4789 WARNING: These memory DIMMs are NOT supported!!!
linux64-ix-slave37 A1-16209 4810 SMART detects bad disk at boot SATA Port0 ST3250318AS
w64-ix-slave02 A1-16107 4708 has had issues before with powering off after trying to image
w32-ix-slave06 A1-16052 4600 Hard drive is not recognized
If the asset tags line up, can you send this data along to finney and see when they can pick them up?
Reporter | ||
Comment 11•13 years ago
|
||
Thank you, dustin, for this. I am checking the asset tags/serial #'s for verification and will get this passed along to Matt at iX tonight :)
Updated•13 years ago
|
colo-trip: scl1 → ---
Comment 12•13 years ago
|
||
Two more to add here, from bug 682574 and bug 684374.
Assignee: mlarrain → dustin
Comment 15•13 years ago
|
||
linux64-ix-slave14 A1-16186 4787 Machine Check Exception, CPU 2 (see bug 678907)
linux64-ix-slave16 A1-16188 4789 WARNING: These memory DIMMs are NOT supported!!!
linux64-ix-slave37 A1-16209 4810 SMART detects bad disk at boot SATA Port0 ST3250318AS
w64-ix-slave02 A1-16107 4708 has had issues before & powers off after imaging
w32-ix-slave06 A1-16052 4600 Hard drive is not recognized
w32-ix-slave35 A1-16098 4699 WARNING: These Memory DIMMs are NOT Supported!!!
w32-ix-slave03 A1-16049 4597 disk disappeared even from BIOS, now BSOD'ing
Reporter | ||
Comment 16•13 years ago
|
||
email sent to Matt at iX.
Updated•13 years ago
|
Comment 17•13 years ago
|
||
w32-ix-slave12.build.mtv1 is not responding to ping, not even on the management interface. It needs some investigation and likely also needs to go out for repair.
Reporter | ||
Comment 18•13 years ago
|
||
All 6 of the machines will be picked up tomorrow from MTV1 at 3pm.
Status: NEW → ASSIGNED
Comment 19•13 years ago
|
||
There should be 7 or 8 (if you get to w32-ix-slave12) machines according to comment 15.
Reporter | ||
Comment 20•13 years ago
|
||
All machines have been given to iX for repair. Asset tags are as follows:
4787, 4789, 4810, 4708, 4600, 4699, 4597, and 4606
Updated•13 years ago
|
Assignee: dustin → mlarrain
Reporter | ||
Comment 21•13 years ago
|
||
Copy pasta from iX Systems:
Here are the actions we took:
A1-16186 #4787 - Replaced CPU & system board
A1-16209 #4810 - Replaced HDD
A1-16052 #4600 - Replaced HDD
A1-16058 #4606 - Repaired bad sector on HDD
A1-16098 #4699 - No Trouble Found
A1-16049 #4597 - No Trouble Found
A1-16188 #4789 - No Trouble Found
A1-16107 #4708 - No Trouble Found
Be sure to check the BIOS settings before redeployment as they may have changed for our testing or in the case of A1-16186 the board was replaced and the settings were set to defaults.
Reporter | ||
Comment 22•13 years ago
|
||
I am verifying why we had flagged the machines that iX marked "No Trouble Found" and will message iX with questions.
Reporter | ||
Comment 23•13 years ago
|
||
w32-ix-slave35 A1-16098 4699 WARNING: These Memory DIMMs are NOT Supported!!!
w32-ix-slave03 A1-16049 4597 disk disappeared even from BIOS, now BSOD'ing
linux64-ix-slave16 A1-16188 4789 WARNING: These memory DIMMs are NOT supported!!!
w64-ix-slave02 A1-16107 4708 has had issues before & powers off after imaging
These are the issues those machines had. Will contact iX about it.
Reporter | ||
Updated•13 years ago
|
colo-trip: --- → scl1
Comment 24•13 years ago
|
||
I've seen other hosts occasionally have the "WARNING: These Memory DIMMs are NOT Supported!!!" error, and it goes away after a reboot.
4597 may have just needed a thump in the head or a hard power cycle to cure its flakiness.
I'm surprised about w64-ix-slave02 A1-16107 4708, though. That seemed to be a true problem child.
Comment 25•13 years ago
|
||
Based on location, seamonkey should get w32-ix-slave06 (since it's in mtv) and three other machines. I recommend w64-ix-slave02 (based on the current status of w64), and linux64-ix-slave37 and linux64-ix-slave16 (based on the thought that we can spare more linux64 than w32 machines).
Coop, do you have a preference?
Reporter | ||
Comment 26•13 years ago
|
||
I have contacted iX about the ones that didn't show an error. dustin has told me that the memory warning sometimes goes away after a reboot, so I had already asked iX to do rapid reboots to try to flag the issue. (Yes, I had sporadic errors as much as everyone else, and I can't expect them to care all too much if the boxes are showing green.) I have also told iX to deliver the machines to MTV1 with my name on them, and either Jake or I can get them racked and installed wherever they need to be deployed upon return.
Reporter | ||
Comment 27•13 years ago
|
||
These machines are back at my desk. Assigned to Jake to take them to SCL1 to rerack them.
Reporter | ||
Updated•13 years ago
|
Assignee: mlarrain → jwatkins
Comment 28•13 years ago
|
||
(In reply to Amy Rich [:arich] [:arr] from comment #25)
> Based on location, seamonkey should get w32-ix-slave06 (since it's in mtv)
> and three other machines. I recommend w64-ix-slave02 (based on the current
> status of w64), and linux64-ix-slave37 and linux64-ix-slave16 (based on the
> thought that we can spare more linux64 then w32 machines).
>
> Coop, do you have a preference?
I don't have a particular preference here, but your choices make sense. Let's go with that.
Comment 29•13 years ago
|
||
This bug will be to re-rack/install:
linux64-ix-slave14 A1-16186 4787
w32-ix-slave35 A1-16098 4699
w32-ix-slave03 A1-16049 4597
Jake, can you rack these up in their previous locations, and make sure they come on and the mgmt interface is pingable? They should then be updated in the repairs spreadsheet and moved to the repaired section.
Once that's done, please update this bug, and I'll do a fresh install on them.
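The check requested above (machine powered on, management interface pingable) could be scripted roughly as follows. This is only a sketch: it assumes a Linux host with iputils `ping`, and the `-mgmt` hostname suffix in `mgmt_name` is a hypothetical naming convention, not something stated in this bug.

```python
import subprocess

# The three hosts listed in this comment for re-racking.
RERACKED = ["linux64-ix-slave14", "w32-ix-slave35", "w32-ix-slave03"]


def mgmt_name(host, suffix="-mgmt"):
    # Hypothetical convention for the management-interface hostname.
    return host + suffix


def ping_command(host, count=1, timeout_s=2):
    # Assumes iputils ping: -c is the packet count, -W the reply timeout in seconds.
    return ["ping", "-c", str(count), "-W", str(timeout_s), host]


def is_pingable(host):
    # ping exits with status 0 when at least one reply was received.
    result = subprocess.run(
        ping_command(host),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def check_hosts(hosts, probe=is_pingable):
    # Map each host to True/False; probe is injectable so the logic
    # can be exercised without touching the network.
    return {host: probe(host) for host in hosts}


# Example usage (performs real pings, so it is left commented out):
# for host, ok in check_hosts([mgmt_name(h) for h in RERACKED]).items():
#     print(host, "pingable" if ok else "NOT pingable")
```

Any host that comes back as not pingable would stay out of the "repaired" section of the spreadsheet until someone has looked at it.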
Comment 30•13 years ago
|
||
Ah, I missed that we also sent w32-ix-slave12.build.mtv1 out for repair and got it back. Since that one also lived in mtv, we'll send linux64-ix-slave16 back to scl1 and keep w32-ix-slave12 for seamonkey instead.
Comment 31•13 years ago
|
||
Jake got these four racked and IPMI pingable.
I've updated the spreadsheet.
Assignee: jwatkins → arich
Comment 32•13 years ago
|
||
linux64-ix-slave14 and linux64-ix-slave16 have been rebuilt with the linux64 image.
w32-ix-slave03 and w32-ix-slave35 have been rebuilt with the w32 image from 2011-05 (see bug 683228).
They are ready for name changes and customizations.
Alias: iX-repairs
Assignee: arich → nobody
Component: Server Operations: RelEng → Release Engineering
QA Contact: zandr → release
Summary: iX hardware repairs → returned iX hardware repairs
Comment 33•13 years ago
|
||
Let me summarize what happened in this bug:
* To be put back into the pool by releng:
** linux64-ix-slave14
** linux64-ix-slave16
** w32-ix-slave03 (re-imaged from old snapshot) - worked on bug 683228
** w32-ix-slave35 (re-imaged from old snapshot) - worked on bug 683228
* Given to seamonkey
** w32-ix-slave06
** w32-ix-slave12
** w64-ix-slave02
** linux64-ix-slave37
** <strike>linux64-ix-slave16</strike> - see comment 30
From looking at comment 21 it seems that all 8 slaves are summarized in here.
> A1-16186 #4787 - Replaced CPU & system board - linux64-ix-slave14
> A1-16209 #4810 - Replaced HDD - linux64-ix-slave37
> A1-16052 #4600 - Replaced HDD - w32-ix-slave06
> A1-16058 #4606 - Repaired bad sector on HDD -
> A1-16098 #4699 - No Trouble Found -
> A1-16049 #4597 - No Trouble Found -
> A1-16188 #4789 - No Trouble Found - linux64-ix-slave16
> A1-16107 #4708 - No Trouble Found - w64-ix-slave02
NOTE: I took the comment and tried to match hostnames. If anyone wants the remaining hostnames I assume they are in the spreadsheet.
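For anyone who wants to redo the hostname matching mechanically, here is a minimal sketch that joins the rows from comment 15 (hostname, serial, asset tag) with the rows from comment 21 (serial, asset tag, action). The data is copied from those two comments; the function and variable names are made up for illustration.

```python
# Rows from comment 15: (hostname, serial, asset tag).
REPORTED = [
    ("linux64-ix-slave14", "A1-16186", "4787"),
    ("linux64-ix-slave16", "A1-16188", "4789"),
    ("linux64-ix-slave37", "A1-16209", "4810"),
    ("w64-ix-slave02", "A1-16107", "4708"),
    ("w32-ix-slave06", "A1-16052", "4600"),
    ("w32-ix-slave35", "A1-16098", "4699"),
    ("w32-ix-slave03", "A1-16049", "4597"),
]

# Rows from comment 21: (serial, asset tag, action taken by iX).
REPAIRS = [
    ("A1-16186", "4787", "Replaced CPU & system board"),
    ("A1-16209", "4810", "Replaced HDD"),
    ("A1-16052", "4600", "Replaced HDD"),
    ("A1-16058", "4606", "Repaired bad sector on HDD"),
    ("A1-16098", "4699", "No Trouble Found"),
    ("A1-16049", "4597", "No Trouble Found"),
    ("A1-16188", "4789", "No Trouble Found"),
    ("A1-16107", "4708", "No Trouble Found"),
]


def match_hostnames(reported, repairs):
    # Index the reported machines by (serial, asset tag) so that both
    # identifiers must agree before a hostname is attached.
    by_key = {(serial, tag): host for host, serial, tag in reported}
    return [
        (serial, tag, action, by_key.get((serial, tag)))
        for serial, tag, action in repairs
    ]


MATCHED = match_hostnames(REPORTED, REPAIRS)

for serial, tag, action, host in MATCHED:
    print(f"{serial} #{tag} - {action} - {host or '(no row in comment 15)'}")
```

Comment 15 has no row for A1-16058 #4606, so the sketch leaves that one unmatched rather than guessing; the spreadsheet is still the place to confirm it.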
Comment 34•13 years ago
|
||
(In reply to Armen Zambrano G. [:armenzg] - Release Engineer from comment #33)
> * Given to seamonkey
> ** w32-ix-slave06
> ** w32-ix-slave12
> ** w64-ix-slave02
> ** linux64-ix-slave37
Shall we file a separate bug for removing these 4 from DNS/nagios/inventory?
I would like to remove these hosts from releng configs.
Comment 35•13 years ago
|
||
DNS/nagios/inventory work filed as bug 703662
Status: ASSIGNED → RESOLVED
Closed: 13 years ago
Resolution: --- → FIXED
Assignee | ||
Updated•11 years ago
|
Product: mozilla.org → Release Engineering