Closed Bug 790612 (tegra-338) Opened 12 years ago Closed 10 years ago

tegra-338 problem tracking

Categories

(Infrastructure & Operations Graveyard :: CIDuty, task)

x86_64
Linux
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: jmaher, Unassigned)

Details

(Whiteboard: [buildduty][buildslave][capacity][mobile])

No description provided.
Alias: tegra-338
Component: Release Engineering: Automation (General) → Release Engineering: Machine Management
QA Contact: catlee → armenzg
Summary: tegra-338 failed twice in a row with purple/red on m-c- please pull and investigate → tegra-338 problem tracking
Whiteboard: [buildduty]
Assignee: nobody → kmoir
That tegra should become idle after the current build on it finishes.
Resolving as WORKSFORME for now. As I discussed with Kim on IRC, the way she set this tegra to go idle unfortunately doesn't persist: our automation notices that buildbot is down and restarts it. She is correcting our documentation. In the meantime, the recent list for this tegra appears happy.
Status: NEW → RESOLVED
Closed: 12 years ago
Resolution: --- → WORKSFORME
Whiteboard: [buildduty] → [buildduty][buildslave][capacity][mobile]
The SD card does not appear to be mounted; please reimage and swap the card.
Status: RESOLVED → REOPENED
Depends on: 817995
Resolution: WORKSFORME → ---
Depends on: 822038
re-imaged + swapped card
pdu reboot start_cp
Back in production.
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
No jobs taken on this device for more than a week (but less than 3 weeks).
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
(mass change: filter on tegraCallek02reboot2013)
I just rebooted this device, hoping that many of the ones I'm doing tonight come back automatically. I'll check back tomorrow to see whether it did; if it did not, I'll triage the next step manually on a per-device basis.
---
Command I used (with a manual patch to the fabric script to allow this command):
(fabric)[jwood@dev-master01 fabric]$ python manage_foopies.py -j15 -f devices.json `for i in 021 032 036 039 046 048 061 064 066 067 071 074 079 081 082 083 084 088 093 104 106 108 115 116 118 129 152 154 164 168 169 174 179 182 184 187 189 200 207 217 223 228 234 248 255 264 270 277 285 290 294 295 297 298 300 302 304 305 306 307 308 309 310 311 312 314 315 316 319 320 321 322 323 324 325 326 328 329 330 331 332 333 335 336 337 338 339 340 341 342 343 345 346 347 348 349 350 354 355 356 358 359 360 361 362 363 364 365 367 368 369; do echo '-D' tegra-$i; done` reboot_tegra
The command does the reboot one at a time, from the foopy each device is connected to, with one ssh connection per foopy.
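The backtick loop in the command above only expands a list of device numbers into repeated `-D tegra-NNN` arguments. A minimal Python sketch of that expansion follows; `build_device_args` is a hypothetical helper name used for illustration and is not part of manage_foopies.py.

```python
def build_device_args(device_numbers):
    """Return ['-D', 'tegra-021', '-D', 'tegra-032', ...] for the given numbers.

    Hypothetical helper: mirrors the shell loop
    `for i in ...; do echo '-D' tegra-$i; done` from the comment above.
    """
    args = []
    for number in device_numbers:
        # Device names in the log are zero-padded to three digits (tegra-021).
        args.extend(["-D", f"tegra-{int(number):03d}"])
    return args
```

For example, `" ".join(build_device_args(["021", "032", "338"]))` gives the same argument string the shell loop would produce for those three devices.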
Depends on: 838687
did a start/stop cp cycle
Status: REOPENED → RESOLVED
Closed: 12 years ago → 12 years ago
Resolution: --- → FIXED
Product: mozilla.org → Release Engineering
agent check failing, pdu reboot didn't help
Status: RESOLVED → REOPENED
Depends on: 937173
Resolution: FIXED → ---
flashed and reimaged
Assignee: kmoir → nobody
Back in production
Status: REOPENED → RESOLVED
Closed: 12 years ago → 11 years ago
Resolution: --- → FIXED
Attempting SSH reboot...Failed. Attempting PDU reboot...Failed. Filed IT bug for reboot (bug 988525)
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
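The "Attempting SSH reboot...Failed. Attempting PDU reboot...Failed." messages follow a try-in-order escalation pattern: each recovery method is attempted in turn, and an IT bug is filed only when all of them fail. The sketch below illustrates that pattern only. It is not the actual releng tooling; `reboot_with_escalation` and the step callables are hypothetical.

```python
def reboot_with_escalation(steps):
    """Try each (name, action) pair in order.

    Returns the name of the first step whose action returns a truthy value,
    or None if every step fails (the point at which a human files an IT bug).
    `steps` and its callables are hypothetical stand-ins for real reboot methods.
    """
    for name, action in steps:
        print(f"Attempting {name}...", end="")
        try:
            ok = bool(action())
        except Exception:
            # Treat any error raised by a step as a failed attempt.
            ok = False
        print("OK." if ok else "Failed.")
        if ok:
            return name
    return None
```

A caller would pass something like `[("SSH reboot", try_ssh), ("PDU reboot", try_pdu)]` and escalate manually when the result is `None`.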
formatted SD card, reimaged and flashed.
[vle@admin1a.private.scl3 ~]$ telnet tegra-338.tegra.releng.scl3.mozilla.com 20701
Trying 10.26.85.255...
Connected to tegra-338.tegra.releng.scl3.mozilla.com.
Escape character is '^]'.
$>^]q
telnet> q
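The telnet session above is used as a manual check that the device accepts TCP connections on port 20701 after reimaging. A minimal scripted equivalent, assuming only that a successful connect is the signal of interest, could look like this; `port_is_open` is a hypothetical helper name.

```python
import socket


def port_is_open(host, port, timeout=5.0):
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        # create_connection resolves the host and connects; closing the
        # socket right away is enough for a reachability check.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution errors.
        return False
```

With the host and port from the log, the call would be `port_is_open("tegra-338.tegra.releng.scl3.mozilla.com", 20701)`. Unlike the telnet session, this only confirms that the port accepts a connection; it does not check for the `$>` prompt.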
and running green in production :) Thanks!
Status: REOPENED → RESOLVED
Closed: 11 years ago → 11 years ago
Resolution: --- → FIXED
Attempting SSH reboot...Failed. Attempting PDU reboot...Failed. Filed IT bug for reboot (bug 1008662)
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
SD card replaced, tegra flashed and reimaged.
[vle@admin1a.private.scl3 ~]$ telnet tegra-338.tegra.releng.scl3.mozilla.com 20701
Trying 10.26.85.255...
Connected to tegra-338.tegra.releng.scl3.mozilla.com.
Escape character is '^]'.
$>^]q
telnet> q
Attempting SSH reboot...Failed. Attempting PDU reboot...Failed. Filed IT bug for reboot (bug 1010866)
formatted SD card, reimaged and flashed.
[vle@admin1a.private.scl3 ~]$ telnet tegra-338.tegra.releng.scl3.mozilla.com 20701
Trying 10.26.85.255...
Connected to tegra-338.tegra.releng.scl3.mozilla.com.
Escape character is '^]'.
$>^]q
telnet> q
Status: REOPENED → RESOLVED
Closed: 11 years ago → 10 years ago
QA Contact: armenzg → bugspam.Callek
Resolution: --- → FIXED
Attempting SSH reboot...Failed. Attempting PDU reboot...Failed. Filed IT bug for reboot (bug 1016302)
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
SD card replaced, reflashed, and reimaged.
telnet tegra-338.tegra.releng.scl3.mozilla.com 20701
Trying 10.26.85.255...
Connected to tegra-338.tegra.releng.scl3.mozilla.com.
Escape character is '^]'.
$>^]
telnet> q
Status: REOPENED → RESOLVED
Closed: 10 years ago → 10 years ago
Resolution: --- → FIXED
Attempting SSH reboot...Failed. Attempting PDU reboot...Failed. Filed IT bug for reboot (bug 1034825)
Status: RESOLVED → REOPENED
Resolution: FIXED → ---
formatted SD card, flashed and reimaged tegra.
vle@vle-10516 ~ $ telnet tegra-338.tegra.releng.scl3.mozilla.com 20701
Trying 10.26.85.255...
Connected to tegra-338.tegra.releng.scl3.mozilla.com.
Escape character is '^]'.
$>^]
telnet> q
Status: REOPENED → RESOLVED
Closed: 10 years ago → 10 years ago
Resolution: --- → FIXED
Product: Release Engineering → Infrastructure & Operations
Product: Infrastructure & Operations → Infrastructure & Operations Graveyard