Create new worker pools for GCP POSIX builders
Categories
(Release Engineering :: Firefox-CI Administration, defect)
Tracking
(Not tracked)
People
(Reporter: coop, Unassigned)
Attachments
(3 files)
It looks like the existing worker pools for GCP builders got deleted as part of the redeployment over the weekend. No tier 3 builds have run since:
We were hoping to migrate build load from AWS to GCP this week in bug 1547111, but we need to validate that the tier 3 builds still work in the new deployment first.
We'll want to create new worker pools in GCP, probably with names close enough to the existing tier 1 worker pools that migration is easy.
I suggest maybe gecko-{level}/b-linux-gce?
Comment 1•5 years ago (Reporter)
Here's the last mozilla-history entry from Nov 8 (last Friday) showing the worker pool for gce/gecko-1-b-linux:
We'll need pools for all levels, of course.
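As a rough illustration of the naming scheme suggested in the description, the per-level pool IDs could be expanded from a single template. This is only a sketch: the helper name `pool_ids` is hypothetical, and it assumes one pool per level for levels 1 through 3.

```python
# Hypothetical helper: expand a pool-name template across levels.
POOL_TEMPLATE = "gecko-{level}/b-linux-gce"  # name suggested in the description
LEVELS = (1, 2, 3)  # assumption: one pool per level, levels 1 through 3


def pool_ids(template=POOL_TEMPLATE, levels=LEVELS):
    """Return one worker pool ID per level."""
    return [template.format(level=level) for level in levels]


if __name__ == "__main__":
    for pool_id in pool_ids():
        print(pool_id)
```

Keeping the template in one place means a later rename only touches a single string.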
Comment 2•5 years ago
The workers still exist in ci-config, but are all disabled. There are a few things we need to do for this:
- Clean up all the test workers, other than the gecko-<level>/...-gce workers.
- Add some logic like [get_aws_provider_config](https://hg.mozilla.org/ci/ci-admin/file/tip/ciadmin/generate/worker_pools.py#l56). There appears to be a lot of duplication between regions. In particular, we should add support for pulling image names from worker-image.yml, rather than having them inline.
- Adjust the remaining workers to use the appropriate providerIds (the legacy GCP providerIds are different from the production ones), and remove the hack that disables non-AWS providerIds (it exists because of the aforementioned difference in providerIds).
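The second item could look roughly like the sketch below: image names are looked up by alias, and the per-region entries are generated from shared settings instead of being repeated inline. Everything here is an assumption for illustration. The function names, the dictionary standing in for the parsed worker-image.yml, the image name, and the shape of the generated entries are all hypothetical and are not taken from ci-admin or from the worker-manager schema.

```python
# Stand-in for the parsed contents of worker-image.yml. The alias, keys and
# image name are made up for illustration only.
WORKER_IMAGES = {
    "linux-builder": {
        "gcp": "example-project/example-linux-image",
    },
}


def resolve_image(alias, provider, images=WORKER_IMAGES):
    """Look up the image name for an alias on a given provider."""
    try:
        return images[alias][provider]
    except KeyError:
        raise KeyError(
            f"no image defined for alias {alias!r} on provider {provider!r}"
        ) from None


def gcp_pool_config(alias, regions, machine_type):
    """Build one entry per region from shared settings (hypothetical shape)."""
    image = resolve_image(alias, "gcp")
    return [
        {"region": region, "machineType": machine_type, "image": image}
        for region in regions
    ]
```

With this layout, changing an image means editing one alias entry rather than every region block.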
Comment 3•5 years ago
For my own notes: https://bugzilla.mozilla.org/show_bug.cgi?id=1583974#c9
Comment 4•5 years ago
Comment 5•5 years ago
Comment 6•5 years ago
I've added patches that show how to create images, but the old images that were generated are accessible to the production gcp projects, so I am unable to test this.
Comment 7•5 years ago
Comment 9•5 years ago (Reporter)
Pools are here:
https://firefox-ci-tc.services.mozilla.com/worker-manager/gecko-1%2Fb-linux-gcp
https://firefox-ci-tc.services.mozilla.com/worker-manager/gecko-3%2Fb-linux-gcp
Try push is here:
Perf looks good, now that the caches are warm.
Comment 10•5 years ago
Possibly related to the changes here, we got a flurry of emails like this:
Worker Manager has encountered an error while trying to provision the worker pool gecko-3/b-linux-gcp:
Quota 'CPUS' exceeded. Limit: 2400.0 in region us-central1.
ErrorId: fQuQpojlQICflD-MzMXsNw
It includes the extra information:
code: QUOTA_EXCEEDED
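For a sense of scale, the quota in that email caps how many instances a region can hold. The arithmetic can be sketched as below; the helper name and the idea of tracking CPUs already in use are assumptions for illustration, and only the 2400 limit comes from the error message.

```python
CPU_QUOTA = 2400  # limit quoted in the error email for us-central1


def max_instances(cpus_per_instance, cpu_quota=CPU_QUOTA, cpus_in_use=0):
    """Number of additional instances that fit under the CPU quota."""
    if cpus_per_instance <= 0:
        raise ValueError("cpus_per_instance must be positive")
    # Never report negative headroom if usage already exceeds the quota.
    remaining = max(cpu_quota - cpus_in_use, 0)
    return remaining // cpus_per_instance
```

For a hypothetical 16-vCPU instance type, `max_instances(16)` gives 150 instances in an otherwise empty region, so a pool whose maximum capacity is higher than that would hit this error regardless of other load.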
Comment 11•5 years ago
bugherder
Comment 12•5 years ago (Reporter)
(In reply to Nick Thomas [:nthomas] (UTC+13) from comment #10)
Quota 'CPUS' exceeded. Limit: 2400.0 in region us-central1.
Thanks, Nick. I'll talk to GCP and see whether we can increase our quota.
Comment 13•5 years ago (Reporter)
(In reply to Chris Cooper [:coop] pronoun: he from comment #12)
(In reply to Nick Thomas [:nthomas] (UTC+13) from comment #10)
Quota 'CPUS' exceeded. Limit: 2400.0 in region us-central1.
Thanks, Nick. I'll talk to GCP and see whether we can increase our quota.
Filed bug 1598295 for this.