Open Bug 1445946 Opened 7 years ago Updated 1 year ago

evaluate our leaf-tasks in the release graph to add more checks to our configurations

Tracking

(Not tracked)

Status:

NEW

People

(Reporter: mtabara, Unassigned)

References

(Depends on 1 open bug)

Details

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Description

•

7 years ago

We've recently had two incidents with release automation tasks that are leafs in the graph, as in they have no dependencies checking results of their action. * e.g. in bug 1444391 we accidentally updated Google Play Store to app name to contain "Beta" * e.g. in bug 1445672 we accidentally updated release aliases from a beta release. To improve things in the future, we should guard these sensitive changes with more checks. Catching configuration errors is particularly difficult because tasks go green, things look apparently as expected and other non-releng automation can catch them but runs with latency. For this reason, it might be good to assess the leafs tasks in the release graphs, for all products. This includes but is not limited to: * bouncer aliases * mark as shipped * version bump * pushapk (for fennec obviously) Stuff like balrog-scheduling is a leaf task but has the signoff that depends on human factor so that's out of the scope for now I suppose. To start off with an example, for bouncer aliases, we could: a) more configuration checks to make sure we don't mix up the wrong aliases b) checks before we even do the API calls c) a separate task that is to handle just the verification process, that should depend on the original bouncer aliases task

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

7 years ago

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=1446028

Aki Sasaki (not active)

Comment 1

•

7 years ago

I think the issues are twofold: - scriptworker scripts should all have checks that run before we perform any changes. This should check things like "don't ship devedition to firefox" or "don't ship beta to release". We can add to this test suite over time as we discover new issues, since it can never cover all things. But ideally all new *scripts that perform sensitive user-facing tasks (most of them) should have some of these checks before going live. And ideally any rules for these checks are configurable via puppet, so we can push changes quicker than having to bump and repackage the scripts. For bouncer aliases specifically, we could have seen the issue on maple by looking at the task definition. And if bouncerscript had had no-crossing-channels checks on maple, we would have seen busted bouncer tasks on maple. - aiui we don't run anything for pushapk in staging, or we ignore the output. We should have actual tests in staging for all tasks if at all possible, and they should give us real values - no more perma-green or perma-ignorable-reds. This is more difficult than, say, balrog, since we don't control the other side. At the very least we should run the above checks on the apk and strings, so we know the upstream tasks and inputs are all sane.

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Comment 2

•

7 years ago

Before I forget, bouncer submission should have similar tests as the ones added in bouncer aliases. [1]https://github.com/mozilla-releng/bouncerscript/pull/16.

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

7 years ago

Depends on: 1445684

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

7 years ago

See Also: → https://bugzilla.mozilla.org/show_bug.cgi?id=1458188

Simon Fraser [:sfraser] ⌚️GMT

Updated

•

6 years ago

Depends on: 1469803

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

6 years ago

Depends on: 1470226

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

6 years ago

Depends on: 1470227

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

6 years ago

Depends on: 1470228

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

6 years ago

Depends on: 1470232

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

6 years ago

Depends on: 1470233

Mihai Tabara [:mtabara]⌚️GMT

Reporter

Updated

•

6 years ago

Depends on: 1472667

Johan Lorenzo [:jlorenzo] - On PTO, back on September 11th

Updated

•

1 year ago

Severity: major → N/A

You need to log in before you can comment on or make changes to this bug.

Bugzilla

Quick Search

evaluate our leaf-tasks in the release graph to add more checks to our configurations

Categories

(Release Engineering :: Release Automation: Other, enhancement)

Tracking

(Not tracked)

People

(Reporter: mtabara, Unassigned)

References

(Depends on 1 open bug)

Details

Crash Data

Security

(public)

User Story

Description

Updated

Comment 1

Comment 2

Updated

Updated

Updated

Updated

Updated

Updated

Updated

Updated

Updated

Updated