Closed
Bug 1388407
Opened 7 years ago
Closed 7 years ago
Scheduling missing tests for autoland times out
Categories
(Taskcluster :: Services, enhancement)
Taskcluster
Services
Tracking
(Not tracked)
RESOLVED
FIXED
mozilla57
People
(Reporter: armenzg, Assigned: bstack)
References
Details
Attachments
(1 file)
I pushed to try, canceled the builds, clicked on the Gecko decision task, added a payload of {} and clicked on trigger action.
Unfortunately only the missing tests action task got scheduled.
https://treeherder.mozilla.org/#/jobs?repo=try&revision=5cecffefd20de1c958d5201a39575c38b596dc1c&selectedJob=121686629
> [task 2017-08-08T15:22:04.847044Z] Out of 37 test tasks, 37 already existed and the action created 0
Reporter | ||
Updated•7 years ago
|
Comment 1•7 years ago
|
||
Right -- those tests had already been run. That they had been cancelled is an entirely different issue. Run missing tests runs tests that were not originally run.
Reporter | ||
Comment 2•7 years ago
|
||
OK. I will try again w/o cancelling first.
I'm used to the old behaviour where cancelled or failed builds (e.g. infra failure) would be tried again.
Comment 3•7 years ago
|
||
That might be a good set of features to add -- and it's all in tree now, so anyone can do so :)
Reporter | ||
Comment 4•7 years ago
|
||
I tried it on autoland and it failed like this:
https://treeherder.mozilla.org/#/jobs?repo=autoland&revision=91b30f63e8d804de50178f3f0e85ebaad4766b4b&selectedJob=121694289
> [taskcluster:error] Task timeout after 1800 seconds. Force killing container.
I also tried it on 'try' without canceling builds and it did not schedule anything:
https://treeherder.mozilla.org/#/jobs?repo=try&revision=2850f912e482988a582f0c8c5092337a8099b699&group_state=expanded&selectedJob=121715931
Should we assume this is the new behaviour and ask for feedback?
Reporter | ||
Comment 5•7 years ago
|
||
I will rename this bug to only focus on the timeout issue.
Summary: Scheduling missing tests did not schedule anything → Scheduling missing tests for autoland times out
Comment hidden (mozreview-request) |
Comment hidden (mozreview-request) |
Assignee | ||
Updated•7 years ago
|
Attachment #8896498 -
Flags: review?(dustin)
Assignee | ||
Comment 8•7 years ago
|
||
This patch should use the logic the rest of taskgraph stuff uses to submit tasks concurrently. I believe it will also fix the missing dependencies in add_talos we've been investigating and simplify the code needed for an action task as well! I assume I've missed something about how the generation stuff works since it is pretty complex, but hopefully this is at least a move in the right direction.
Assignee: nobody → bstack
Status: NEW → ASSIGNED
Comment 9•7 years ago
|
||
mozreview-review |
Comment on attachment 8896498 [details]
Bug 1388407 - Fix timeouts in action-task graph submission
https://reviewboard.mozilla.org/r/167758/#review173540
::: taskcluster/taskgraph/actions/util.py:39
(Diff revision 2)
> +def create_tasks(to_run, full_task_graph, label_to_taskid, params, decision_task_id):
> + """Create new tasks. The task definition will have {relative-datestamp':
> + '..'} rendered just like in a decision task. Action callbacks should use
> + this function to create new tasks,
> + allowing easy debugging with `mach taskgraph action-callback --test`.
> + This builds up all required tasks to run in order to run the tasks requested."""
This docstring may need some adjustment. This will also render {task-reference: ..}, and it is suited to creating tasks based on the original decision task, rather than creating "new" tasks. Is it legit to "edit" the full_task_graph before handing it to this function?
::: taskcluster/taskgraph/actions/util.py:47
(Diff revision 2)
> + target_task_graph = TaskGraph(
> + {l: full_task_graph[l] for l in target_graph.nodes},
> + target_graph)
> + optimized_task_graph, label_to_taskid = optimize_task_graph(target_task_graph,
> + params,
> + to_run)
I think you want to pass label_to_taskid as the 4th parameter here; without that, optimization will look in the index and may find newer versions of prerequisite tasks like toolchains,
Attachment #8896498 -
Flags: review?(dustin) → review+
Comment hidden (mozreview-request) |
Assignee | ||
Updated•7 years ago
|
Keywords: checkin-needed
Comment 11•7 years ago
|
||
Pushed by ryanvm@gmail.com:
https://hg.mozilla.org/integration/autoland/rev/5214574f51b6
Fix timeouts in action-task graph submission r=dustin
Keywords: checkin-needed
Comment 12•7 years ago
|
||
bugherder |
Status: ASSIGNED → RESOLVED
Closed: 7 years ago
Resolution: --- → FIXED
Target Milestone: --- → mozilla57
Reporter | ||
Comment 13•7 years ago
|
||
I've tested that this works and very fast (within a minute of start).
Thank you!
I can see 1600+ tasks that got scheduled in this push:
https://treeherder.mozilla.org/#/jobs?repo=autoland&revision=df4da5b8599a07277245d6afe9a932587c0a3585&filter-resultStatus=retry&filter-resultStatus=usercancel&filter-resultStatus=running&filter-resultStatus=pending&filter-resultStatus=runnable&group_state=expanded
Assignee | ||
Comment 14•7 years ago
|
||
Awesome. I'm glad this works :)
Updated•6 years ago
|
Component: Integration → Services
You need to log in
before you can comment on or make changes to this bug.
Description
•