Closed Bug 1329694 Opened 8 years ago Closed 5 years ago

Add SETA data expiration to the cycle_data task

Categories

(Tree Management Graveyard :: Treeherder: SETA, defect, P3)

Tracking

(Not tracked)

RESOLVED WONTFIX

People

(Reporter: emorley, Unassigned)

References

(Blocks 1 open bug)

Details

Most Treeherder data types (other than pushlog, which we decided to keep indefinitely) are covered by the cycle_data() task, which removes data older than 4 months: https://github.com/mozilla/treeherder/blob/master/treeherder/model/management/commands/cycle_data.py The initial SETA landing didn't add any handling there, so I don't think any SETA data expiration is taking place. For example, if preseed.json is modified, we only add new rows and don't clean up old ones (either in load_preseed or cycle_data).
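To illustrate the preseed problem described above, here is a minimal sketch of how stale rows could be identified after preseed.json changes. It is not Treeherder code: the function name `stale_preseed_keys` and the key fields (`testtype`, `buildtype`, `platform`) are assumptions made for this example, and the real schema may differ.

```python
def stale_preseed_keys(stored_keys, preseed_entries):
    """Return stored keys that no longer appear in the preseed entries.

    stored_keys: iterable of (testtype, buildtype, platform) tuples
                 already in the database (hypothetical key shape).
    preseed_entries: list of dicts parsed from a preseed file
                     (field names are assumptions for this sketch).
    """
    wanted = {
        (entry["testtype"], entry["buildtype"], entry["platform"])
        for entry in preseed_entries
    }
    # Anything stored but no longer listed in the preseed is a cleanup candidate.
    return sorted(set(stored_keys) - wanted)
```

A loader that only inserts new rows never computes this difference, which is how old rows could linger.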
Blocks: 1078392
No longer blocks: 1276268
FYI, the dataset does not grow very much over time. What would it take to deal with this? Also, sometime this year we might tackle the project of making TH manifest-based rather than job-based, which might require refactoring the current SETA implementation. Would you still want to invest in this knowing that? Happy it gets taken care of!
Component: Treeherder: Data Ingestion → Treeherder: SETA
emorley, we have discussed how SETA works, and we can see that preseed could leave some lingering data, but that's about it. We are OK with just deleting anything older than 4 months.
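A rough sketch of what "deleting anything older than 4 months" could look like, using an in-memory SQLite table so it runs on its own. This is not the cycle_data implementation: the table name `seta_rows`, the `created` column, and the approximation of 4 months as 120 days are all assumptions made for illustration.

```python
import sqlite3
import time

# Assumption: "4 months" approximated as 120 days, in seconds.
MAX_AGE_SECONDS = 120 * 24 * 60 * 60


def expire_old_rows(conn, now=None, max_age=MAX_AGE_SECONDS):
    """Delete rows older than max_age and return how many were removed."""
    now = time.time() if now is None else now
    cutoff = now - max_age
    # `seta_rows` and `created` are hypothetical names for this sketch.
    cur = conn.execute("DELETE FROM seta_rows WHERE created < ?", (cutoff,))
    conn.commit()
    return cur.rowcount


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE seta_rows (id INTEGER PRIMARY KEY, testtype TEXT, created REAL)"
    )
    now = 1_700_000_000  # fixed timestamp so the demo is deterministic
    rows = [
        ("old-test", now - 200 * 86400),    # older than the cutoff
        ("recent-test", now - 10 * 86400),  # within the retention window
    ]
    conn.executemany("INSERT INTO seta_rows (testtype, created) VALUES (?, ?)", rows)
    deleted = expire_old_rows(conn, now=now)
    remaining = [r[0] for r in conn.execute("SELECT testtype FROM seta_rows")]
    return deleted, remaining
```

Calling `demo()` deletes the single row older than the cutoff and leaves the recent one in place.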
Priority: P2 → P3

We're moving away from SETA, so wontfixing.

Status: NEW → RESOLVED
Closed: 5 years ago
Resolution: --- → WONTFIX
Product: Tree Management → Tree Management Graveyard