Closed Bug 794549 Opened 12 years ago Closed 10 years ago

Investigate reports_duplicates data expiration policy, and whether middleware uses this table for anything

Categories

(Socorro :: Database, task)

x86
macOS
task
Not set
normal

Tracking

(Not tracked)

RESOLVED FIXED

People

(Reporter: selenamarie, Assigned: selenamarie)

References

Details

See subject :)
We have a big table that keeps track of reports we think are duplicates.

/report service uses reports_duplicates
/crash service uses reports_duplicates
Matview daily_url.py also uses reports_duplicates.

The table is 2.4 GB, the estimated tuple count is 21 million, and the oldest record is from 2010: 2010-12-31 23:00:01.890194+00

Maybe we should only hold data for 6 months in this table? :laura?
Assignee: nobody → sdeckelmann
+1 to 6 month retention
Flags: needinfo?(laura)
+1 for 6 month retention from me.
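For illustration only, the 6-month retention proposed in this bug could be checked per row roughly as follows. This is a minimal sketch, not Socorro code: the 183-day approximation of "6 months", the function names, and the `now` value are all assumptions; only the oldest-record timestamp comes from this bug.

```python
from datetime import datetime, timedelta, timezone

# Assumption: "6 months" is approximated as 183 days.
RETENTION_DAYS = 183


def retention_cutoff(now, retention_days=RETENTION_DAYS):
    """Return the oldest timestamp that would still be retained."""
    return now - timedelta(days=retention_days)


def is_expired(date_processed, now, retention_days=RETENTION_DAYS):
    """True if a row's timestamp falls before the retention cutoff."""
    return date_processed < retention_cutoff(now, retention_days)


# Oldest record quoted in this bug: 2010-12-31 23:00:01.890194+00
oldest_record = datetime(2010, 12, 31, 23, 0, 1, 890194, tzinfo=timezone.utc)

# Hypothetical "current" time, chosen only for the example.
example_now = datetime(2012, 9, 25, tzinfo=timezone.utc)

print(retention_cutoff(example_now).date())
print(is_expired(oldest_record, example_now))
```

Under these assumptions the oldest record in the table is well past the cutoff, which is consistent with the table holding far more than 6 months of data.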
Flags: needinfo?(laura)
Commits pushed to master at https://github.com/mozilla/socorro

https://github.com/mozilla/socorro/commit/95680ca0ecca2ac5b4fc84d1c96de2cfa82e9b98
Fixes bug 794549 Add partitioning to reports_duplicates table

https://github.com/mozilla/socorro/commit/bd1be60b2631c0ab8d768e2d102aef47ea39fb7f
Merge pull request #2579 from selenamarie/bug794549-retention-enforcement-duplicates

Fixes bug 794549 Add partitioning to reports_duplicates table
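Partitioning makes retention cheap because whole partitions can be dropped instead of deleting rows. The sketch below shows one way to pick expired partitions. It is not taken from the commits above: the `reports_duplicates_YYYYMMDD` naming scheme, the 7-day partition width, the 183-day retention window, and the function names are all assumptions made for the example.

```python
from datetime import date, datetime, timedelta

# Assumption: partitions are named with this prefix plus a YYYYMMDD start date.
PREFIX = "reports_duplicates_"


def partition_start(name):
    """Parse the (assumed) YYYYMMDD suffix of a partition name into a date."""
    return datetime.strptime(name[len(PREFIX):], "%Y%m%d").date()


def partitions_to_drop(names, today, retention_days=183, partition_days=7):
    """Return partitions whose whole date range ends on or before the cutoff."""
    cutoff = today - timedelta(days=retention_days)
    expired = []
    for name in names:
        # A partition covers [start, start + partition_days).
        end = partition_start(name) + timedelta(days=partition_days)
        if end <= cutoff:
            expired.append(name)
    return sorted(expired)


# Hypothetical partition names and date, for illustration only.
example_partitions = [
    "reports_duplicates_20120903",
    "reports_duplicates_20120326",
    "reports_duplicates_20120319",
    "reports_duplicates_20120102",
]
for name in partitions_to_drop(example_partitions, date(2012, 9, 25)):
    print("DROP TABLE " + name + ";")
```

A partition is only selected once its entire range is older than the cutoff, so no row inside the retention window is removed.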
Status: NEW → RESOLVED
Closed: 10 years ago
Resolution: --- → FIXED
Depends on: 1124300