{"id":828,"date":"2026-04-28T01:07:28","date_gmt":"2026-04-28T01:07:28","guid":{"rendered":"https:\/\/geiselmed.dartmouth.edu\/gsr\/?page_id=828"},"modified":"2026-04-28T01:07:28","modified_gmt":"2026-04-28T01:07:28","slug":"data-retention","status":"publish","type":"page","link":"https:\/\/geiselmed.dartmouth.edu\/gsr\/data-retention\/","title":{"rendered":"Data Retention"},"content":{"rendered":"<h2>How long data is kept<\/h2>\n<p>Each project has a single age \u2014 the date of the oldest file it contains. Retention timers count from that age, so adding or editing a file later in a project does <strong>not<\/strong> reset the clock.<\/p>\n<table style=\"border-collapse: collapse;width: 100%\" border=\"1\">\n<thead>\n<tr style=\"background-color: #00693e;color: #fff\">\n<th style=\"padding: 10px;text-align: left\">File type<\/th>\n<th style=\"padding: 10px;text-align: left\">How long we keep it<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"padding: 10px\"><strong>Raw sequencing &amp; imaging data<\/strong> \u2014 FASTQ, TIFF images (Visium H&amp;E, fluorescence, etc.), Xenium output bundles, methylation arrays (<code>.idat<\/code>)<\/td>\n<td style=\"padding: 10px\"><strong>5 years<\/strong> from the project's start date<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 10px\"><strong>Oxford Nanopore raw reads<\/strong> \u2014 <code>.fast5<\/code> \/ <code>.pod5<\/code><\/td>\n<td style=\"padding: 10px\"><strong>90 days<\/strong> from the project's start date<\/td>\n<\/tr>\n<tr>\n<td style=\"padding: 10px\"><strong>Analysis results<\/strong> \u2014 everything inside a project's <code>analysis\/<\/code> folder (Cell Ranger output, alignment BAMs, variant calls, intermediate files)<\/td>\n<td style=\"padding: 10px\"><strong>30 days<\/strong> from the project's start date<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p><em>Some files are always kept<\/em> \u2014 Snakemake driver files (<code>Snakefile<\/code>, <code>*.yaml<\/code>, <code>*.sh<\/code>, etc.), Cell Ranger 
per-sample summary HTML pages, and the full contents of Xenium output bundles. These live as long as the project does so pipelines can be re-run.<\/p>\n<h2>When you'll hear from us<\/h2>\n<p>You'll get an email <strong>90 days, 30 days, and 7 days<\/strong> before raw sequencing\/imaging or Nanopore files are removed. You'll get an email <strong>7 days<\/strong> before analysis files (Cell Ranger outputs, etc.) are removed.<\/p>\n<h2>Who gets the email<\/h2>\n<p>Everyone you've granted access to your lab's GSR data will receive the retention email: typically the PI, postdocs, grad students, and anyone else you've added to your lab's permissions. These warnings go to the same list that is notified when GSR grants data access.<\/p>\n<h2>What to do if you want to keep a file<\/h2>\n<p>There are several options, easiest first:<\/p>\n<ol>\n<li><strong>Copy it out.<\/strong> Copy it to your own storage (your home directory, a lab DartFS allocation, an external drive) before the deadline.<\/li>\n<li><strong>Ask for an extension on specific files.<\/strong> Reply to the retention email and list the paths you want kept. The admin will exclude them from the next removal cycle.<\/li>\n<li><strong>Opt out an entire project.<\/strong> Contact the GSR admin directly.<\/li>\n<\/ol>\n<h2>After a deadline passes<\/h2>\n<p>If you don't act, files first move into a recovery-area folder (<code>.retention_trash\/<\/code>) for <strong>14 days<\/strong>. During those 14 days an admin can restore anything if you realize you missed something. After 14 days the files are removed permanently.<\/p>\n<h2>First-time rollout<\/h2>\n<p>We're just starting to roll this policy out, and it applies retroactively to all data already on DartFS. If any of your existing files are already past their retention window, you'll receive a one-time rollout email listing the projects affected. 
<strong>You have 30 days from that email to move anything you want to preserve<\/strong>; after that, those files enter the normal removal cycle described above.<\/p>\n<h2>Questions<\/h2>\n<p>Reply to any of these notification emails, or contact Fred Kolling (GSR admin) directly at <a href=\"mailto:fred.w.kolling.iv@dartmouth.edu\">fred.w.kolling.iv@dartmouth.edu<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>How long data is kept Each project has a single age \u2014 the date of the oldest file it contains. Retention timers count from that age, so adding or editing a file later in a project does not reset the clock. File type How long we keep it Raw sequencing [\u2026] <\/p>\n<div class=\"clear\"><\/div>\n<p><a class=\"more_link clearfix\" href=\"https:\/\/geiselmed.dartmouth.edu\/gsr\/data-retention\/\" rel=\"nofollow\">Read More<\/a><\/p>\n","protected":false},"author":77,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-828","page","type-page","status-publish","hentry","author-77"],"jetpack_shortlink":"https:\/\/wp.me\/PbaUZP-dm","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/pages\/828","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/users\/77"}],"replies":[{"embeddable":true,"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/comments?post=828"}],"version-history":[{"count":1,"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/pages\/828\/revisions"}],"predecessor-version":[{"id":829,"href":"https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/pages\/828\/revisions\/829"}],"wp:attachment":[{"href":"
https:\/\/geiselmed.dartmouth.edu\/gsr\/wp-json\/wp\/v2\/media?parent=828"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}