{"id":2563,"date":"2026-02-12T14:32:54","date_gmt":"2026-02-13T00:32:54","guid":{"rendered":"https:\/\/www.hawaii.edu\/infosec\/?page_id=2563"},"modified":"2026-02-13T13:45:01","modified_gmt":"2026-02-13T23:45:01","slug":"research-security","status":"publish","type":"page","link":"https:\/\/www.hawaii.edu\/infosec\/research-security\/","title":{"rendered":"Protecting Research Data"},"content":{"rendered":"
The primary strategy for mitigating sensitive data risk is data minimization: evaluating what is strictly necessary and reducing the volume, identifiability, and retention period of project data.<\/p>\n
Before starting your research, audit every intended data field. If a variable is not essential to your analysis, do not collect it. Avoid gathering highly sensitive Personally Identifiable Information (PII) such as Social Security Numbers, financial data, or Protected Health Information (PHI) unless there is a compelling rationale<\/strong> for doing so.<\/p>\n When handling PHI, follow the HIPAA Safe Harbor method<\/strong>. This requires removing 18 specific identifiers<\/a> to prevent the re-identification of participants.<\/p>\n Note: Unauthorized exposure of this data constitutes a breach, carrying severe financial and reputational consequences for the University.<\/p>\n<\/li>\n Regularly review datasets for unintentionally captured information that could compromise anonymity. This includes:<\/p>\n Data risk is a function of time: the longer it is held, the higher the risk.<\/p>\n Maintain an accurate, up-to-date inventory of where all research data is stored. Establish a “transfer of custody” process to ensure that datasets are not orphaned as team members or student researchers depart. Data should always be associated with a current, responsible lead to prevent security gaps.<\/p>\n<\/li>\n<\/ol>\n Following collection, prioritize the immediate removal or replacement of PII. De-identifying data breaks the link between sensitive variables and individual identities. Depending on your research requirements, you may apply multiple methods in tandem.<\/p>\n\n
\n
De-identification Strategies by Risk Level<\/h2>\n