When a thesis is completed, there are a few actions that need to be taken regarding the research data. Firstly, it needs to be decided:
The answers to these questions depend on agreements made with potential collaborators (such as a company or RDI project) regarding data ownership, usage rights, and preservation. They also depend on the consent given by the research participants.
Unless otherwise agreed upon, the student owns the research data of their thesis. In some cases, ownership of the data may be transferred to the collaborating organization, such as a company or Metropolia. The student may also grant usage rights to the data.
Unnecessary files should be destroyed when the thesis is completed. If your research involves human subjects, you will need to specify when and how the data will be destroyed after the research is finished in privacy notice.
If your research data contains personal information or other confidential data, these should be promptly destroyed once you no longer need them for your thesis, as they pose a privacy risk. Simply "deleting" the data and emptying the trash bin is not sufficient for data destruction. Follow the guidelines provided by the helpdesk for proper data destruction. Paper-based materials should be disposed of in a secure trash.
If you have collected a high-quality and interesting research dataset, you may want to reuse it yourself or offer it for others to use after completing your thesis. The success of reusing the dataset should be ensured in advance through:
Seeking permissions retroactively can prove to be impossible, so if you plan to reuse your dataset, take it into account already in your data management plan.
Opening the dataset refers to making the research dataset freely available for others to use. This typically involves depositing the dataset in an open data repository. Opening the dataset can be facilitated by utilizing Creative Commons licenses, through which the dataset owner grants usage rights to the dataset.
Note that opening the dataset requires good data management practices from the very beginning. If you are interested in opening your dataset after completing your thesis, discuss this with your supervisor as early as possible. Opening the dataset should be considered in various aspects, including collaboration agreements, informing research participants, and anonymising any sensitive data.
Dataset anonymisation refers to the process of processing the dataset in a way that it no longer contains any identifiable information. In the case of personal data, individuals cannot be reasonably identified from the dataset. Organizational information or other confidential data can also be anonymised from the dataset.
Even if you do not directly collect personal data from the research participants, it may still be possible to identify them from the dataset. For example, an anonymous survey may not be truly anonymous if participants can reveal information about themselves in open-ended responses or if the survey form records the respondent's IP address. Such a dataset is not anonymous and is subject to data protection laws.
Techniques for anonymisation include:
Metropolia Library and Information Services | Accessibility Statement