Research data storage and backup
Research Data Management
Best Practices |
|
Data Storage and BackupOn this page: Data protection | Data access and sharing | Version control | Backing up data | Other computing resources questions Data storage and backup is important because:
See also Archiving & Preservation, Intellectual Property, and how to Share Data. Data ProtectionIn order to maintain the integrity of stored data, project data should be protected from physical damage as well as from tampering, loss, or theft. This is best done by limiting access to the data. PIs should decide which project members are authorized to access and manage the stored data. Notebooks or questionnaires should be kept together in a safe, secure location away from public access, e.g., a locked file cabinet. Privacy and anonymity can be assured by replacing names and other information with encoded identifiers, with the encoding key kept in a different secure location. Ultimately, the best way to protect data may be to fully educate all members of the research team about data protection procedures. How Can Data Be Protected?Theft and hacking are particular concerns with electronic data. Many research projects involve the collection and maintenance of human subjects data and other confidential records that could become the target of hackers. The costs of reproducing, restoring, or replacing stolen data and the length of recovery time in the event of a theft highlight the need for protecting the computer system and the integrity of the data. Electronic data can be protected by taking the following precautions: Protecting access to data:
Protecting your system:
Protecting data integrity
In addition:
Data Access and SharingUnderstanding the flow of the data will assist in determining appropriate access controls and security measures as the data moves around. The growth of interdisciplinary and cross-institutional research has seen a corresponding change in the need to share data: it has become much easier, faster, and more reliable than ever before. Consider how you will share the research data ahead of time and determine if there will be cost implications. How much data will there be? How long do I need to keep it? How will I access it in the future?One of the most costly considerations in any IT implementation is the ability to store and restore data. It is important to weigh the cost of the storage against the risk of data loss. Using cheap desktop storage without any redundancy or backups could end up being the most expensive choice you make. When planning storage and backup architecture, take into account the following points:
For more information on storage infrastructure offerings at Oregon, and how to plan and implement storage for your research data, see: http://it.uoregon.edu/systems/services/storage. Campus Based StorageThe University of Oregon Information Technology offers limited file storage to students, faculty and staff. While this data is backed up regularly, it may not provide the security or protection required of your research work. Additional network based storage may be available via UO Information Technology Storage Services. Cloud Based StorageCloud based storage holds data on remote servers which can reduce the burden of access and management issues. However, protected and sensitive data should not be stored on third-party servers. Costs related to data transfer can add considerably to the budget required for cloud-based storage. TeraGrid Repositories and Data CentersRepositories and data centers are options for published/publicly available datasets, but should not be viewed as primary storage during the research project. Version ControlVersion control software is useful for tracking updates to files as you work with them. Backing Up DataData should be backed up on a regular basis. Scheduled backups, to an off-site location (i.e., a different building or a different geographic area) will help protect from catastrophic data loss. Contact the IT group for your unit Other Computing Resources QuestionsBelow, are specific questions grant applicants should consider to ensure appropriate IT needs are available or requested in the grant application.
1. What computing resources are needed for this research project?Regardless of what type of research you are undertaking, you may stand to benefit from expanding your computing resources beyond a simple computer-per-researcher setup. Consider the following objectives and how additional lab resources or leveraging central resources, could streamline your effort:
2. Servers and Compute CyclesI have heard a bit about virtual machines… what role could they play in my research effort? Many groups at Oregon have implemented partial or complete virtual infrastructures as part of their computing strategy. Virtual server infrastructures offer several advantages over physical infrastructures. They require a specific set of expertise to plan and implement, virtual server service is already offered by Information Services. Leveraging virtualization may help your research project meet sustainability, cost containment, efficiency, and scalability goals. Contact your local IT support staff, or see UO Information Services VMware for further information. 3. Storagea. How will my data be accessed and shared? Understanding the flow of the data will assist in determining appropriate access controls and security measures as the data moves around. The growth of interdisciplinary and cross-institutional research has seen a corresponding change in the need to share data: it has become much easier, faster, and more reliable than ever before. Consider how you will share the research data ahead of time and determine if there will be cost implications. b. How much data will there be? How long do I need to keep it? How will I access it in the future? One of the most costly considerations in any IT implementation is the ability to store and restore data. It is important to weigh the cost of the storage against the risk of data loss. Using cheap desktop storage without any redundancy or backups could end up being the most expensive choice you make. When planning storage architecture, take into account the following points:
For more information on storage infrastructure offerings at Oregon, and how to plan and implement storage for your research data, contact your local IT support staff or see UO Information Services Storage information. See Archiving & Preservation and consult with NSF guidance or other specific funded research guidance for resources and information. 4. SecurityCan the security requirements of your data be identified? Will protected data (HIPAA, FERPA, PHI, etc.) make up my research data? Both the University and many sponsors have specific requirements for handling, transmitting, and storing certain types of data used in research. Familiarize yourself with those details ahead of time: it can be costly if you hadn't anticipated your data protection needs until after the fact. See Intellectual Property and Sharing Data pages, and the Office for the Protextion of Human Subjects if you are dealing with human subjects data. 5. ApplicationsWhat applications does my project require? You need licensed software for all your current systems and any additional computers you may purchase. You might also need additional user licenses for your project applications. The applications you use may undergo several updates or revisions over the duration of your project, ensure that you have an application maintenance contract or budget for upgrades. Contact your local IT support staff and see UO Information Services Software Licensing. 6. SupportHow is all of this going to stay running? Does my research group possess the appropriate level of expertise to support my computing resources? Do I want my researchers spending their time patching servers? Should I really use my research assistants to run this critical environment? As computing needs become more fundamental to your research, ensure that you have support options covered. You don't want to suspend research to address repairs or problems. Knowing who the local network administrators and systems/storage administrators are and how to contact them in the event of problems is important. They can help to schedule ongoing maintenance to ensure interruptions to your research are minimized. The cost to have your environment managed by IT professionals may be less than you expect. Contact your local IT support services and/or Sean Sharp in Information Services. |
|
| Maintained by: Brian Westra, bwestra@uoregon.edu | |
