Active Research Data
Research Drive provides an initial 25Tb storage allocation for free, and is the starting point for research data storage needs.
Because we are a member of the UW Health Care Component, you need to specify that this needs to be a Restricted Research Drive when you follow this link to create an account – https://kb.wisc.edu/researchdata/internal/page.php?id=93998#account
Some use cases for RRD (Restricted Research Drive) –
- Mount via NFS/SMB onto appropriate server/node for compute access
- Share a dataset as a “collection” via Globus
- Setup a share for lab group
- Create home directories for lab staff
Storing Data (Archival data)
Campus provides a 50TB storage allotment for eligible researchers
Some use cases –
- Data archiving
- Backup software target (with open source tools such as Restic or Kopia, free tools like Duplicati, and commercial backup tools such as Commvault, IBM Storage Protect, Synology NAS, Oracle, etc)
- Web applications – via the S3 API
- Static web content
- Big data analysis (Go Language, Julia, R and Python, just to name a few stats-centric things, are some of many things that can connect to S3 Object Storage via the S3 API)
- Instrumentation/IoT data
Sharing Data
Globus is a file transfer option available to share datasets with other researchers. As we are a member of the Health Care component here at UW, as always we need to follow HIPAA guidelines.
To use Globus, you’ll need to follow these instructions to first get a Restricted Research Drive, and then follow the directions for using Globus here –https://kb.wisc.edu/internal/page.php?id=109337
Please review these guidelines if need to think about serving restricted and/or HIPAA data over the internet
For non-restricted data sharing and collaboration there are many options-
On Campus –
WiscWeb – https://wiscweb.wisc.edu
Google Drive – https://drive.google.com
Minds@UW – https://www.library.wisc.edu/digital-library-services/minds
Knowledge Base – https://it.wisc.edu/services/knowledgebase
DoIT Web Hosting – here https://it.wisc.edu/services/web-hosting and here https://kb.wisc.edu/webhosting/109899
DoIT provides a nice matrix of options that may assist you in determining your web solution.
Beyond the UW-
Galaxy – Maximum size limit is 50 GB (uncompressed). 250 GB of storage is available per Galaxy account. – https://usegalaxy.org/
CyVerse Discovery Environment – 5 GB of space for free, but can be relatively slow to display. Offers a paid subscription service to expand space. https://de.cyverse.org/de/
Gitlab – files limited to 100MB, but very fast. https://git.doit.wisc.edu/users/sign_in
Figshare – not limited and fast, but every file needs to be uploaded individually and cannot be changed. Optimal for very stable links, e.g. in publications. https://figshare.com/
UW is a member of the open data publishing platform, Dryad –
Dryad – https://datadryad.org/stash
NIH also has a listing of data sharing sites – https://sharing.nih.gov/data-management-and-sharing-policy/sharing-scientific-data/repositories-for-sharing-scientific-data
Moving Data
Globus is a file transfer option available to share datasets with other researchers. As we are a member of the Health Care component here at UW, as always we need to follow HIPAA guidelines. In order to use Globus, you’ll need to follow these instructions to first get a Restricted Research Drive, and then follow the directions for using Globus here –
https://kb.wisc.edu/internal/page.php?id=109337
Links to SMPH Research platforms and systems –
Platform X
Research Computing Platform
SMPH Informatics
This includes two clinical research systems: REDCap data storage and OnCore clinical trial management system.
i2b2
This NIH-funded tool provides an easy-to-use, self-service method for UW researchers to query the UW Health Enterprise Data Warehouse (EDW)