Research Data

Active Research Data

Research Drive provides an initial 25 TB storage allocation at no cost and is the starting point for research data storage needs.

Because we are a member of the UW Health Care Component, you must request a Restricted Research Drive when you follow this link to create an account – https://kb.wisc.edu/researchdata/internal/page.php?id=93998#account

Some use cases for RRD (Restricted Research Drive) –

  • Mount via NFS/SMB onto appropriate server/node for compute access
  • Share a dataset as a “collection” via Globus
  • Set up a share for a lab group
  • Create home directories for lab staff

Storing Data (Archival data)

Campus provides a 50 TB storage allotment for eligible researchers.

S3 Object Storage

Some use cases –

  • Data archiving
  • Backup software target (open source tools such as Restic or Kopia, free tools like Duplicati, and commercial backup tools such as Commvault, IBM Storage Protect, Synology NAS, Oracle, etc.)
  • Web applications – via the S3 API
  • Static web content
  • Big data analysis (Go, Julia, R, and Python are among the many languages that can connect to S3 Object Storage via the S3 API)
  • Instrumentation/IoT data
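
As a rough illustration of the data archiving use case, here is a minimal Python sketch that uploads one file through the S3 API using the third-party boto3 library. The endpoint URL, bucket name, environment variable names, and the archive_key helper are all hypothetical placeholders, not values from campus documentation; substitute the details issued with your own allocation.

```python
import os
from pathlib import PurePosixPath

# Hypothetical values: replace with the endpoint and bucket for your allocation.
S3_ENDPOINT = "https://s3.example.edu"
BUCKET = "my-lab-archive"


def archive_key(project: str, local_path: str) -> str:
    """Build an object key of the form 'project/filename' for a local file."""
    return str(PurePosixPath(project) / os.path.basename(local_path))


def upload_for_archive(local_path: str, project: str) -> str:
    """Upload one file through the S3 API and return the key it was stored under."""
    import boto3  # third-party; imported here so archive_key works without it

    client = boto3.client(
        "s3",
        endpoint_url=S3_ENDPOINT,
        # Credentials are read from (hypothetically named) environment variables.
        aws_access_key_id=os.environ["S3_ACCESS_KEY"],
        aws_secret_access_key=os.environ["S3_SECRET_KEY"],
    )
    key = archive_key(project, local_path)
    client.upload_file(local_path, BUCKET, key)
    return key
```

Calling upload_for_archive("/data/run1/scan.nii.gz", "imaging-2024") would store the file under the key imaging-2024/scan.nii.gz, assuming the credentials and bucket exist. Restricted or HIPAA data should only go to storage approved for it.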

Sharing Data

Globus is a file transfer option for sharing datasets with other researchers. Because we are a member of the UW Health Care Component, we must always follow HIPAA guidelines.

To use Globus, first follow these instructions to get a Restricted Research Drive, and then follow the directions for using Globus here – https://kb.wisc.edu/internal/page.php?id=109337

Please review these guidelines if you need to serve restricted and/or HIPAA data over the internet.

For non-restricted data sharing and collaboration, there are many options –

On Campus –
WiscWeb – https://wiscweb.wisc.edu

Google Drive – https://drive.google.com

Minds@UW – https://www.library.wisc.edu/digital-library-services/minds

Knowledge Base – https://it.wisc.edu/services/knowledgebase

DoIT Web Hosting – https://it.wisc.edu/services/web-hosting and https://kb.wisc.edu/webhosting/109899

DoIT provides a matrix of options that may help you choose a web solution.

Beyond the UW –
Galaxy – Maximum size limit is 50 GB (uncompressed). 250 GB of storage is available per Galaxy account. – https://usegalaxy.org/

CyVerse Discovery Environment – 5 GB of space for free, but can be relatively slow to display. Offers a paid subscription service to expand space. https://de.cyverse.org/de/

GitLab – files are limited to 100 MB, but access is very fast. https://git.doit.wisc.edu/users/sign_in

Figshare – not limited and fast, but every file needs to be uploaded individually and cannot be changed. Optimal for very stable links, e.g. in publications. https://figshare.com/

Dryad – UW is a member of this open data publishing platform. https://datadryad.org/stash

NIH also has a listing of data sharing sites – https://sharing.nih.gov/data-management-and-sharing-policy/sharing-scientific-data/repositories-for-sharing-scientific-data

 

Moving Data

Globus is a file transfer option for moving datasets and sharing them with other researchers. Because we are a member of the UW Health Care Component, we must always follow HIPAA guidelines. To use Globus, first follow these instructions to get a Restricted Research Drive, and then follow the directions for using Globus here –

https://kb.wisc.edu/internal/page.php?id=109337
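
Once Globus access is set up per the directions above, a transfer can also be scripted. The following is a minimal Python sketch, assuming the Globus CLI is installed and you have already run globus login. It builds and submits a recursive globus transfer command. The two collection UUIDs and the function names are hypothetical placeholders; look up your real collection IDs in the Globus web app.

```python
import subprocess
from typing import List

# Placeholder collection UUIDs (hypothetical): replace with your own.
SOURCE_COLLECTION = "00000000-0000-0000-0000-000000000000"
DEST_COLLECTION = "11111111-1111-1111-1111-111111111111"


def build_transfer_command(src_path: str, dst_path: str, label: str) -> List[str]:
    """Return the argv for a recursive `globus transfer` between two collections."""
    return [
        "globus", "transfer",
        f"{SOURCE_COLLECTION}:{src_path}",
        f"{DEST_COLLECTION}:{dst_path}",
        "--recursive",       # copy the directory and everything under it
        "--label", label,    # human-readable name shown in the Globus task list
    ]


def submit_transfer(src_path: str, dst_path: str, label: str) -> None:
    """Submit the transfer; raises CalledProcessError if the CLI reports failure."""
    subprocess.run(build_transfer_command(src_path, dst_path, label), check=True)
```

For example, submit_transfer("/lab/raw/", "/archive/raw/", "raw data copy") would queue a directory copy between the two collections. Building the command separately from running it makes it easy to print and review before any data moves.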

Links to SMPH Research platforms and systems –

Platform X

Research Computing Platform

SMPH Informatics

This includes two clinical research systems: REDCap data storage and OnCore clinical trial management system.

i2b2

This NIH-funded tool provides an easy-to-use, self-service method for UW researchers to query the UW Health Enterprise Data Warehouse (EDW).
