DataSHIELD

DataSHIELD ecosystem is an infrastructure and series of R packages that enables the remote and non-disclosive analysis of sensitive research data. It has been used with real world and consented research data for over 10 years.

Get started
1632 commitsLast commit ≈ 1 month ago14 stars26 forks

Description

Open source:
DataSHIELD comprises a collection of open source infrastructure developed by OBiBa and Molgenis and R packages for analytic functionality maintained by the DataSHIELD community comprising contributors from academia, healthcare, industry and SMEs.

Federated analysis:
DataSHIELD analysis can be aggregated from individual patient data at each location or a global analysis can be conducted simultaneously at all sites without sharing or moving individual patient data. It is a software solution for secure data analysis of personal health data in the programming language R, in which data holders can grant data access to researchers without physical data transfer.

Automated output checking:
DataSHIELD analytic functions have been designed to only share non disclosive summary statistics, with built in automated output checking based on statistical disclosure control. Only data sites can set the threshold values for the automated output checks.

Documentation and installation instructions:
An installation manual of DataSHIELD can be found as part of the official OBiBa Opal documentation or the NFDI4Health project. For detailed documentation, visit the DataSHIELD wiki

Logo of DataSHIELD
Keywords
Programming language
  • R 100%
License
</>Source code

Reference papers

Member of community

nfdi4health