There are many possibilities within DataSHIELD. This section of the wiki is organised into bite-size chunks which explain separately how to perform key aspects of analysis. If you prefer full tutorials, an archive of previously given workshops is available here.
Two example datashield servers are set up and available to practice on. Each tutorial assumes that you have first logged in to these servers using the steps outlined in Section 2: logging in.
Beginnng to use DataSHIELD has a steep learning curve. We recommend that you start with Section 1: Key concepts where we explain in simple language how analysis with DataSHIELD works.
An Armadillo and Opal server are available to be freely used by anyone learning DataSHIELD. To use the tutorials in the wiki, you first need to login to these servers. Section 2: logging in shows you how. Both servers also contain a number of example datasets which can be used for general practice.
The tutorials in Section 3: Session management explain useful things you might need to do to manage your session (e.g. loading and saving workspaces), but they do not involve actual analysis.
Section 4 deals with returning non-disclosive information about serverside objects, such as their dimensions.
There are two options for data manipulation in DataSHIELD: (i) Use the functionality within dsBaseClient
or, (ii) use dsTidyverseClient
. We recommend using dsTidyverseClient
as it is quicker and contains many more features. If you are unavailable to use dsTidyverseClient
(e.g. if your consortia does not support it), you can also view tutorials on data manipulation soley using dsBaseClient
.
Section 5.1: Data manipulation with dsTidyverseClient
Section 5.2: Data manipulation with dsBaseClient
Section 6 illustrates how to return descriptive statistics using DataSHIELD and transform them into neater tables.
Section 7: Making figures explains how to make figures using built-in DataSHIELD functions, and how to make your own custom figures using summary statistics.
Many packages are available to conduct analysis in DataSHIELD, and contain their own tutorials about specific analysis. In this section we show how to run two common and useful models: general linear models using both 1-stage virtual pooling and 2-stage meta-analysis. Links to further tutorials are show in the 'further information' section at the end of this page.
Section 8: Linear models with 1-stage and 2-stage meta analyis.
Over the years many DataSHIELD workshops and lectures have been given. Here you can find links to videos covering many different topics in DataSHIELD.
If you would like to contribute your own tutorials, please read these guidelines.