This code is used in conjunction with the Standard Operating Procedure for anonymising data for release for use by clinical researchers according to the terms of the end user license.
We have made this repository public in the interests of transparency.
- sdcMicro
- ccdata
- Prepare a YAML configuration file. You can use the following function to create a template for you.
template.conf("template.yaml")
-
Using
sdc.trial
to find the most suitable parameters such as K-anonymity or L-diversity. If it is specified in the SOP, then this step can be skipped. -
Create the new ccdata using
anonymisation
function.
# Load libraries
# Note, either modify your .Rprofile to contain the location on non-standard libraries, or remember to add the lib.loc = "/data/RLibraries/" argument
library(cleanEHR)
library(ccanonym)
library(yaml)
# Load data
load("data/current_db")
# Run anonymiser
ccd <- anonymisation(
alldata,
conf="release-internal.yaml",
verbose=T,
remove.alive=FALSE,
k.anon=5)
anon_conf <- yaml.load_file("release-internal.yaml")
# Now save both data and configuration into a single R data file
save(file="anon_internal.RData", list=c("ccd", "anon_conf"))