dsSynthetic Package

Development name

dsSynthetic

Name of server-side packages

dsSynthetic

Name of client-side packages

dsSyntheticClient

Date this information was late updated/checked

30/05/2022

Description of packages purpose

This package can be used to generate a synthetic data set on the client side by running the generation on the server side. Users can then perform harmonisation while working with full access to synthetic data on the client to confirm algorithms are working as expected. When the user is happy that the algorithms are working correctly, they can then be applied to the real data on the server side. The user therefore has the benefit of being able to see the data they are working with, but without the need to go through labourious data transfer processes. The same benefits are realised for an analysis user.

How to contact developer institution/team/individual

Tom Bishop <trpb2@cam.ac.uk>, MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge. UK.

Soumya Banerjee <sb2333@cam.ac.uk>, MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge. UK.

Latest version

0.0.2

Type distribution licence

GNU General Public License v3.0

Methods of obtaining package

CRAN Address

-

Web-site/ftp-site/other
-

What versions of R work with the package?

≥ 3.5.0

What R packages do the packages depend on?

dsSynthetic

synthpop
dsBase

dsSyntheticClient

DSI (≥ 1.2.0)
methods
dsBaseClient
simstudy

Status

"draft" / "proof-of-concent"

Is the package tested?

No formal testing

Is the package documented?
Has the package had a disclosure audit?
No.

Is the package suitable for deployment in the production environment? (Yes/No)

Not yet suitable

Does your package have features to protect the privacy of data, or does it just provide remote analysis functionality?

No features yet to protect privacy other than the fact that synthetic data are generally less disclosive than real data

Additional Information

dsSynthetic: A DataSHIELD package to generate synthetic data: https://tombisho.github.io/synthetic_bookdown/

"Synthetic: Synthetic data generation for the DataSHIELDfederated analysis system"
,  Soumya Banerjee, Tom R.P. Bishop