Meet The Team

Paul Burton
Paul Burton is Professor of Data Science for Health at Newcastle University. He is Principal Investigator and lead of the DataSHIELD project and jointly leads the statistical functionality team. Centrally involved in incorporating generalized linear modelling (glm) and generalized linear mixed modelling (glmer) into DataSHIELD, with multi-study inference based either on full joint likelihood applied to Individual Patient Data (IPD) or through centrally controlled study-level meta-analysis.
Demetris Avraam
A core developer of DataSHIELD statistical functionality and provider of support to DataSHIELD users. Developer of statistical methodologies for secure multi-party computations and privacy-preserving data visualisations. Focusing on statistical approaches, including data synthesis and anonymization techniques, that aim to mitigate the disclosure risk associated with sensitive data without degrading the scientific value of the data.
Patricia Ryser-Welch
Research Associate in Data-Science. Particular expertise in distributed software architecture, algorithms and computation, scalable machine learning and bio-inspired methodologies. Leading on user interaction design and on developing innovative features required for non-parametric statistics and scalable machine learning in non-disclosive federated analysis. Centrally involved in developing DataSHIELD’s conceptual and testing frameworks.
Stuart Wheater
Stuart Wheater is a senior member of the DataSHIELD team, involved in all aspects of the development and maintenance of the DataSHIELD system, with expertise in managing the release, continuous integration and quality assurance processes.
Alex Westerberg
Research Assistant in Statistics on the DataSHIELD project. Major contributor to online presence: keeping the website up to date; maintaining the accuracy of wiki articles; and following current developments. Demonstrator for beginners’ workshop. Developer and tester of new statistical analysis functions for use within DataSHIELD.
Leire Abarrategui Martinez
Research Assistant in Biostatistics on the DataSHIELD project with knowledge of the omics field. Collaborator in the LITMUS project, developing infrastructure and functions for sharing and analysing data safely. Contributes to implementing functions for omics data analysis and is responsible for function documentation and testing.
Hugh Garner
Hugh Garner works on DevOps for DataSHIELD, including developing a load testing framework, containerization methods and data management. His research interests focus on data visualisation methodology, particularly explorative methods for high-dimensional hierarchical datasets and time series.
Elaine Smith
Communications and Platform Coordinator for the DataSHIELD project. Contributor to the project's online presence: updating the website and social media. Admin assistant to the DataSHIELD Advisory Board. Diary management for Paul Burton.
Harald Binder
Coordinator of “Use Case 2” in the MIRACUM project, which aims to jointly analyse data from several German university hospitals with DataSHIELD as the main infrastructure. Research focus on deep learning and, more generally, machine learning techniques in DataSHIELD.
Tom Bishop
Senior Data Scientist and technical work package lead on the EUCAN-Connect project. Aetiology and Mechanisms of Diabetes and Related Metabolic Disorders of Later Life
Olly Butters
Population health data researcher working to join up datasets and gain insight from complex systems; part of the NIHR Applied Research Collaboration (Northwest Coast). Previously worked for Newcastle University on a variety of health data projects, including: DataSHIELD, the 1958 Birth Cohort Genetics Repository and Connected Health Cities NENC.
Soumya Banerjee
I analyze complex problems and implement new statistical and machine learning techniques for deriving insights from large amounts of data. My research is in the field of modelling of biological systems. I apply mathematical modelling to understand biological systems. I also apply machine learning to the field of healthcare.
Tim Cadman
Developmental Psychopathology Researcher, i.e. understanding what causes mental health problems such as anxiety and depression to develop over childhood. Collaborating on the H2020 LifeCycle project. Focusing on understanding how social deprivation and exposure to urban environmental stressors (such as noise and air pollution) affect parental and child mental health.
Rui Camacho
Statistical methodology; function developer
Isabel Fortier
Data harmonization expert and principal investigator of Maelstrom Research. Developing methods and tools to improve rigour and cost efficiency of the data harmonisation process. Generation of scientifically founded harmonized datasets is essential to optimise the DataSHIELD users’ experience.
Gonçalo Campos Gonçalves
Researcher at INESC TEC working for RECAP Preterm and EUCAN-Connect. Developing and maintaining the CORAL distribution, which leverages software containers to reduce the effort needed to get the OBiBa and DataSHIELD infrastructure up and running for both development and production.
Julian Gründner
Research Associate specialising in: machine learning for patient-specific predictions; cross-hospital data analysis systems and infrastructures; data integration and standardised vocabulary.
Sido Haakma
Yann Joly
Adviser on the ethical and legal aspects of DataSHIELD, covering applicable data privacy and ethical standards across global jurisdictions. A member of the Canadian Commission for UNESCO (CCU) Sectoral Commission for Natural, Social and Human Sciences. Current Chair of the Bioethics Workgroup of the International Human Epigenome Consortium (IHEC) and Co-Lead of the regulatory and ethics work stream of the Global Alliance for Genomics and Health (GA4GH). Previously Chair (2017-2019) of the Ethics and Governance Committee of the International Cancer Genome Consortium (ICGC). Member of the Human Genome Organization (HUGO) Committee on Ethics, Law and Society (CELS).
Elinor Jones
Inferring causality from observational epidemiological data, federated analysis of data when data cannot be pooled, statistics education. Elinor graduated in Mathematics and Statistics from the University of Warwick before completing a PhD in Probability Theory at the University of Manchester. Her thesis examined the large deviations of random walks and Lévy processes. Prior to joining UCL, Elinor worked as a Research Associate in Genetic Epidemiology at the University of Leicester and as a Statistician at the University of Reading.
Bartha M Knoppers
Previously Chair of the Ethics and Governance Committee of the International Cancer Genome Consortium (2009-2017). Currently Chair of the Ethics Advisory Panel of WADA (2015- ), and was the Co-Chair of the Regulatory and Ethics Workstream of the Global Alliance for Genomics and Health (2013-2019).
Stefan Lenz
Researcher and software developer working on implementing deep learning methods in DataSHIELD, integrating functionality from the Julia programming language. Also involved in the development and setup of the DataSHIELD infrastructure for the GESA and MIRACUM projects.
Yannick Marcon
Leader of the OBiBa project and owner of the Epigeny consulting company. In charge of the design and implementation of the DataSHIELD middleware: the DataSHIELD R and Java APIs, and the Opal data warehouse. Maintainer of various R packages (DSI, DSOpal, DSLite, resourcer, opalr) available in the official R packages repository. Committed to finding innovative solutions for big and complex data handling in DataSHIELD.
Andrei Morgan
Andrei Morgan is a paediatrician / neonatologist, an epidemiologist, and a free software and open science advocate. He trained in the UK but since 2017 has been based in Paris, France, where he works as a researcher at INSERM (Institut National de la Santé et de la Recherche Médicale - the French National Institute for Health and Medical Research), and also does some work as a doctor. His main interest is focused on understanding differences between countries in the organisation of health care and their consequent impact on long-term population health. In his spare time, he enjoys being with friends, good conversation, cooking, reading, cycling and fire juggling.
Madeleine Murtagh
She leads a team of social and data scientists conducting social studies of sociotechnical and normative practices in health research data sharing and governance, particularly interventionist, collaborative ethnographic studies with and of the progress, emergent values, outcomes and social effects of health and genomic data science. Alongside this work, she develops new forms of governance for responsible and responsive data sharing in health services, biobanks and cohort studies which centrally involve citizens and research participants in ethical and governance decision-making. Madeleine established and chairs the METADAC data and samples access governance infrastructure for nine UK longitudinal studies.
José Pedro Ornelas
Researcher at INESC TEC working for RECAP Preterm and EUCAN-Connect. Developing and maintaining the CORAL distribution, which leverages software containers to reduce the effort needed to get the OBiBa and DataSHIELD infrastructure up and running for both development and production.
Angela Pinot de Moira
Angela is an epidemiologist at the University of Copenhagen, where her research focuses on how early-life exposures influence the gut microbiome, the development of the immune response and the risk of asthma and allergy. Since 2017, she has been actively involved in the Horizon2020 LifeCycle Project, a collaboration that is bringing together data from multiple birth cohorts across Europe and Australia to establish a FAIR (findable, accessible, interoperable and reusable) data resource known as the EU Child Cohort Network. In this project she has helped to develop protocols for data harmonisation and establish the infrastructure for the federated analysis of data using DataSHIELD. Angela is now using DataSHIELD to conduct her own research into how early-life exposure to pets influences the risk of allergic sensitization, asthma and impaired lung function in childhood.
Juan Ramón González
Juan R Gonzalez is an Associate Research Professor at ISGlobal where he leads the Bioinformatic Group in Genetic Epidemiology (BRGE). He has co-authored more than 160 scientific papers published in peer-reviewed journals and has extensive experience in large international research projects. As an Adjunct Professor at the Department of Mathematics at Autonomous University of Barcelona, his educational activities focus on providing graduate lectures on advanced statistical methods and post-graduate lectures in biostatistics and in omic data analyses based on his book “Omic association studies with R and Bioconductor”. His work focuses on the development of statistical and computational methods for the analysis and comprehension of different omic data motivated by a wide range of questions of biological and clinical relevance. These include genomic (CNVs, inversions, mosaicisms, loss of chromosome Y), transcriptomic (RNA-seq and alternative splicing) and proteomic data, among others. The analysis of the exposome, its characterization and how to integrate it with different omic layers is another active line of research of his group, as part of an FP7 European project called HELIX whose aims are going to be further investigated in the H2020 project called ATHLETE, in which Juan R Gonzalez is a WP leader. His group is also very active in developing tools to perform non-disclosive data analyses using DataSHIELD.
Artur Rocha
Artur has been a senior researcher at INESC TEC, Portugal since 1998. He is assistant coordinator of the Centre for Information Systems and Computer Graphics (CSIG). Current research interests include platforms and methods for collaborative research, privacy-preserving distributed computation, the semantic sensor Web (IoT) and Big Data processing. From October 1996 to December 1997, he was an associate member of CERN - European Laboratory for High Energy Physics, IT Division/Web Office. His research is applied in two major areas: Personalized Health Research (PHR) and Earth and Ocean Observation Science (EOOS). The PHR area currently subdivides into: a) personalised Internet-based treatments; and b) human data storage, privacy-preserving processing and controlled FAIR data sharing. In this area, he participates in several European projects, including ICT4Depression (FP7), E-COMPARED (FP7), STOP Depression (EEA Grant), iCare4Depression (FCT), RECAP Preterm (H2020), EUCAN-Connect (H2020) and iReceptor Plus (H2020). In these projects, he often undertakes the responsibility for the system’s architecture, platform implementation, or technical coordination.
Paula Raissa Silva
Working on the RECAP Program
Morris Swertz
Becca Wilson
Becca Wilson holds an interdisciplinary research career spanning the space, atmospheric and health sciences and has contributed to the DataSHIELD project for the past seven years. She is currently a research fellow at University of Liverpool developing software for the sharing, access and immersive data visualisation of sensitive health data and is also a project specialist in the NIHR Applied Research Collaboration North West Coast developing novel data processes fundamental to health inequalities research. She is an advocate for open-source software and open research practices through her fellowship with the UK Software Sustainability Institute and membership of the R Forwards Task Force in the R Software Foundation.
Daniela Zöller
Statistician developing statistical methodology in the context of DataSHIELD with a focus on variable selection, machine learning, and local prediction models. Additionally supports clinical researchers in using DataSHIELD in the context of the GESA and the MIRACUM project.

The DataSHIELD project would not have matured and developed in its current form without the contributions of the following individuals:


Hasan A.J.K.Al Radhi
Newcastle University
Dr Chris Newby
University of Leicester
Isabelle Budin
Norwegian Institute of Public Health
Dr Neil Parley
University of Bristol
Yutong (Sam) Cai
Imperial College London
Dr Anne Marie Tasse
McGill University
Professor Vincent Ferretti
Ontario Institute for Cancer Research
Dr Jonathan Tedds
University of Leicester
Dr Amadou Gaye
University of Leicester, University of Bristol
Professor John Thompson
University of Leicester
Jennifer Harris
Norwegian Institute of Public Health
Dr Andrew Turner
University of Leicester, University of Bristol
Dr Susan Hodgson
Imperial College London
Professor Edwin van den Heuvel
Eindhoven University of Technology
Dr Julia Isaeva (Kutschke)
Norwegian Institute of Public Health
Dr Susan Wallace
University of Leicester
Dr Joel Minion
University of Leicester, University of Bristol
Dr Wilma Zijlema
University of Groningen