The UK Biobank project, a massive health database containing genetic and medical data from half a million British volunteers, has come under scrutiny after it was revealed that confidential health records were listed for sale on a Chinese website. While the records have been removed and no sales are believed to have occurred, the incident has reignited concerns about data protection and the security of participant information.
What is the UK Biobank Project?
Launched in 2003, the UK Biobank recruited 500,000 participants aged 40 to 69 between 2006 and 2010. Volunteers provided genetic data, clinical measurements, health information, biological samples, and lifestyle data, and continue to undergo regular follow-ups. Since 2012, researchers have been able to request access to anonymised data to study the causes, prevalence, and treatment of numerous diseases.
Scientific Achievements
The UK Biobank has been a resounding success, with thousands of research papers published based on its data. Key discoveries include identifying four blood proteins that could help diagnose dementia before symptoms develop. Last year, the project celebrated scanning the brains, hearts, and other organs of 100,000 participants, aiding early disease detection and providing insights into human ageing.
Scans have revealed that even small amounts of alcohol are linked to changes in brain size and structure, diabetes can affect heart structure, and Covid-19 infections may damage the brain's smell centre. Recent research using Biobank data has suggested that air pollution can accelerate the onset of many diseases, and the data has been used to train an AI tool that predicts the risk of over 1,000 diseases.
Prof Andrew Morris, director of HDR UK, highlighted the unique depth of the Biobank: "It is among the largest studies for imaging, protein biomarkers, genomics and more. But not only that – it links all this together for investigation. The depth of research enabled by this across all disease areas is really unique and why it is heralded worldwide."
Privacy Concerns and the Recent Incident
On Thursday, it was revealed that UK Biobank participant data was listed for sale on three separate listings on the Chinese website Alibaba. At least one listing is thought to have contained data from all 500,000 volunteers. The data was "de-identified," meaning names, addresses, and precise dates of birth were not included. While no sales are thought to have been made, this is not the first time data protection concerns have been raised.
Last month, the Guardian reported numerous instances of participant health data being leaked online by researchers, which in some cases could be traced back to volunteers. Prof Luc Rocher of the Oxford Internet Institute noted that this incident marks the 198th known exposure of UK Biobank data since last summer, with data remaining available online for anyone to download.
Response from UK Biobank
Prof Rory Collins, chief executive of UK Biobank, wrote to participants to reassure them that their personal identifying information is "safe and secure." He announced new security measures, including restricting the size of files that researchers can export from the research platform to "severely limit" their ability to export de-identified participant data. He also stated that a comprehensive board-led investigation would be conducted.
Expert Reactions
Experts have welcomed the swift removal of the listings from Alibaba. Prof John Gallacher of the University of Oxford expressed reassurance: "As a 'Biobanker' I am reassured that the value of my small contribution to global health is jealously guarded." However, others have called for a full investigation. Morris emphasised the importance of participant trust: "The future of healthcare is increasingly data-dependent. We must double down on implementation of secure systems to enable essential research that is responsible, trusted and can operate at scale."



