Page URL:

Chinese scientist removed early COVID-19 genomic data from scientific database

28 June 2021
Appeared in BioNews 1101

A controversial paper suggests that Chinese researchers retracted early COVID-19 viral sequences from a US database to obscure their origins.

Dr Jesse Bloom, evolutionary biologist of the Fred Hutchinson Cancer Research Centre in Seattle, Washington, detailed in an unreviewed preprint how he recovered these deleted files. He explained that these sequences support other evidence that SARS-CoV-2, the virus which leads to COVID-19, did not originate in Wuhan's Huanan seafood market, Hubei, China.

'I don't think this bolsters either the lab origin or zoonosis hypothesis... it provides additional evidence that this virus was probably circulating in Wuhan before December [2019]... and... we have a less than complete picture of the sequences of the early viruses,' said Dr Bloom.

Dr Bloom discovered the deleted sequences in a study published in June 2020 in the journal, Small. The paper contained sequence data that Dr Bloom had not previously seen, and that he could not find on the public database, Sequence Read Archive (SRA). However, he managed to find the deleted sequences in the SRA's Google cloud storage – recovering data from 50 samples in total.

The National Institute of Health (NIH), Bethesda, Maryland, which oversees the SRA, stated that they deleted the sequences at the request of the depositing author, who announced that he planned to submit them to another database.

Dr Bloom has not found these sequences elsewhere and argues that these early sequences will increase our understanding of the origins of the virus. The deleted sequences, likely collected in January and February 2020, are more closely related to bat viruses than the those seen in individuals linked to the seafood market. This supports the hypothesis that the SARS-CoV-2 virus did not originate in the Huanan seafood market.

The preprint has amassed conflicting reactions from across the globe. Professor Ian Lipkin from Columbia University, New York, said, 'This is a creative and rigorous approach to investigating the provenance of SARS-CoV-2... The two take-home points are that the virus was circulating before the outbreak linked to the Wuhan seafood market, and that there may have been active suppression of epidemiological and sequence data needed to track its origin.'

Others were less impressed with Dr Bloom's approach, including Professor Andrew Preston from the University of Bath, 'The language of the paper is unusual, it contains a significant degree of supposition and conjecture, cites blog posts, and appears to be pointing towards a deliberate cover up by Chinese authorities of early sequence data from Wuhan. However, this is an entirely subjective appraisal of the situation, which will be very difficult to confirm or disprove.'

Dr Bloom, aware of the controversy surrounding his preprint, added: 'No matter how much people like [my paper] or don't like it, or agree with the interpretation or disagree with the interpretation, they can at least go download it and repeat it themselves.'

The paper is currently published as a preprint in bioRxiv that has not yet been peer-reviewed.

25 January 2021 - by Christina Burke 
Monitoring of the COVID-19 pandemic has relied heavily on genomic sequencing, but experts warn more needs to be done to control the virus...
18 January 2021 - by Dr Rachael Boyle 
Genomic epidemiological analysis of the initial UK COVID-19 outbreak has allowed a team of researchers to accurately characterise the early chains of transmission throughout the country...
21 December 2020 - by Dr Joanne Delange 
Session three of the Progress Educational Trust (PET) annual conference explored the genetic and genomic links to susceptibility to severe COVID-19...
18 May 2020 - by Chris Wigley 
COVID-19 has caused unprecedented levels of damage and disruption to the global population and economy, with no current cure or vaccine to prevent it. But, five months after the virus began to spread, there are still many questions that need to be answered...
to add a Comment.

By posting a comment you agree to abide by the BioNews terms and conditions

Syndicate this story - click here to enquire about using this story.