Helix and Recursion join forces in quest to crack the AI-driven drug discovery code

 The UK Biobank, with genetic data on half a million individuals, set a benchmark for scale in medical research. It contains whole genome sequencing data for all 500,000 participants, making it the world’s largest genetic project to date.



A recent collaboration between the San Diego–headquartered clinico-genomics company Helix and the Salt Lake City–based biotech Recursion Pharmaceuticals aims to go even further. “We could surpass [UK Biobank] probably by the end of 2025,” says Hylton Kalvaria of Helix.

The partnership will combine Helix’s growing clinico-genomic dataset with more than 25 petabytes of Recursion’s proprietary biological and chemical data.

The world is awash in data. To cite one example, the latest CommonCrawl dataset, encompassing 17 years of website data, totals 377 tebibytes. That’s equivalent to the storage capacity of more than 3 million 128-gigabyte USB flash drives.

“There are larger data sources out there,” Kalvaria acknowledged. “But [the combined Helix-Recursion dataset] is still a heck of a lot of data and, importantly, it adds to itself over time.”

The “sequence once, query often” model: Unlocking a lifetime of genetic insights


Helix isn’t focused on building a massive genomic database alone. The company follows a “sequence once, query often” approach to genomic data management. “Your genetics are the one thing about your health that never changes over time,” Kalvaria said. Having that information available upfront can offer a range of advantages whether in routine care or disease diagnostics. “There are many reasons why you might come back to that [genomic] information over time,” Kalvaria added.

For patients, a single DNA sequencing can provide insights for years to come, eliminating the need for further repeated genetic tests. For physicians, access to a patient’s genomic information can support quicker, more informed decisions. Kalvaria notes, “If a patient has already been sequenced, results can be obtained in minutes rather than weeks.” In the longer run, doctors could potentially confirm or rule out genetic conditions during an office visit using such data in the context of patient care. As scientists learn more about how genes relate to diseases, such readily available genetic data could help identify patients who might benefit from specific treatments or preventive measures.

The advantages also extend to healthcare systems: For instance, reducing duplicate testing can lead to cost savings and more efficient use of resources. The potential impact of this method goes beyond just saving time and money.

Comments

Popular posts from this blog

Sanofi’s Sarclisa wins in transplant-eligible multiple myeloma trial

MDMA Papers Retracted for ‘Unethical Conduct’ on Heels of Lykos’ FDA Rejection

Cell and gene therapy companies must demonstrate strong value proposition for investment