Machine Learning Finds Tumor Gene Variants and Sensitivity to Drugs in The Cancer Genome Atlas

Cancer Genome Atlas

Matching unique genetic information from cancer patients’ tumors with treatment options – an emerging area of precision medicine efforts – often fails to identify all patients who may respond to certain therapies. Other molecular information from patients may reveal these so-called “hidden responders,” according to a Penn Medicine study in Cell Reportsthis week. The findings are published alongside several papers in other Cell journals this week examining molecular pathways using The Cancer Genome Atlas (TCGA).

“Targeted sequencing can find individuals with certain mutations that are thought to confer susceptibility to anti-cancer drugs,” said senior author Casey Greene, PhD, an assistant professor of Pharmacology in the Perelman School of Medicine at the University of Pennsylvania. “But many people may lack these mutations, and as machine learning approaches improve they may help guide these patients to appropriate therapies.”

Greene and first author and doctoral student Gregory P. Way used machine learning to classify abnormal protein activity in tumors. This branch of artificial intelligence develops computer programs that can use new data to learn and make predictions. The algorithm they devised to search TCGA integrates genetic data from 33 different cancer types. Greene and Way used information from the transcriptome, the grand total of all messenger RNAs expressed within an individual.

They specifically applied their model to the Ras pathway, a family of genes that make proteins that govern cell replication and death. Changes in the normal function of Ras proteins – mutations which are responsible for 30 percent of all cancers – can power cancer cells to grow and spread. These mutations are often referred to as the “undruggable Ras,” having beaten back a variety of investigational inhibitor drugs and vaccine-based therapies.

“This model was trained on genetic data from human tumors in The Cancer Genome Atlas and was able to predict response to certain inhibitors that affect cancers with overactive Ras signaling in an encyclopedia of cancer cell lines,” Greene said. The upshot is that the transcriptome is underused in bringing precision to oncology, but when combined with machine learning it can aid in identifying potential hidden responders.

The Penn team collaborated with coauthor Yolanda Sanchez, PhD, a cancer biologist from the Geisel School of Medicine at Dartmouth College. They are working together to mesh her identification of compounds that target tumors with runaway Ras activity and tumor data (analyzed by machine learning) to find patients who could benefit from these potential cancer drugs.

“For precision medicine to benefit individuals in real time, we must develop robust models to efficiently test efficacy of potential therapies,” Sanchez said. “We can use this very powerful combined approach of machine learning-guided drug discovery using Avatars, which are mice carrying identical copies of a patient’s tumors. The Avatars allow our interdisciplinary team to identify the tumors with runaway Ras activity and evaluate and compare multiple therapies in real time.”

This work was funded in part by grants from the Gordon and Betty Moore Foundation (GBMF 4552), the National Institutes of Health (R01 NS095411, T32 HG000046), and the National Cancer Institute funding of TCGA (U54 HG003273, U54 HG003067, U54 HG003079, U24 CA143799, U24 CA143835, U24 CA143840, U24 CA143843, U24 CA143845, U24 CA143848, U24 CA143858, U24 CA143866, U24 CA143867, U24 CA143882, U24 CA143883, U24 CA144025, P30 CA016672).

Source : Penn Medicine