91影库

News

Machine learning plus insights from genetic research shows the workings of cells

And may help develop new drugs for COVID-19 and other diseases
Shang Gao Jalees Rehman
By Shang Gao and Jalees Rehman
Aug. 29, 2021

We combined a machine learning algorithm with knowledge gleaned from hundreds of biological experiments to develop that allows biomedical researchers to figure out the functions of the proteins that turn genes on and off in cells, called transcription factors. This knowledge could make it easier to develop drugs for a wide range of diseases.

RNA-445x247.jpg
The subtleties of how genes are transcribed into RNA molecules like the one depicted
here are key to understanding the inner workings of cells.

Early on during the COVID-19 pandemic, found that only a small group of cells in these organs were most vulnerable to being infected by the SARS-CoV-2 virus. That allowed researchers to focus on blocking the virus’s ability to enter these cells. Our technique could make it easier for researchers to find this kind of information.

The biological knowledge we work with comes from this kind of RNA sequencing, which gives researchers a snapshot of the hundreds of thousands of RNA molecules in a cell as they are being translated into proteins. A widely praised machine learning tool, the , has helped researchers all across the world discover new cell populations in healthy and diseased organs. This machine learning tool processes data from single-cell RNA sequencing without any information ahead of time about how these genes function and relate to each other.

Our technique takes a different approach by adding knowledge about certain genes and cell types to find clues about the distinct roles of cells. There has been more than a decade of research identifying all the potential targets of transcription factors.

Armed with this knowledge, we used a mathematical approach called . In this technique, prior knowledge is converted into probabilities that can be calculated on a computer. In our case it’s the probability of a gene being regulated by a given transcription factor. We then used a machine learning algorithm to figure out the function of the transcription factors in each one of the thousands of cells we analyzed.

We , called Bayesian Inference Transcription Factor Activity Model, in the journal Genome Research and also made the software so that other researchers can test and use it.

Why it matters

Our approach works across a broad range of cell types and organs and could be used to develop treatments for diseases like COVID-19 or Alzheimer’s. Drugs for these difficult-to-treat diseases work best if they target cells that cause the disease and avoid collateral damage to other cells. Our technique makes it easier for researchers to home in on these targets.

SARS-CoV-2-infection-890x765.jpg
National Institute of Allergy and Infectious Diseases
A human cell (greenish blob) is heavily infected with SARS-CoV-2 (orange dots), the virus that causes COVID-19, in this colorized microscope image.

What other research is being done

Single-cell RNA-sequencing has revealed how each organ can have 10, 20 or even more subtypes of specialized cells, each with distinct functions. A very exciting new development is the emergence of spatial transcriptomics, in which RNA sequencing is performed in a spatial grid that allows researchers to study the RNA of cells at specific locations in an organ.

A used a Bayesian statistics approach similar to ours to figure out distinct roles of cells while taking into account their proximity to one another. Another research group and studied the distinct functions of neighboring cells.

What’s next

We plan to work with colleagues to use our new technique to study complex diseases such as Alzheimer’s disease and COVID-19, work that could lead to new drugs for these diseases. We also want to work with colleagues to better understand the complexity of interactions among cells.

This article is republished from under a Creative Commons license. Read the .

The Conversation

Enjoy reading 91影库Today?

Become a member to receive the print edition four times a year and the digital edition monthly.

Learn more
Shang Gao
Shang Gao

Shang Gao is a doctoral student in bioinformatics at the University of Illinois at Chicago.

Jalees Rehman
Jalees Rehman

Jalees Rehman is a professor of medicine and pharmacology at the University of Illinois at Chicago.

Get the latest from 91影库Today

Enter your email address, and we鈥檒l send you a weekly email with recent articles, interviews and more.

Latest in Science

Science highlights or most popular articles

Hope for a cure hangs on research
Essay

Hope for a cure hangs on research

July 17, 2025

Amid drastic proposed cuts to biomedical research, rare disease families like Hailey Adkisson鈥檚 fight for survival and hope. Without funding, science can鈥檛 鈥渃atch up鈥 to help the patients who need it most.

Before we鈥檝e lost what we can鈥檛 rebuild: Hope for prion disease
Feature

Before we鈥檝e lost what we can鈥檛 rebuild: Hope for prion disease

July 15, 2025

Sonia Vallabh and Eric Minikel, a husband-and-wife team racing to cure prion disease, helped develop ION717, an antisense oligonucleotide treatment now in clinical trials. Their mission is personal 鈥 and just getting started.

Defeating deletions and duplications
News

Defeating deletions and duplications

July 11, 2025

Promising therapeutics for chromosome 15 rare neurodevelopmental disorders, including Angelman syndrome, Dup15q syndrome and Prader鈥揥illi syndrome.

Using 'nature鈥檚 mistakes' as a window into Lafora disease
Feature

Using 'nature鈥檚 mistakes' as a window into Lafora disease

July 10, 2025

After years of heartbreak, Lafora disease families are fueling glycogen storage research breakthroughs, helping develop therapies that may treat not only Lafora but other related neurological disorders.

Cracking cancer鈥檚 code through functional connections
News

Cracking cancer鈥檚 code through functional connections

July 2, 2025

A machine learning鈥揹erived protein cofunction network is transforming how scientists understand and uncover relationships between proteins in cancer.

Gaze into the proteomics crystal ball
In-person Conference

Gaze into the proteomics crystal ball

July 1, 2025

The 15th International Symposium on Proteomics in the Life Sciences symposium will be held August 17鈥21 in Cambridge, Massachusetts.