May 19, 2026

Scientists uncover thousands of new proteins in ‘dark proteome’

Scientists uncover thousands of new proteins in ‘dark proteome’

An international consortium has found over 1,700 new proteins from noncoding DNA, many of which show potential as targets for cancer immunotherapy

Predicted binding between a non-canonical Open Reading Frame (blue) and traditional protein (yellow). Credit: Leron Kok Princess Máxima Center for Pediatric Oncology.

In a collaborative effort, scientists in the TransCODE consortium reveal more than 1,700 new proteins that could have implications for human diseases, including cancer. These tiny proteins were found in what’s called the ‘dark proteome’, a portion of the human genome historically thought to be nonfunctional. 

The TransCODE consortium is a collaboration between experts at EMBL’s European Bioinformatics Institute (EMBL-EBI), the Princess Máxima Center for Pediatric Oncology in the Netherlands, the University of Michigan, the Institute for Systems Biology in Seattle, and the Massachusetts Institute of Technology (MIT).

In the study, published in Nature, the scientists looked at 7,264 non-canonical Open Reading Frames (ncORFs) and found that around 25% of them produce small, protein-like molecules. The team coined the term ‘peptidein’ for this new category of microproteins. All data about these new proteins have been made publicly available to accelerate research worldwide.

“What we’re now seeing is a vast set of protein-like molecules that were effectively invisible before,” said Jonathan Mudge, co-first author on the paper and Annotation Project Leader at EMBL-EBI. “In a sense, we’ve been looking at biology through an incomplete lens.”

A new group of proteins called peptideins

The consortium will add peptideins to reference databases, such as GENCODEUniProt, and PeptideAtlas. Due to their small size, ncORFs and the proteins they encode have largely been absent from these databases. “We introduced the term ‘peptidein’ as a way to bring these molecules out of the shadows and into reference annotation,” said Mudge.

These data will help researchers get a more complete picture of biological processes in and around different types of cells. Scientists could, for example, identify more precise targets for immunotherapy, or compare candidates for new therapies and analyse data more effectively. The consortium intends to continue making peptidein data open as soon as it becomes available.

Implications for cancer treatment and beyond

As many of the newly detected peptideins are found on immune cell surfaces, they could be potential targets for cancer immunotherapy. “Cancer cells express high levels of these molecules, making them a potential new source of biomarkers and therapeutic targets,” said Mudge. A number of peptideins are already under development as drug targets. 

To find out whether any of the peptideins are essential for cell survival, the consortium used CRISPR gene-editing screens. A gene called OLMALINC, which was previously thought to be noncoding, stood out. When this peptidein was made inactive, 85% of the 485 cancer cell lines tested showed impaired survival.

They also found that it plays a role in cell division and DNA damage response. Despite these promising findings, a role for the OLMALINC peptidein in normal, healthy cells remains unclear.

Previous work by members of the consortium has already identified a peptidein that plays an essential role in medulloblastoma, a particularly aggressive form of childhood brain cancer. With the new classification, the team hopes to encourage similar investigations into other diseases.

This press release was adapted from the full press release published on the Princess Máxima Center website.

Funding

The study was conducted by the TransCODE consortium, an international collaboration of more than 60 researchers from over 30 institutions worldwide. The consortium is co-led by the Princess Máxima Center for pediatric oncology in the Netherlands, the University of Michigan Medical School, EMBL’s European Bioinformatics Institute (EMBL-EBI) in Hinxton (UK), and the Institute for Systems Biology in Seattle. The consortium continues its work and collaborates with other dark proteome initiatives, such as the recently awarded Cancer Grand Challenge to Team ILLUMINE.

The study was supported by funders including the US National Institutes of Health, the National Science Foundation, Oncode Accelerator (a Dutch National Growth Fund program) and the European Union’s Marie-Skłodowska-Curie program.

Our latest News

discover more
Providing a cellular ‘all-clear’ signal to resume protein synthesis

Providing a cellular ‘all-clear’ signal to resume protein synthesis

Researchers’ newest technologies allow them to discover tiny protein that plays a big role in awakening dormant cells when it’s safe to resume normal operations Summary It’s a tough world for microbes. When resources grow limited and environments worsen, microbes have figured out ways to hunker down and go dormant until conditions improve. However, scientists […]

Scientists uncover thousands of new proteins in ‘dark proteome’

Scientists uncover thousands of new proteins in ‘dark proteome’

An international consortium has found over 1,700 new proteins from noncoding DNA, many of which show potential as targets for cancer immunotherapy In a collaborative effort, scientists in the TransCODE consortium reveal more than 1,700 new proteins that could have implications for human diseases, including cancer. These tiny proteins were found in what’s called the […]

The Plasticity of Malignant Brain Tumors as a Potential Therapeutic Approach: Sibylle Assmus Prize for Neuro-Oncology Awarded to Moritz Mall

The Plasticity of Malignant Brain Tumors as a Potential Therapeutic Approach: Sibylle Assmus Prize for Neuro-Oncology Awarded to Moritz Mall

The ability of cancer cells to change their identity makes tumors such as glioblastoma particularly aggressive and resistant to treatment. Moritz Mall, of the German Cancer Research Center (DKFZ) and the Hector Institute for Translational Brain Research (HITBR), is investigating how this plasticity can be specifically restricted. His approach focuses on genetic “guardians” that suppress […]

GET IN TOUCH

Stay Updated with bioRN’s Newsletter

Sign up for our newsletter to discover more!
* required

BioRN (BioRN Network e.V. and BioRN Cluster Management GmbH) will use the information you provide on this form to be in touch with you and to provide updates and marketing. Please let us know all the ways you would like to hear from us:

You can update your subscription preferences or unsubscribe at any time. Just follow the unsubscribe or update link in the footer of automated emails you receive from us, or by contacting us at info@biorn.org. We will treat your information with respect. For more information about our privacy practices please visit our website: www.biorn.org. By clicking below, you agree that we may process your information in accordance with these terms.

We use Mailchimp as our marketing platform. By clicking below to subscribe, you acknowledge that your information will be transferred to Mailchimp for processing. Learn more about Mailchimp's privacy practices.

Intuit Mailchimp