Confirming Proteoforms from Long-read mRNA-seq Data in Deep Proteomics Data
Title: Confirming Proteoforms from Long-read mRNA-seq Data in Deep Proteomics Data
DNr: NAISS 2024/23-587
Project Type: NAISS Small Storage
Principal Investigator: Yuqi Zheng <yuqizh@kth.se>
Affiliation: Kungliga Tekniska högskolan
Duration: 2024-10-24 – 2025-11-01
Classification: 10203
Keywords:

Abstract

The central dogma describes the conversion process from DNA to proteins: "DNA is transcribed into RNA, and then RNA is translated into proteins.” However, it is an intricate process with numerous potential variations in practice, resulting in a diverse set of protein products from each gene. Here, “proteoform” describes each possible molecular form of proteins generated from a single gene, accounting for genetic variations, alternatively spliced RNA transcripts, and post-translational modifications. We have identified two interesting resources. First, we have access to a deep proteomics set and, second, we have an extensive long-read mRNA-seq data describing the abundance and variation of mRNAs, for such cells. We aim to examine if the long-read transcripts that are different from the canonical definitions of transcripts can also be retrieved in the deep proteomics data.