Abstract
Individuals affected by a rare disease often experience a long and arduous diagnostic odyssey. Delivery of genetic answers in a timely manner is critical to affected individuals and their families. Multi-omics, a term which usually encompasses genomics, transcriptomics, proteomics, metabolomics and lipidomics, has gained increasing popularity in rare disease research and diagnosis over the past decade. Mass spectrometry (MS) is a technique allowing the study of proteins, metabolites and lipids and their fragments at scale, enabling researchers to effectively determine the presence and abundance of thousands of molecules in a single test, accurately quantify their specific levels, identify potential therapeutic biomarkers, detect differentially expressed proteins in patients with rare diseases, and monitor disease progression and treatment response. In this review, we focus on mass spectrometry (MS)-based omics and survey the literature describing the utility of different MS-based omics and how they have transformed rare disease research and diagnosis.
Similar content being viewed by others
Introduction
Mass spectrometry (MS) is an analytical tool in modern research that has advanced greatly since it was first invented by J.J. Thomson more than a century ago1,2. The technology itself has improved from the first-generation mass spectrometers that were used to separate the isotopes of uranium, through to liquid chromatography coupled tandem mass spectrometry (LC-MS/MS) now being used in day-to-day research activities3,4,5. Moreover, its application has evolved from fundamental chemistry to routine application in all aspects of the life sciences6,7,8,9,10. MS-based proteomics offers several modalities, including accurate quantitation of specific protein levels, the detection of post-translational modifications, and the detection of differentially expressed proteins. MS-based metabolomics and lipidomics have gained significant attention in the fields of newborn bloodspot screening (NBS) and biomarker discovery. Of note, we have seen the uptake of MS in rare disease research at unprecedented levels over the past decade.
Contrary to common perceptions, rare diseases are collectively not rare. Genetic conditions are a leading cause of death in children worldwide, with ~400 million individuals affected by rare diseases globally11. Most of these conditions have their onset in childhood, and one-third of those affected children will not survive until 5 years of age (https://globalgenes.org). The application of next-generation sequencing (NGS), exome or genome sequencing, has transformed the approach to genetic diagnosis for individuals with rare, presumed monogenic diseases. However, one of the major limitations is the difficulty in assessing the pathogenicity of the selected candidate variants. The situation is augmented by the growing integration of whole genome sequencing (WGS) into standard clinical practice, which has enabled the identification of additional variants detectable exclusively through WGS12. In many cases, these variants require further functional characterisation. MS, like many other laboratory-based methods (e.g. RNA sequencing or RNAseq), has received growing attention in the last two decades. This is due to its untargeted and high-throughput nature and ability to accurately identify the differences in the proteome of a patient compared to unaffected individuals.
MS has been used extensively in the rare disease research setting, including for biomarker discovery and protein profiling during functional genomics studies. More recently, it has started taking centre stage in providing additional evidence for the validation of variant pathogenicity in rare disease diagnosis. A recent study demonstrated an increased diagnostic yield using a multi-omics approach focused on mitochondrial diseases13, while another showed how proteomics can replace traditional clinical pathology testing for the detection of mitochondrial disorders14. However, a systematic validation of the utility of MS in the broader rare disease context is yet to be undertaken. This review of MS-based omics techniques focuses primarily on proteomics but also discusses some metabolomics and lipidomics studies. About three quarters of the reviewed articles were focused on proteomics, the reasons being: (1) proteins are the direct effectors of biological processes that are crucial for understanding disease mechanisms, and provide the most straightforward actional insights; (2) proteins are directly related to the structure and function of the cells and tissues, enabling the understanding of the complex molecular pathways disrupted in diseases; (3) the ability to detect low abundance proteins allowing the capture of a greater breadth of diagnostic information compared to metabolomics/lipidomics which usually identifies a narrower range of analytes; (4) proteomics is a more well-established approach compared to metabolomics and lipidomics in terms of conceptual and technical aspects of its delivery. We have also excluded epitope-based assays coupled with NGS, such as the proximity extension assay (PEA) known as Olink®15,16. This is due to it being a targeted rather than untargeted approach and relatively new technology with limited published use cases in the context of diagnostic genomics at the time of writing17,18. While Olink technology is powerful for profiling large numbers of samples with a dynamic range exceeding that of MS-based proteomics19, its targeted nature relies on the presence of epitopes for each protein. This may impact its suitability for the identification of proteins with missense or protein-truncating variants. Moreover, these assays use a panel of bead-antibody complexes rather than MS, restricting their use to specific sample types and the identification of known and usually well-characterised proteins.
Here, we have surveyed the published literature where MS was used in rare disease functional analyses to examine practical applications of the technology across common variant types and different modes of inheritance. Publications have been broadly categorised according to the reasons for using MS. Combinations of search terms, including 'rare disease', 'rare condition', 'diagnosis', 'mutation', 'proteomics', and 'mass spectrometry', were used to search PubMed and MEDLINE databases. A three-step workflow was adopted in this review (Fig. 1). A total of 267 papers were identified using these predefined search terms. These papers were then reviewed for their eligibility and relevance to the scope of this article. The final selection process led to the inclusion of relevant studies but excluded general reviews and monogenic germline cancer studies to produce this review paper.
We searched two databases, PubMed and MEDLINE, to generate the list of articles for review. A combination of different search terms, including 'rare disease', 'rare condition', 'diagnosis', 'mutation', 'mass spectrometry' and 'proteomics' were used. A total of 267 publications from both databases were selected for further analysis. Thirteen publications were duplicated and, therefore, removed from the list. This left us with 254 publications for the next filtering step. We first screened the title of all 254 publications and excluded 60 that were either general review articles or publications that related to monogenic germline cancer studies. The resultant 194 publications were all research articles focusing on monogenic inherited diseases. They were all retrieved and assessed for their eligibility to be included in this manuscript. We then excluded 25 publications that were not MS-based omics studies. This left a total of 169 publications that were included in this final manuscript.
Orthogonal testing
While this is a great success that NGS can achieve diagnostic rates of up to 50%, nearly half the individuals with a suspected rare disease remain undiagnosed20,21. Receiving a definitive genetic answer is crucial for effective patient care and management and can potentially avoid further testing and lead to savings in healthcare costs22. Two major contributors to cases being undiagnosed are when true causative variants are either not selected during genomic data analysis or are recognised as potentially relevant but remain a variant of uncertain significance (VUS). The latter typically require functional evidence to support VUS pathogenicity. There are two barriers in generating functional evidence in a timely and efficient manner. These are the broad range of (mostly low throughput) potential functional assays and the large number of potential causative genes associated with monogenic conditions. This time-consuming and costly process could be alleviated by the availability of validated high-throughput, untargeted functional assays. An example of an emerging approach is RNAseq, which offers the advantage of characterising and identifying splicing defects on a transcriptome-wide scale in patients with suspected rare diseases23,24.
Similarly, in recent years MS-based proteomics has also contributed to the diagnosis of many mitochondrial-related conditions25,26,27,28,29,30. In most instances, filtered variants of interest are selected from the initial genomic NGS analysis, and MS was used as an orthogonal validation tool to assist with the interpretation of variant pathogenicity. Proteomic data can enable the examination of (1) the abundance of candidate proteins encoded by candidate genes, (2) complexes associated with candidate protein, and (3) interactors and binding partners of the candidate protein. A recent publication looking at relative changes to interactors of mitochondrial proteins has benchmarked MS-based proteomics against the traditional targeted analysis of respiratory chain enzyme assays for mitochondrial diseases14. Importantly, this study also demonstrated that MS-based proteomics can be deployed for real-time use in a diagnostic context (and is discussed in a later section of this review).
The MS approach has been exemplified by several independent studies on Leigh syndrome, a mitochondrial condition known to be associated with defects in 16 mitochondrial DNA genes and approximately 100 nuclear genes31. For example, potential causative variants were found in genes such as MRPS34, NDUFC2 and NDUFAF8. The negative impact of the variants on the abundance of both the candidate protein and interacting proteins was demonstrated in patient cell lines32,33,34. In one of these studies, MS-based proteomics also identified a complex I assembly defect through complexome profiling. This approach involves separating mitochondrial protein complexes by native polyacrylamide gel electrophoresis and analysing gel fractions to identify their individual protein components33. Combined with other lines of evidence, the results confirmed the pathogenicity of the selected variants.
The application of MS extends beyond mitochondrial disorders. Examples of its utility have been demonstrated in FHL1, EXOC2 and COL3A1 disorders, which impact processes such as signal transduction, vesicle trafficking and collagen synthesis35,36,37. MS-based proteomics identified FHL1 as a major component of intracytoplasmic inclusions, a pathogenic feature of FHL1-related myopathies. Subsequent genomic NGS identified the disease-causing variants. MS-based proteomics was also used to demonstrate the loss of the candidate EXOC2 protein carrying a pathogenic variant as well as other protein components of the exocyst complex in patient fibroblasts. Similarly, a study of the dysregulated extracellular matrix proteins associated with pathogenic variants in the COL3A1 gene also highlights the utility of MS-based proteomics in rare disease diagnosis. Moreover, a recent study also applied MS following ultra-rapid genomic sequencing of a patient with encephalopathic episodes found to have biallelic NUP214 variants38. One of the missense variants was already classified as pathogenic, while the other was a missense VUS. Quantitative MS-based proteomics confirmed the reduced level of the NUP214 protein. It also showed a reduction in NUP88, its physical interactor within the human nuclear pore complex, along with several other nuclear pore complex components. This allowed the reclassification of the VUS variant to likely pathogenic, thereby establishing the molecular diagnosis.
Disease gene discovery
Up to half of individuals with a rare disease remain undiagnosed despite exhaustive examination of clinically relevant gene panels restricted to known monogenic disorder genes. In these cases, unknown pathogenic variants may lie in a novel gene and go unnoticed during variant curation. MS-based proteomics has proven its worth in tackling this challenge. For example, a recent study presented two undiagnosed individuals with Leigh syndrome39, despite extensive genomic analysis that included clinical exome sequencing. One of them endured a diagnostic odyssey for more than two decades. Proteomics was performed on skin fibroblasts from both individuals and identified a significant reduction in the abundance of several proteins of the large mitoribosomal subunit, with MRPL39 protein showing the most pronounced alteration. Functional defects of mitochondrial complexes I, III and IV were also identified through relative complex abundance (RCA) analysis. This is a metric derived from proteomics data that quantifies co-dependence of subunit abundances in mitochondrial complexes, in both individuals. These proteomic findings instigated subsequent focused genomic and transcriptomic studies that confirmed the presence of pathogenic compound heterozygous variants in the novel mitoribosomal gene MRPL39. Other novel genes have also been linked to disease through the application of proteomic analysis, such as MRPL5040, MRPS3941, TEFM42 and HMGCS143. These examples highlight some of the most promising outcomes that MS-based proteomic studies can offer in the discovery of new disease genes.
Proteomics focused studies on disease models have also led to disease gene discovery. In one example, a comprehensive catalogue of human mitochondrial protein function was developed through MS-based multi-omics analysis of more than 200 knockout cell lines44. Among millions of distinct biomolecules measured, they discovered novel associations of the PYURF protein with both complex I assembly and coenzyme Q biosynthesis. This led to the identification of a homozygous pathogenic variant in a previously undiagnosed individual affected by a multisystemic mitochondrial disorder. In the same study, RAB5IF was also identified as a novel disease gene through the same approach where a causal relationship was identified between the gene and an individual with craniofacial dysmorphism, skeletal anomalies, and impaired intellectual development syndrome 2.
Alternative analysis techniques, for example, genomics, transcriptomics and traditional targeted assays have also been used widely for novel disease gene discovery. Both genomic and transcriptomic approaches have become well-established technologies in clinical settings45,46,47. They offer significant value for identifying genetic variants and elucidating their impact on gene expression. However, these approaches often fall short of providing the functional insights on protein stability and abundance provided by MS. On the other hand, traditional targeted assays like western blotting and ELISAs may be more practical for validation but lack the comprehensive depth, broad scope and sensitivity provided by MS-based proteomics.
Disease modelling
Understanding the interactors, binding partners, basic biological functions, and pathways of a protein can often provide valuable insights into the disease mechanisms associated with that protein. There are approximately 20,000 protein-coding genes according to the Human Genome Project48, implying that there should be a similar number of proteins in the human proteome49. However, this number could be up to several million if we consider various splicing events, tissue-specific isoforms and post-translational modifications50. Traditional methods for protein characterisation include western blotting, yeast two-hybrid and immunoprecipitation studies, with each having specific strengths and limitations. For example, SDS-PAGE and western blotting rely on the use of one antibody, which generally detects one protein. Therefore, the cost of performing such analyses rises steadily as the number of antibodies used increases. Moreover, the specificity of antibodies is also often not clear, with many commercial antibodies identifying multiple protein products51. Untargeted MS-based proteomics is a powerful tool in the field of biomedical research that has transformed the way we can study protein function and biology. The ability to identify thousands of proteins is a great advantage of MS-based proteomics over targeted and low-throughput protein-based approaches such as western blotting. In a report published in 2016, LC-MS/MS was successfully used to show cystinosin interacted with multiple components of the mTORC1 signalling pathway, for which there was a lack of reliable antibodies52. Another study highlighted the potential of MS-based proteomics in elucidating large multi-subunit complexes, thereby enabling the investigation of complex dysfunction at a cellular level using human knockout cell lines53.
The mechanisms underlying many rare diseases remain unclear to this day, requiring extensive research to unveil the disease aetiology. Over the past few decades, MS has been increasingly applied in functional studies to understand the molecular function of newly discovered genes53. This is often accomplished through model systems instead of patient samples, particularly when traditional diagnostic approaches have often been inadequate. A study using affinity purification of ectopically expressed POLR3B coupled to MS-based proteomics characterised the pathogenic impact of POLR3B variants on the protein interaction with other members of the RNA polymerase III complex54. Combined with knock-in mouse models, the researchers showed that the variants cause severe interruption of RNA polymerase III complex assembly, thus advancing our understanding of the disease mechanisms of this leukodystrophy.
Newborn bloodspot screening—first line of testing
Newborn bloodspot screening (NBS) was introduced in the 1960s, initially to detect phenylketonuria. Later developments allowed for the introduction of tests for conditions such as congenital hypothyroidism and cystic fibrosis. However, the paradigm for NBS was very much 'one test for one condition' until the late 1990s. Developments in MS, particularly the introduction of electrospray ionisation, which equally benefited MS-based proteomics, allowed the development of a single MS-based metabolomic test for multiple inborn errors of metabolism55. Since the 2000s, most NBS laboratories have introduced this technology56,57 and babies are now typically screened for 20–30 inborn errors of metabolism using a targeted panel of up to 50 metabolites, mainly amino acids and acylcarnitines. Previously, many of these disorders were diagnosed clinically, often when infants were extremely unwell and sometimes post-mortem. The early detection of these conditions has greatly improved outcomes. For example, medium-chain acyl-CoA dehydrogenase deficiency occurs at an overall birth prevalence of 5.3 per 100,000, with a higher prevalence for individuals with European ancestry58 and can be rapidly fatal in undiagnosed infants59. This condition is now readily detected as part of the NBS metabolomic panel resulting in a significant reduction in infant mortality60,61.
NBS testing is rather different to clinical testing: there is no phenotype information to guide result interpretation, and unlike clinical testing, the starting premise is that babies are assumed to be normal until there is convincing screening evidence that a baby has an NBS condition. False positive results cause considerable parental anxiety as well as unnecessary work for medical and pathology staff involved with confirmatory testing. Biomarkers, therefore, require high diagnostic sensitivity and specificity and should be limited to those specifically targeting the corresponding NBS conditions. Furthermore, the requirement to screen hundreds of babies per day with fast turn-around times and low assay failure rates places considerable demands on testing technology and laboratory organisation. Flow injection tandem mass spectrometry (FIA-MSMS) has been successful in meeting these demands, and metabolomic panels continue to be expanded to include new conditions such as X-linked adrenoleukodystrophy (X-ALD). Suboptimal biomarker diagnostic performance can be improved by using second-tier metabolomic tests that also use MS. For example, FIA-MSMS analysis of C26:0 lysophosphatidylcholine, the main biomarker for X-ALD, suffers from interferences which can cause false positives. Second-tier LC-MSMS analysis of these samples is effective at minimising interference and greatly improves NBS diagnostic performance62.
Biomarker screening
The ability to detect multiple proteins or metabolites concurrently, and the high-throughput nature of MS, brings another major application in the field of biomarker discovery. Friedreich ataxia (FRDA) is a rare disease caused by pathogenic variants in the FXN gene in almost all cases63. The rarity of the disorder and its relatively slow progression mean that any clinical trials will face the challenges of not hitting recruitment targets. MS-led biomarker studies can assist with these issues by the inclusion of proteome analysis in a smaller cohort over a shorter timeframe to assess treatment efficacy. A study designed to identify changes in cerebrospinal fluid (CSF) proteins occurring in FRDA patients exemplifies this approach64. This group identified 172 differentially expressed proteins (DEPs) in FRDA individuals compared to controls. Some top candidate proteins, such as SORCS3, SCG5, CNDP1 and NRXN2, can potentially be used as biomarkers to help with disease diagnosis or potentially as outcome measures of treatment response.
Identified biomarkers can be utilised to screen individuals with suspected conditions and confirm a diagnosis. Acid sphingomyelinase deficiency (ASMD) and Niemann-Pick type C disease (NPC) share some clinical features and are caused by pathogenic variants in the SMPD1 gene and the NPC1/NPC2 genes, respectively. Two research groups identified the differential biochemical profiles found in controls and individuals with ASMD or NPC using MS-based lipidomics65,66. They measured various lipid biomarkers and were able to show variable elevations of those markers in individuals affected by one or other of the two conditions. These results using MS showcase its potential in disease screening and its ability to help with making a precise diagnosis.
Numerous reports have been published on the use of MS in biomarker discovery in rare diseases using various types of patient samples67,68,69,70,71,72,73,74,75,76,77,78,79,80,81. One group used MS-based proteomics to examine the DEPs found in conjunctival impression cytology samples from individuals with congenital aniridia in order to discover potential biomarkers82. In another study, MS-based proteomics was used to identify potential drug targets by characterising downstream sequelae of CAPN5 hyperactivity in individuals affected with neovascular inflammatory vitreoretinopathy83. A recent study used targeted MS-based proteomics to identify protein signatures in a small cohort of individuals with systemic sclerosis78. Another study analysed differential protein profiles in serum and urine of mucopolysaccharidosis type I sufferers80. These studies highlight the growing attention and progress in this field.
Disease monitoring
Early diagnosis of rare diseases and monitoring of disease progression are likely the most effective clinical management to offer to patients, and MS plays an important role in this area. Tay-Sachs disease (TSD) is a rare lysosomal storage disease caused by pathogenic variants in the HEXA gene, which triggers the accumulation of GM2 ganglioside in lysosomes. There are no effective treatments for TSD, and the ability to diagnose the condition and monitor its progression early is particularly crucial to patient management. A group of researchers was able to demonstrate a significant accumulation of the GM2 sphingolipid in induced pluripotent stem cells from TSD individuals compared to controls using MS-based lipidomics84. The detection of GM2 build-up using MS enabled the early identification of TSD individuals.
A 6-month-old infant with complex I deficiency and biallelic pathogenic ACAD9 variants presented with severe cardiomyopathy and lactic acidosis, unresponsive to conventional riboflavin treatment85. In this case, alternative treatments of bezafibrate and nicotinamide riboside were administered. Disease progression was monitored through MS-based proteomics. This was achieved by measuring the level of mitochondrial complex I abundance in peripheral blood mononuclear cells (PBMC) before and during treatment. The study demonstrated the utility of MS-based proteomics as a tool for tracking disease progression and evaluating treatment efficacy of therapeutic interventions.
Finally, another group used MS-based lipidomics to measure phosphatidylcholine containing very long chain fatty acid signals in cerebrospinal fluid in a small cohort of patients with X-linked adrenoleukodystrophy86. The traditional method of measuring plasma levels has been reported to be insufficient for prediction of disease severity. This group found that MS could detect some signals in plasma that were typically detected only in the CSF of patients with the condition. Another example comes from a research group that studied Hutchinson-Gilford progeria syndrome87. Endogenous levels of lamin A and progerin were quantified using MS-based proteomics. It offered an alternative solution to traditional techniques that are often limited by their specificity and quantitative accuracy.
Efficacy of mass spectrometry in rare disease functional genomics
To examine the efficacy, specificity, and usage of MS using samples from patients with a range of rare diseases, a comprehensive review of the literature was conducted25,26,27,28,29,30,32,33,35,36,37,38,39,44,52,53,54,64,65,66,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156,157,158,159,160,161,162,163,164,165,166,167,168,169,170,171,172,173,174,175,176,177,178,179,180,181,182,183,184,185,186,187,188,189,190,191,192,193,194,195,196,197,198,199,200,201,202,203,204,205,206,207,208,209,210,211,212,213,214,215. Several aspects were investigated in this review, including (1) the number of cases reported from 1993 to Jan 2024, (2) the distribution of inheritance patterns observed, (3) the types of variant zygosity reported, (4) the application of MS in the context of the studies, and (5) the variety of sample types used in MS analyses.
It is not surprising to see the uptake of MS in rare disease studies has significantly increased over the last decade (Fig. 2A). Over half of the literature reviewed featured disorders with an autosomal recessive inheritance, which is the most studied inheritance pattern within this review (Fig. 2B). This is not unexpected, given that loss of function is a common disease mechanism for most autosomal recessive conditions, and a reduction in protein abundance can potentially be measured by MS-based proteomics. The zygosity distribution of studied cases also fits with the inheritance pattern. Homozygous and compound heterozygous are two of the three most common types of zygosity reported (Fig. 2C).
A Inheritance patterns of different rare diseases that have been studied using MS over time. B The overall contribution of each inheritance pattern in MS studies of rare diseases. C The contribution of each variant zygosity reported in all surveyed publications. AD autosomal dominant, AR autosomal recessive, XL X-linked, XLD X-linked dominant, XLR X-linked recessive, mito mitochondrial.
The variant type appears to be another important factor to consider when determining the utility of MS in rare disease studies (Table 1). Nearly 40% of missense variants are expected to cause reduced expression of the protein due to protein instability and subsequent degradation216. Missense variants are by far the most common type of pathogenic variant in many rare diseases, and one of the most challenging disease mechanisms for predicting the associated functional consequences140. Functional assays, such as RT-PCR or RNAseq, measure transcript abundance. However, they are of limited value for determining the functional impact of missense variants because RNA abundance, splicing or stability are typically not affected. In contrast, MS-based proteomics is a technique that has been shown to be able to detect changes in an individual’s proteome that may be caused by missense variants135,214. Almost half of the examined reports featured rare diseases caused by missense variants either in autosomal dominant or autosomal recessive conditions. However, missense variants can also lead to changes in metabolites or lipids without changes in protein abundance. In these scenarios, MS-based metabolomics or lipidomics focused on specific analytes would be more suitable to detect the consequences of the mutated protein.
The utility of MS in rare diseases included in this review is shown in Fig. 3A. The most common use of MS in rare disease research is for orthogonal validation of variants through proteome profiling. Almost one-third of the reviewed papers reported MS being used as an orthogonal tool to validate the pathogenicity of an identified variant in suspected rare diseases. Proteome profiling can facilitate the identification of potential drug targets or biomarkers, which may subsequently be leveraged for future disease screening and therapeutic development201. The majority of the reviewed articles are MS-based proteomic studies which cover various rare diseases, and almost all NBS articles are MS-based metabolomic or lipidomic studies (Fig. 3B).
Disease-appropriate biological samples are important for establishing genotype-phenotype correlations, particularly given the tissue-specific expression of many proteins. Primary samples such as skin fibroblasts, muscle, brain, and CSF often involve invasive sample collection procedures. Historically, the most common type of material used for orthogonal testing has been patient-derived fibroblasts (Fig. 3C)217. However, one disadvantage of using fibroblasts is the time required to establish the cell line from a skin biopsy before functional tests can be performed. This makes fibroblasts not a suitable sample type for critically ill patients requiring a rapid diagnosis. Opting for other sample types, such as PBMC, can address this concern. This is supported by a recent report where ultra-rapid proteomics was carried out in under 54 hours for a critically ill infant with a suspected mitochondrial disorder. Proteomics provided functional evidence to support the pathogenicity of a homozygous intronic variant in NDUFS8 identified by ultra-rapid whole genome sequencing.
Clinical translation of MS-based proteomics
Despite numerous reports highlighting the utility of MS-based proteomics in the context of rare disease, this technique is currently restricted to research investigations. The translation of proteomics into clinical pathology laboratories would facilitate its incorporation into rare disease diagnosis and management. A recent study benchmarked MS-based proteomics for use in routine clinical practice for rare mitochondrial disorders14. The study demonstrated that a single untargeted proteomics assay outperformed traditional targeted respiratory chain enzymology, offering superior diagnostic and clinical insights. Further studies are needed to evaluate the feasibility of translating this technique into the broader spectrum of rare disease diagnosis. A recent micro-costing study highlighted the importance of prioritisation and reimbursement decisions for genomic technologies218. In addition, a micro-costing study focusing on MS-based proteomics indicated that proteomics can be performed at a similar cost to traditional functional assays in clinical laboratories219. Further studies of the health and economic implications of MS-based proteomics are now needed to support its introduction to routine clinical care.
While MS-based proteomics is a powerful tool for rare disease diagnosis, its translation into clinical settings holds several challenges. First, this assay requires initial setup with costly instruments, which require ongoing maintenance and dedicated staffing. Second, each experiment will generate vast amounts of data that are highly complex to interpret. Bioinformatic techniques are usually employed to deconvolute the data into a more transparent and standardised format for scientists to interpret. Moreover, the inherent complexity of biological systems and the diversity of sample types used as sources could introduce multiple challenges in standardising the acquired data across laboratories. Third, although MS-based proteomics has a high sensitivity for protein detection, it is still challenging to detect small proteins and low-abundant proteins. Lastly, and not unique to proteomics, tissue-specific expression of many genes means that the identification of some proteins is limited to certain sample types, making it challenging to investigate proteins in less accessible tissues.
There are no current regulations, accreditation systems (such as NATA, CLIA and ASQ), and/or standards that govern the translation of untargeted MS-based proteomics into the clinical setting. This means no consistent methods have been established relating to the acquisition, quantification and interpretation of the data collected, making it difficult to compare results between different laboratories. This also creates difficulties for clinicians aiming to use the results to guide clinical decision-making. The Human Proteome Organisation (HUPO) is a global scientific organisation dedicated to advancing proteomics through international collaboration. It promotes the development of new technologies, techniques, and training to advance the field of proteomics. In addition, international groups such as RD-Connect and the UDN network have teamed up clinicians and scientists to exchange data and explore possible causes of rare and undiagnosed disease using muti-omics. It is hoped that collaborative efforts from organisations like these will develop standardised approaches to the analysis of proteomic data, much like the ACMG have done with genomic sequencing interpretation guidelines. Standardised protocols will reduce discrepancies between different laboratories, alleviate difficulties in interpretation, and make this technology more accessible for delivery in the clinical setting.
Conclusions and future directions
In this review, we have examined the literature providing evidence for the utility of MS in rare diseases. It is evident that MS is a powerful tool in the field of rare disease research and diagnosis. We see a significant increase in the application of MS in the rare disease space, including in biomarker discovery, protein profiling in functional genomics studies, and various screening approaches of newborn and adult cohorts. Importantly, there has been a marked uptake in recent years of MS in providing functional insights into the pathogenicity of a given variant during diagnostic workup of rare diseases. There is no doubt that the importance of MS in rare disease research and diagnosis will continue to expand in the years ahead.
MS-based proteomics, though less integrated into clinical practice than targeted metabolomics, can help with the interpretation of VUSs in patients with existing genomic and clinical data. We hope to see this time-efficient and potentially cost-efficient approach increasing the diagnostic yield and improving the management of individuals affected by rare diseases. A single clinically validated high-throughput proteomics assay could revolutionise how many rare diseases are diagnosed, end the often lengthy diagnostic odyssey endured by many families, restore their reproductive confidence, and for some affected individuals, will potentially allow access to the growing range of targeted advanced genetic therapies.
Data availability
No datasets were generated or analysed during the current study.
References
Maher, S., Jjunju, F. P. M. & Taylor, S. Colloquium: 100 years of mass spectrometry: perspectives and future trends. Rev. Mod. Phys. 87, 113–135 (2015).
Griffiths, J. A brief history of mass spectrometry. Anal. Chem. 80, 5678–5683 (2008).
Parkins, W. & E, W. The uranium bomb, the calutron, and the space-charge problem. Phys. Today 58, 45–51 (2005).
Pitt, J. J. Principles and applications of liquid chromatography-mass spectrometry in clinical biochemistry. Clin. Biochem. Rev. 30, 19–34 (2009).
Aebersold, R. & Mann, M. Mass spectrometry-based proteomics. Nature 422, 198–207 (2003).
Nkuipou-Kenfack, E., Zurbig, P. & Mischak, H. The long path towards implementation of clinical proteomics: exemplified based on CKD273. Proteomics Clin. Appl. 11 (2017).
Roos, A., Thompson, R., Horvath, R., Lochmuller, H. & Sickmann, A. Intersection of proteomics and genomics to “solve the unsolved” in rare disorders such as neurodegenerative and neuromuscular diseases. Proteomics Clin. Appl. 12 (2018).
Braconi, D., Bernardini, G., Spiga, O. & Santucci, A. Leveraging proteomics in orphan disease research: pitfalls and potential. Expert Rev. Proteom. 18, 315–327 (2021).
Chantada-Vazquez, M. D. P., Bravo, S. B., Barbosa-Gouveia, S., Alvarez, J. V. & Couce, M. L. Proteomics in inherited metabolic disorders. Int. J. Mol. Sci. 23, 14744 (2022).
Suran, M. After the genome-a brief history of proteomics. JAMA 328, 1168–1169 (2022).
Hartin, S. N., Means, J. C., Alaimo, J. T. & Younger, S. T. Expediting rare disease diagnosis: a call to bridge the gap between clinical and functional genomics. Mol. Med. 26, 117 (2020).
Wojcik, M. H. et al. Genome sequencing for diagnosing rare diseases. N. Engl. J. Med. 390, 1985–1997 (2024).
Kopajtich, R. et al. Integration of proteomics with genomics and transcriptomics increases the diagnostic rate of Mendelian disorders. Preprint at medRxiv 2021.03.09.21253187 (2021).
Hock, D. H. et al. Untargeted proteomics enables ultra-rapid variant prioritization in mitochondrial and other rare diseases. Preprint at medRxiv 2024.08.06.24311318 (2024).
Cui, M., Cheng, C. & Zhang, L. High-throughput proteomics: a methodological mini-review. Lab Invest. 102, 1170–1181 (2022).
Fredriksson, S. et al. Protein detection using proximity-dependent DNA ligation assays. Nat. Biotechnol. 20, 473–477 (2002).
Rodari, M. M. et al. Insights into the expanding intestinal phenotypic spectrum of SOCS1 haploinsufficiency and therapeutic options. J. Clin. Immunol. 43, 1403–1413 (2023).
Yu, F. et al. Loss of the TRPM4 channel in humans causes immune dysregulation with defective monocyte migration. J. Allergy Clin. Immunol. 154, 792–806 (2024).
Deng, Y. T. et al. Atlas of the plasma proteome in health and disease in 53,026 adults. Cell 188, 253–271.e7 (2024).
Stark, Z. et al. A prospective evaluation of whole-exome sequencing as a first-tier molecular test in infants with suspected monogenic disorders. Genet. Med. 18, 1090–1096 (2016).
Tan, T. Y. et al. Diagnostic impact and cost-effectiveness of whole-exome sequencing for ambulant children with suspected monogenic conditions. JAMA Pediatr. 171, 855–862 (2017).
Stark, Z. et al. Australian genomics: outcomes of a 5-year national program to accelerate the integration of genomics in healthcare. Am. J. Hum. Genet. 110, 419–426 (2023).
Kremer, L. S. et al. Genetic diagnosis of Mendelian disorders via RNA sequencing. Nat. Commun. 8, 15824 (2017).
Yépez, V. A. et al. Detection of aberrant gene expression events in RNA sequencing data. Nat. Protoc. 16, 1276–1296 (2021).
Alston, C. L. et al. Bi-allelic mutations in NDUFA6 establish its role in early-onset isolated mitochondrial complex I deficiency. Am. J. Hum. Genet. 103, 592–601 (2018).
Sbardella, D. et al. Defective proteasome biogenesis into skin fibroblasts isolated from Rett syndrome subjects with MeCP2 non-sense mutations. Biochim. Biophys. Acta Mol. Basis Dis. 1866, 165793 (2020).
Frazier, A. E. et al. Fatal perinatal mitochondrial cardiac failure caused by recurrent de novo duplications in the ATAD3 locus. Med 2, 49–73 (2021).
Helman, G. et al. Multiomic analysis elucidates Complex I deficiency caused by a deep intronic variant in NDUFB10. Hum. Mutat. 42, 19–24 (2021).
Sathe, G. et al. Ethylmalonic encephalopathy ETHE1 p. D165H mutation alters the mitochondrial function in human skeletal muscle proteome. Mitochondrion 58, 64–71 (2021).
Lee, R. G. et al. Deleterious variants in CRLS1 lead to cardiolipin deficiency and cause an autosomal recessive multi-system mitochondrial disease. Hum. Mol. Genet. 31, 3597–3612 (2022).
Rahman, S. Leigh syndrome. Handb. Clin. Neurol. 194, 43–63 (2023).
Lake, N. J. et al. Biallelic mutations in MRPS34 lead to instability of the small mitoribosomal subunit and Leigh syndrome. Am. J. Hum. Genet. 101, 239–254 (2017).
Alahmad, A. et al. Bi-allelic pathogenic variants in NDUFC2 cause early-onset Leigh syndrome and stalled biogenesis of complex I. EMBO Mol. Med. 12, e12619 (2020).
Alston, C. L. et al. Pathogenic bi-allelic mutations in NDUFAF8 cause Leigh syndrome with an isolated complex I deficiency. Am. J. Hum. Genet. 106, 92–101 (2020).
Feldkirchner, S. et al. Proteomic characterization of aggregate components in an intrafamilial variable FHL1-associated myopathy. Neuromuscul. Disord. 23, 418–426 (2013).
Van Bergen, N. J. et al. Mutations in the exocyst component EXOC2 cause severe defects in human brain development. J. Exp. Med. 217, e20192040 (2020).
Foehr, R., Anderson, K., Dombrowski, O., Foehr, A. & Foehr, E. D. Dysregulation of extracellular matrix and lysyl oxidase in Ehlers-Danlos syndrome type IV skin fibroblasts. Orphanet J. Rare Dis. 19, 9 (2024).
Lunke, S. et al. Integrated multi-omics for rapid rare disease diagnosis on a national scale. Nat. Med. 29, 1681–1691 (2023).
Amarasekera, S. S. C. et al. Multi-omics identifies large mitoribosomal subunit instability caused by pathogenic MRPL39 variants as a cause of pediatric onset mitochondrial disease. Hum. Mol. Genet. 32, 2441–2454 (2023).
Bakhshalizadeh, S. et al. Deficiency of the mitochondrial ribosomal subunit, MRPL50, causes autosomal recessive syndromic premature ovarian insufficiency. Hum. Genet. 142, 879–907 (2023).
Borna, N. N. et al. Mitochondrial ribosomal protein PTCD3 mutations cause oxidative phosphorylation defects with Leigh syndrome. Neurogenetics 20, 9–25 (2019).
Van Haute, L. et al. TEFM variants impair mitochondrial transcription causing childhood-onset neurological disease. Nat. Commun. 14, 1009 (2023).
Dofash, L. N. H. et al. HMGCS1 variants cause rigid spine syndrome amenable to mevalonic acid treatment in an animal model. Brain awae371 (2024).
Rensvold, J. W. et al. Defining mitochondrial protein functions through deep multiomic profiling. Nature 606, 382–388 (2022).
Cummings, B. B. et al. Improving genetic diagnosis in Mendelian disease with transcriptome sequencing. Sci. Transl. Med. 9, eaal5209 (2017).
Ketkar, S., Burrage, L. C. & Lee, B. RNA sequencing as a diagnostic tool. JAMA 329, 85–86 (2023).
Zech, M. & Winkelmann, J. Next-generation sequencing and bioinformatics in rare movement disorders. Nat. Rev. Neurol. 20, 114–126 (2024).
Gaudet, P. et al. The neXtProt knowledgebase on human proteins: 2017 update. Nucleic Acids Res. 45, D177–d182 (2017).
Aebersold, R. et al. How many human proteoforms are there? Nat. Chem. Biol. 14, 206–214 (2018).
Apweiler, R. et al. UniProt: the universal protein knowledgebase in 2023. Nucleic Acids Res. 51, D523–D531 (2023).
Signore, M., Manganelli, V. & Hodge, A. Antibody validation by Western blotting. Methods Mol. Biol. 1606, 51–70 (2017).
Andrzejewska, Z. et al. Cystinosin is a component of the vacuolar H+-ATPase-ragulator-Rag complex controlling mammalian target of rapamycin complex 1 signaling. J. Am. Soc. Nephrol. 27, 1678–1688 (2016).
Stroud, D. A. et al. Accessory subunits are integral for assembly and function of human mitochondrial complex I. Nature 538, 123–126 (2016).
Choquet, K. et al. The leukodystrophy mutation Polr3b R103H causes homozygote mouse embryonic lethality and impairs RNA polymerase III biogenesis. Mol. Brain 12, 59 (2019).
Millington, D. S. How mass spectrometry revolutionized newborn screening. J. Mass Spectrom. Adv. Clin. Lab 32, 1–10 (2024).
Wilcken, B., Wiley, V., Hammond, J. & Carpenter, K. Screening newborns for inborn errors of metabolism by tandem mass spectrometry. N. Engl. J. Med. 348, 2304–2312 (2003).
Frazier, D. M. et al. The tandem mass spectrometry newborn screening experience in North Carolina: 1997-2005. J. Inherit. Metab. Dis. 29, 76–85 (2006).
Chang, I. J. et al. GeneReviews (1993).
Iafolla, A. K., Thompson, R. J. Jr. & Roe, C. R. Medium-chain acyl-coenzyme A dehydrogenase deficiency: clinical course in 120 affected children. J. Pediatr. 124, 409–415 (1994).
Wilcken, B. et al. Outcome of neonatal screening for medium-chain acyl-CoA dehydrogenase deficiency in Australia: a cohort study. Lancet 369, 37–42 (2007).
Mütze, U. et al. Sudden neonatal death in individuals with medium-chain acyl-coenzyme A dehydrogenase deficiency: limit of newborn screening. Eur. J. Pediatr. 181, 2415–2422 (2022).
Albersen, M. et al. Sex-specific newborn screening for X-linked adrenoleukodystrophy. J. Inherit. Metab. Dis. 46, 116–128 (2023).
Delatycki, M. B. & Bidichandani, S. I. Friedreich ataxia- pathogenesis and implications for therapies. Neurobiol. Dis. 132, 104606 (2019).
Imbault, V., Dionisi, C., Naeije, G., Communi, D. & Pandolfo, M. Cerebrospinal fluid proteomics in Friedreich ataxia reveals markers of neurodegeneration and neuroinflammation. Front. Neurosci. 16, 885313 (2022).
Kubaski, F. et al. Quantification of lysosphingomyelin and lysosphingomyelin-509 for the screening of acid sphingomyelinase deficiency. Orphanet J. Rare Dis. 17, 407 (2022).
Kuchar, L. et al. Quantitation of plasmatic lysosphingomyelin and lysosphingomyelin-509 for differential screening of Niemann-Pick A/B and C diseases. Anal. Biochem. 525, 73–77 (2017).
Kamai, T. et al. Protein profiling of blood samples from patients with hereditary leiomyomatosis and renal cell cancer by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry. Int. J. Mol. Sci. 13, 14518–14532 (2012).
Giuliano, S. et al. Proteomic analysis of lymphoblastoid cells from Nasu-Hakola patients: a step forward in our understanding of this neurodegenerative disorder. PLoS One 9, e110073 (2014).
Ruiz, M. et al. Altered glycolipid and glycerophospholipid signaling drive inflammatory cascades in adrenomyeloneuropathy. Hum. Mol. Genet. 24, 6861–6876 (2015).
Motta, M. et al. Clinical, biochemical and molecular characterization of prosaposin deficiency. Clin. Genet. 90, 220–229 (2016).
Oonk, S. et al. Comparative mass spectrometric and immunoassay-based proteome analysis in serum of Duchenne muscular dystrophy patients. Proteom. Clin. Appl. 10, 290–299 (2016).
Zmyslowska, A. et al. Serum metabolic fingerprinting identified putatively annotated Sphinganine isomer as a biomarker of Wolfram syndrome. J. Proteome Res. 16, 4000–4008 (2017).
Juneja, M. et al. PFN2 and GAMT as common molecular determinants of axonal Charcot-Marie-tooth disease. J. Neurol. Neurosurg. Psychiatry 89, 870–878 (2018).
Velez, G. & Mahajan, V. B. Molecular surgery: proteomics of a rare genetic disease gives insight into common causes of blindness. iScience 23, 101667 (2020).
Li, Z. et al. Comparative proteomics analysis for identifying the lipid metabolism related pathways in patients with Klippel-Feil syndrome. Ann. Transl. Med. 9, 255 (2021).
Burat, B. et al. Sweat proteomics in cystic fibrosis: discovering companion biomarkers for precision medicine and therapeutic development. Cells 11, 2358 (2022).
Rowan, D. J. et al. Metallothionein immunohistochemistry has high sensitivity and specificity for detection of Wilson disease. Mod. Pathol. 35, 946–955 (2022).
Szabo, I. et al. Immune profiling of patients with systemic sclerosis through targeted proteomic analysis. Int. J. Mol. Sci. 24, 17601 (2023).
van Heugten, M. H. et al. Matrix metalloproteinase-7 in urinary extracellular vesicles identifies rapid disease progression in autosomal dominant polycystic kidney disease. J. Am. Soc. Nephrol. 35, 321–334 (2023).
Liu, K. et al. Co-analysis of serum and urine differentially expressed proteins in mucopolysaccharidosis type I. J. Proteome Res. 23, 718–727 (2024).
Chatzispyrou, I. A. et al. Barth syndrome cells display widespread remodeling of mitochondrial complexes without affecting metabolic flux distribution. Biochim. Biophys. Acta Mol. Basis Dis. 1864, 3650–3658 (2018).
Stachon, T. et al. Protein profiling of conjunctival impression cytology samples of aniridia subjects. Acta Ophthalmol. 102, e635–e645 (2023).
Velez, G. et al. Proteomic insight into the pathogenesis of CAPN5-vitreoretinopathy. Sci. Rep. 9, 7608 (2019).
Vu, M. et al. Neural stem cells for disease modeling and evaluation of therapeutics for Tay-Sachs disease. Orphanet J. Rare Dis. 13, 152 (2018).
Van Hove, J. L. K. et al. ACAD9 treatment with bezafibrate and nicotinamide riboside temporarily stabilizes cardiomyopathy and lactic acidosis. Mitochondrion 78, 101905 (2024).
Fujitani, N. Detection of characteristic phosphatidylcholine containing very long chain fatty acids in cerebrospinal fluid from patients with X-linked adrenoleukodystrophy. Biol. Pharm. Bull. 45, 1725–1727 (2022).
Camafeita, E., Jorge, I., Rivera-Torres, J., Andres, V. & Vazquez, J. Quantification of farnesylated progerin in Hutchinson-Gilford progeria patient cells by mass spectrometry. Int. J. Mol. Sci. 23, 11733 (2022).
Mancini, J. et al. Mevalonic aciduria in 3 siblings: a new recognizable metabolic encephalopathy. Pediatr. Neurol. 9, 243–246 (1993).
Rahbeeni, Z. et al. Identification of two novel mutations in OCTN2 from two Saudi patients with systemic carnitine deficiency. J. Inherit. Metab. Dis. 25, 363–369 (2002).
Yu, M. et al. Proteomic analysis of the serum in patients with idiopathic pulmonary arterial hypertension. J. Zhejiang Univ. Sci. B 8, 221–227 (2007).
Schessl, J. et al. Proteomic identification of FHL1 as the protein mutated in human reducing body myopathy. J. Clin. Invest. 118, 904–912 (2008).
Kranendijk, M. et al. Development and implementation of a novel assay for L-2-hydroxyglutarate dehydrogenase (L-2-HGDH) in cell lysates: L-2-HGDH deficiency in 15 patients with L-2-hydroxyglutaric aciduria. J. Inherit. Metab. Dis. 32, 713 (2009).
Keller, L. et al. The PSEN1 I143T mutation in a Swedish family with Alzheimer’s disease: clinical report and quantification of Abeta in different brain regions. Eur. J. Hum. Genet. 18, 1202–1208 (2010).
Lee, H. A novel dup at the putative DNA polymerase alpha arrest site and a founder mutation in Chinese in the IVD gene underlie isovaleric acidaemia. Hong Kong Med. J. 16, 219–222 (2010).
Sokoro, A. A. et al. Diagnosis and high incidence of hyperornithinemia-hyperammonemia-homocitrullinemia (HHH) syndrome in northern Saskatchewan. J. Inherit. Metab. Dis. 33, S275–S281 (2010).
Reddy, G. S. & Sujatha, M. A rare case of short-chain acyl-COA dehydrogenase deficiency: the apparent rarity of the disorder results in under diagnosis. Indian J. Clin. Biochem. 26, 312–315 (2011).
Goncalves, A. et al. SAMHD1 is a nucleic-acid binding protein that is mislocalized due to aicardi-goutieres syndrome-associated mutations. Hum. Mutat. 33, 1116–1122 (2012).
Hara, Y. et al. Quantitative proteomic analysis of cultured skin fibroblast cells derived from patients with triglyceride deposit cardiomyovasculopathy. Orphanet J. Rare Dis. 8, 197 (2013).
Jang, M. Identification of PRODH mutations in Korean neonates with type I hyperprolinemia.pdf. Ann. Clin. Lab. Sci. 43, 31–36 (2013).
Rivera-Torres, J. et al. Identification of mitochondrial dysfunction in Hutchinson-Gilford progeria syndrome through use of stable isotope labeling with amino acids in cell culture. J. Proteom. 91, 466–477 (2013).
Edhager, A. V. et al. Proteomic investigation of cultivated fibroblasts from patients with mitochondrial short-chain acyl-CoA dehydrogenase deficiency. Mol. Genet. Metab. 111, 360–368 (2014).
Kuzaj, P. et al. Large-scaled metabolic profiling of human dermal fibroblasts derived from pseudoxanthoma elasticum patients and healthy controls. PLoS ONE 9, e108336 (2014).
Motta, M. et al. Gaucher disease due to saposin C deficiency is an inherited lysosomal disease caused by rapidly degraded mutant proteins. Hum. Mol. Genet. 23, 5814–5826 (2014).
Goyal, M. et al. Molecular diagnosis of infantile neuro axonal dystrophy by next generation sequencing. Indian J. Pediatr. 82, 474–477 (2015).
Rong, J. et al. A rare co-segregation-mutation in the insulin receptor substrate 1 gene in one Chinese family with ankylosing spondylitis. PLoS ONE 10, e0126348 (2015).
Bartram, M. P. et al. Three-layered proteomic characterization of a novel ACTN4 mutation unravels its pathogenic potential in FSGS. Hum. Mol. Genet. 25, 1152–1164 (2016).
Orsini, J. J. et al. Newborn screening for Krabbe disease in New York State: the first eight years’ experience. Genet. Med. 18, 239–248 (2016).
Costanzo, M. et al. Integration of proteomics and metabolomics in exploring genetic and rare metabolic diseases. Kidney Dis. 3, 66–77 (2017).
Janecke, A. R. et al. Deficiency of the sphingosine-1-phosphate lyase SGPL1 is associated with congenital nephrotic syndrome and congenital adrenal calcifications. Hum. Mutat. 38, 365–372 (2017).
Jonsson, S. et al. Identification of sequence variants influencing immunoglobulin levels. Nat. Genet. 49, 1182–1191 (2017).
Konkolova, J. et al. Severe child form of primary hyperoxaluria type 2 - a case report revealing consequence of GRHPR deficiency on metabolism. BMC Med. Genet. 18, 59 (2017).
Yan, H. M. et al. Carnitine-acylcarnitine translocase deficiency with c.199-10 T>G and novel c.1A>G mutation: Two case reports and brief literature review. Medicine 96, e8549 (2017).
Bozal-Basterra, L. et al. Truncated SALL1 impedes primary cilia function in Townes-Brocks syndrome. Am. J. Hum. Genet. 102, 249–265 (2018).
Distelmaier, F. et al. Blue diaper syndrome and PCSK1 mutations. Pediatrics 141, S501–S505 (2018).
Fitzsimons, P. E. et al. Clinical, biochemical, and genetic features of four patients with short-chain enoyl-CoA hydratase (ECHS1) deficiency. Am. J. Med. Genet. A 176, 1115–1127 (2018).
Horike, K. et al. Two novel APOA1 gene mutations in a Japanese renal transplant recipient with recurrent apolipoprotein A-I related amyloidosis. Nephrology 23, 17–21 (2018).
Marti-Sanchez, L. et al. Hypermanganesemia due to mutations in SLC39A14: further insights into Mn deposition in the central nervous system. Orphanet J. Rare Dis. 13, 28 (2018).
Mendes, M. I. et al. Bi-allelic mutations in EPRS, encoding the glutamyl-prolyl-aminoacyl-tRNA synthetase, cause a hypomyelinating leukodystrophy. Am. J. Hum. Genet. 102, 676–684 (2018).
Miller, S. et al. The vitamin D binding protein axis modifies disease severity in lymphangioleiomyomatosis. Eur. Respir. J. 52, 1800951 (2018).
Peng, W. et al. Two novel L2HGDH mutations identified in a rare Chinese family with L-2-hydroxyglutaric aciduria. BMC Med. Genet. 19, 167 (2018).
Sethi, S. et al. Apolipoprotein CII amyloidosis associated with p.Lys41Thr mutation. Kidney Int. Rep. 3, 1193–1201 (2018).
Staubach, S. et al. Autosomal tubulointerstitial kidney disease-MUC1 type: differential proteomics suggests that mutated MUC1 (insC) affects vesicular transport in renal epithelial cells. Proteomics 18, e1700456 (2018).
Zlatic, S. A. et al. Rare disease mechanisms identified by genealogical proteomics of copper homeostasis mutant pedigrees. Cell Syst. 6, 368–380.e6 (2018).
Davids, M. et al. Glycomics in rare diseases: from diagnosis tomechanism. Transl. Res. 206, 5–17 (2019).
Goos, H. et al. Gain-of-function CEBPE mutation causes noncanonical autoinflammatory inflammasomopathy. J. Allergy Clin. Immunol. 144, 1364–1376 (2019).
Grabowski, P. et al. Proteome analysis of human neutrophil granulocytes from patients with monogenic disease using data-independent acquisition. Mol. Cell Proteom. 18, 760–772 (2019).
Ni, C. et al. Novel and recurrent PHGDH and PSAT1 mutations in Chinese patients with Neu-Laxova syndrome. Eur. J. Dermatol. 29, 641–646 (2019).
Rehman, A. U. et al. Biallelic loss of function variants in PPP1R21 cause a neurodevelopmental syndrome with impaired endocytic function. Hum. Mutat. 40, 267–280 (2019).
Song, H. Y. et al. Generation of GLA-knockout human embryonic stem cell lines to model autophagic dysfunction and exosome secretion in Fabry disease-associated hypertrophic cardiomyopathy. Cells 8, 327 (2019).
Sturiale, L. et al. ALG12-CDG: novel glycophenotype insights endorse the molecular defect. Glycoconj. J. 36, 461–472 (2019).
Suriyanarayanan, S. et al. A novel variant (Asn177Asp) in SPTLC2 causing hereditary sensory autonomic neuropathy type 1C. Neuromol. Med. 21, 182–191 (2019).
Cicaloni, V. et al. A proteomics approach to further highlight the altered inflammatory condition in Rett syndrome. Arch. Biochem. Biophys. 696, 108660 (2020).
Doccini, S. et al. Proteomic and functional analyses in disease models reveal CLN5 protein involvement in mitochondrial dysfunction. Cell Death Discov. 6, 18 (2020).
Jiang, L. et al. A quantitative proteome map of the human body. Cell 183, 269–283.e19 (2020).
Johnson, B. V. et al. Partial loss of USP9X function leads to a male neurodevelopmental and behavioral disorder converging on transforming growth factor beta signaling. Biol. Psychiatry 87, 100–112 (2020).
Rotunno, M. S. et al. Cerebrospinal fluid proteomics implicates the granin family in Parkinson’s disease. Sci. Rep. 10, 2479 (2020).
Staps, P. et al. Disturbed brain ether lipid metabolism and histology in Sjogren-Larsson syndrome. J. Inherit. Metab. Dis. 43, 1265–1278 (2020).
Tebani, A. et al. A proteomics-based analysis reveals predictive biological patterns in Fabry disease. J. Clin. Med. 9, 1325 (2020).
Zhang, X. et al. Structure-function analyses of the G729R 2-oxoadipate dehydrogenase genetic variant associated with a disorder of l-lysine metabolism. J. Biol. Chem. 295, 8078–8095 (2020).
Alston, C. L., Stenton, S. L., Hudson, G., Prokisch, H. & Taylor, R. W. The genetics of mitochondrial disease: dissecting mitochondrial pathology using multi-omic pipelines. J. Pathol. 254, 430–442 (2021).
Bachetti, T. et al. The OSMR gene is involved in Hirschsprung associated enterocolitis susceptibility through an altered downstream signaling. Int. J. Mol. Sci. 22, 3831 (2021).
De Palma, A. et al. A shotgun proteomic platform for a global mapping of lymphoblastoid cells to gain insight into Nasu-Hakola disease. Int. J. Mol. Sci. 22, 9959 (2021).
Derksen, A. et al. Variants in LSM7 impair LSM complexes assembly, neurodevelopment in zebrafish and may be associated with an ultra-rare neurological disease. HGG Adv. 2, 100034 (2021).
Formosa, L. E. et al. Optic atrophy-associated TMEM126A is an assembly factor for the ND4-module of mitochondrial complex I. Proc. Natl Acad. Sci. USA 118, e2019665118 (2021).
Gupta, N. et al. Differential levels of CHMP2B, LLPH, and SLC25A51 proteins in secondary renal amyloidosis. Expert Rev. Proteom. 18, 65–73 (2021).
Hentschel, A. et al. Protein signature of human skin fibroblasts allows the study of the molecular etiology of rare neurological diseases. Orphanet J. Rare Dis. 16, 73 (2021).
Jackson, T. D. et al. The TIM22 complex mediates the import of sideroflexins and is required for efficient mitochondrial one-carbon metabolism. Mol. Biol. Cell 32, 475–491 (2021).
Jalali, H., Najafi, M., Khoshaeen, A., Mahdavi, M. R. & Mahdavi, M. First report of c.425-1G>A mutation in ornithine aminotransferase gene causing gyrate atrophy of the choroid and retina with hyperornithinemia.pdf. Eur. J. Ophthalmol. 31, NP23–NP26 (2021).
Lee, C. Q. E. et al. Coding and non-coding roles of MOCCI (C15ORF48) coordinate to regulate host inflammation and immunity. Nat. Commun. 12, 2130 (2021).
Martinez-Fierro, M. L. et al. Whole-exome sequencing, proteome landscape, and immune cell migration patterns in a clinical context of Menkes disease. Genes 12, 744 (2021).
Philippi, A. et al. Mutations and variants of ONECUT1 in diabetes. Nat. Med. 27, 1928–1940 (2021).
Sferra, A. et al. Biallelic mutations in RNF220 cause laminopathies featuring leukodystrophy, ataxia and deafness. Brain 144, 3020–3035 (2021).
Stenton, S. L. et al. Impaired complex I repair causes recessive Leber’s hereditary optic neuropathy. J. Clin. Invest. 131, e138267 (2021).
Straub, I. R., Weraarpachai, W. & Shoubridge, E. A. Multi-OMICS study of a CHCHD10 variant causing ALS demonstrates metabolic rewiring and activation of endoplasmic reticulum and mitochondrial unfolded protein responses. Hum. Mol. Genet. 30, 687–705 (2021).
Vidali, S. et al. Characterising a homozygous two-exon deletion in UQCRH: comparing human and mouse phenotypes. EMBO Mol. Med. 13, e14397 (2021).
Ziburova, J. et al. A novel homozygous mutation in the human ALG12 gene results in an aberrant profile of oligomannose N-glycans in patient’s serum. Am. J. Med. Genet. A 185, 3494–3501 (2021).
Ali Alghamdi, M. et al. Genomic, proteomic, and phenotypic spectrum of novel O-sialoglycoprotein endopeptidase variant in four affected individuals with Galloway-Mowat syndrome. Front. Genet. 13, 806190 (2022).
Bilches Medinas, D. et al. Mutation in protein disulfide isomerase A3 causes neurodevelopmental defects by disturbing endoplasmic reticulum proteostasis. EMBO J. 41, e105531 (2022).
Bongaerts, M. et al. Integration of metabolomics with genomics: Metabolic gene prioritization using metabolomics data and genomic variant (CADD) scores. Mol. Genet Metab. 136, 199–218 (2022).
Brown, S. J. et al. The proteome signatures of fibroblasts from patients with severe, intermediate and mild spinal muscular atrophy show limited overlap. Cells 11, 2624 (2022).
Bury, L. et al. A p.Arg127Gln variant in GPIbalpha LRR5 allosterically enhances affinity for VWF: a novel form of platelet-type VWD. Blood Adv. 6, 2236–2246 (2022).
Chang, X. et al. Data-independent acquisition-based quantitative proteomic analysis of m.3243A>G MELAS reveals novel potential pathogenesis and therapeutic targets. Medicine 101, e30938 (2022).
Delgadillo, D. M. et al. Differential expression of proteins in an atypical presentation of autoimmune lymphoproliferative syndrome. Int. J. Mol. Sci. 23, 5366 (2022).
Fliegauf, M. et al. Detrimental NFKB1 missense variants affecting the Rel-homology domain of p105/p50. Front. Immunol. 13, 965326 (2022).
Franzin, M. et al. A new proof of evidence of cysteamine quantification for therapeutic drug monitoring in patients with cystinosis. Orphanet J. Rare Dis. 17, 400 (2022).
Ganne, B. et al. GM2 gangliosidosis AB variant: first case of late onset and review of the literature. Neurol. Sci. 43, 6517–6527 (2022).
Gomes, A. M. et al. Potential renal damage biomarkers in Alport syndrome-a review of the literature. Int. J. Mol. Sci. 23, 7276 (2022).
Granata, S. et al. Proteomics insights into medullary sponge kidney (MSK) disease: review of the recent results of an Italian research collaborative network. Kidney Blood Press. Res. 47, 683–692 (2022).
Hertzog, A. et al. A narrative review of metabolomics in the era of “-omics”: integration into clinical practice for inborn errors of metabolism. Transl. Pediatr. 11, 1704–1716 (2022).
Jiao, K. et al. High-risk screening of late-onset Pompe disease: a different early portrait in China. Front. Neurol. 13, 965207 (2022).
Kieffer, F. et al. Combining affinity purification and mass spectrometry to define the network of the nuclear proteins interacting with the N-terminal region of FMRP. Front. Mol. Biosci. 9, 954087 (2022).
Li, H. et al. Integrated proteomic and metabolomic analyses of the mitochondrial neurodegenerative disease MELAS. Mol. Omics 18, 196–205 (2022).
Li, W. & Cologna, S. M. Mass spectrometry-based proteomics in neurodegenerative lysosomal storage disorders. Mol. Omics 18, 256–278 (2022).
Lin, Y. et al. Incorporating second-tier genetic screening for multiple acyl-CoA dehydrogenase deficiency. Clin. Chim. Acta 537, 181–187 (2022).
Liu, Y. et al. Comparing amniotic fluid mass spectrometry assays and amniocyte gene analyses for the prenatal diagnosis of methylmalonic aciduria. PLoS ONE 17, e0265766 (2022).
Liu, Y. et al. Sex differences in protein expression and their perturbations in amniotic fluid cells of down syndrome fetuses. ACS Omega 7, 35981–35992 (2022).
Ma, R. et al. Case report: a novel missense mutation c.517G>C in the UMPS gene associated with mild orotic aciduria. Front. Neurol. 13, 819116 (2022).
Magrinelli, F. et al. Detection and characterization of a de novo Alu retrotransposition event causing NKX2-1-related disorder. Mov. Disord. 38, 347–353 (2022).
Marchese, E. et al. Multi-omics studies unveil extraciliary functions of BBS10 and show metabolic aberrations underlying renal disease in Bardet-Biedl syndrome. Int. J. Mol. Sci. 23, 9420 (2022).
Menkovic, I. et al. Quantitation of a urinary profile of biomarkers in Gaucher disease type 1 patients using tandem mass spectrometry. Diagnostics 12, 1414 (2022).
Penco, F. et al. Proteomic signatures of monocytes in hereditary recurrent fevers. Front. Immunol. 13, 921253 (2022).
Ranu, N. et al. NEB mutations disrupt the super-relaxed state of myosin and remodel the muscle metabolic proteome in nemaline myopathy. Acta Neuropathol. Commun. 10, 185 (2022).
Rauscher, B. et al. Patient-derived gene and protein expression signatures of NGLY1 deficiency. J. Biochem. 171, 187–199 (2022).
Reusch, B. et al. MAGED2 controls vasopressin-induced aquaporin-2 expression in collecting duct cells. J. Proteom. 252, 104424 (2022).
Rosario, R. et al. Evidence for a fragile X messenger ribonucleoprotein 1 (FMR1) mRNA gain-of-function toxicity mechanism contributing to the pathogenesis of fragile X-associated premature ovarian insufficiency. FASEB J. 36, e22612 (2022).
Ruoppolo, M. et al. Expanded newborn screening in Italy using tandem mass spectrometry: two years of national experience. Int. J. Neonatal. Screen. 8, 47 (2022).
Shu, J. et al. Case report: a case of beta-ureidopropionase deficiency complicated with MELAS syndrome caused by UPB1 variant and mitochondrial gene variant. Front. Pediatr. 10, 838341 (2022).
Staels, F. et al. Common variable immunodeficiency in two kindreds with heterogeneous phenotypes caused by novel heterozygous NFKB1 mutations. Front. Immunol. 13, 973543 (2022).
Sun, Y. et al. Complement system deregulation in SAPHO syndrome revealed by proteomic profiling. J. Proteom. 251, 104399 (2022).
Taibi, L. et al. LC-MS/MS identification of prolidase deficiency: a rare cause of infantile hepatosplenomegaly. Clin. Chem. hvab208 (2022).
Takahashi, T. et al. Hypomyelinating spastic dyskinesia and ichthyosis caused by a homozygous splice site mutation leading to exon skipping in ELOVL1. Brain Dev. 44, 391–400 (2022).
Tian, Y. et al. Analysis of gene mutations of medium-chain acyl-coenzyme a dehydrogenase deficiency (MCADD) by next-generation sequencing in Henan, China. Clin. Chim. Acta 536, 155–161 (2022).
Xia, Z. J. et al. COG4 mutation in Saul-Wilson syndrome selectively affects secretion of proteins involved in chondrogenesis in chondrocyte-like cells. Front. Cell Dev. Biol. 10, 979096 (2022).
Yao, S. et al. Proteomic profiling reveals increased glycolysis, decreased oxidoreductase activity and fatty acid degradation in skin derived fibroblasts from LHON patients bearing m.G11778A. Biomolecules 12, 1568 (2022).
Zandl-Lang, M. et al. Changes in the cerebrospinal fluid and plasma lipidome in patients with Rett syndrome. Metabolites 12, 291 (2022).
Zaqout, S. et al. Proteome changes in autosomal recessive primary microcephaly. Ann. Hum. Genet. 87, 50–62 (2022).
Bea-Mascato, B., Gómez-Castañeda, E., Sánchez-Corrales, Y. E., Castellano, S. & Valverde, D. Loss of the centrosomal protein ALMS1 alters lipid metabolism and the regulation of extracellular matrix-related processes. Biol. Direct 18, 84 (2023).
Broadway-Stringer, S. et al. Insights into the role of a cardiomyopathy-causing genetic variant in ACTN2. Cells 12, 721 (2023).
Chen, Z. et al. Investigation of the pathogenic mechanism of ciprofloxacin in aortic aneurysm and dissection by an integrated proteomics and network pharmacology strategy. J. Clin. Med. 12, 1270 (2023).
Indelicato, E. et al. Skeletal muscle proteome analysis underpins multifaceted mitochondrial dysfunction in Friedreich’s ataxia. Front. Neurosci. 17, 1289027 (2023).
Iuraşcu, M. I. et al. Application of a dried blood spot based proteomic and genetic assay for diagnosing hereditary angioedema. Clin. Transl. Allergy 13, e12317 (2023).
Kaiyrzhanov, R. et al. Bi-allelic ACBD6 variants lead to a neurodevelopmental syndrome with progressive and complex movement disorders. Brain 147, 1436–1456 (2023).
Kreft, I. C. et al. Proteomic landscapes of inherited platelet disorders with different etiologies. J. Thromb. Haemost. 21, 359–372.e3 (2023).
Manole, A. et al. NGLY1 mutations cause protein aggregation in human neurons. Cell Rep. 42, 113466 (2023).
Nalbach, K. et al. Spatial proteomics reveals secretory pathway disturbances caused by neuropathy-associated TECPR2. Nat. Commun. 14, 870 (2023).
Qiao, W. et al. Trem2 H157Y increases soluble TREM2 production and reduces amyloid pathology. Mol. Neurodegener. 18, 8 (2023).
Rossi, G. et al. MCT1-dependent energetic failure and neuroinflammation underlie optic nerve degeneration in Wolfram syndrome mice. Elife 12, e81779 (2023).
Rossi, R. et al. A proof of principle proteomic study detects dystrophin in human plasma: implications in DMD diagnosis and clinical monitoring. Int. J. Mol. Sci. 24, 5215 (2023).
Schulte, U. et al. Mitochondrial complexome reveals quality-control pathways of protein import. Nature 614, 153–159 (2023).
Van Bergen, N. J. et al. Severe NAD(P)HX dehydratase (NAXD) neurometabolic syndrome may present in adulthood after mild head trauma. Int. J. Mol. Sci. 24, 3582 (2023).
Zhang, Z. et al. Microprotein dysregulation in the serum of patients with atrial fibrillation. J. Proteome Res. 22, 1172–1180 (2023).
Cao, J. et al. Plasma proteomics to identify drug targets and potential drugs for retinal artery occlusion: an integrated analysis in the UK Biobank. J. Proteome Res. 23, 3754–3763 (2024).
Chiu, C. et al. Skeletal muscle vulnerability in a child with Pitt-Hopkins syndrome. Skelet. Muscle 14, 15 (2024).
Tan, Z. et al. Multi-omics analyses reveal aberrant differentiation trajectory with WNT1 loss-of-function in type XV osteogenesis imperfecta. J. Bone Min. Res. 39, 1253–1267 (2024).
Tshilenge, K. T. et al. Proteomic analysis of X-linked dystonia parkinsonism disease striatal neurons reveals altered RNA metabolism and splicing. Neurobiol. Dis. 190, 106367 (2024).
Sahni, N. et al. Widespread macromolecular interaction perturbations in human genetic disorders. Cell 161, 647–660 (2015).
Plikus, M. V. et al. Fibroblasts: origins, definitions, and functions in health and disease. Cell 184, 3852–3872 (2021).
Santos-Gonzalez, F. et al. Micro-costing genomics: challenges and opportunities. Genet. Med. 27, 101310 (2024).
Santos Gonzalez, F. et al. A micro-costing study of mass-spectrometry based quantitative proteomics testing applied to the diagnostic pipeline of mitochondrial and other rare disorders. Orphanet J. Rare Dis. 19, 443 (2024).
Acknowledgements
This study was supported by the Research Training Programme Scholarship offered by the University of Melbourne, the Australian National Health and Medical Research Council (NHMRC) Investigator Fellowship (2009732 to D.A.S.) and a Principal Research Fellowship (1155244 to D.R.T.) along with funding by the Australian Genomics Health Alliance (Australian Genomics) NHMRC Targeted Call for Research grant GNT1113531. Additional support came from the Australian Medical Research Future Fund Genomics Health Futures Missions (2007959 to D.R.T., 2016030 to D.A.S., and 2007567 to J.C.) and the Mito Foundation. The research conducted at the Murdoch Children’s Research Institute was supported by the Victorian Government’s Operational Infrastructure Support Program. The Chair in Genomic Medicine awarded to J.C. is generously supported by The Royal Children’s Hospital Foundation.
Author information
Authors and Affiliations
Contributions
The manuscript was conceived, designed, and written by T.Z. Critical review and approval of the final manuscript were performed by all authors.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Zhao, T., Hock, D.H., Pitt, J. et al. Review: Utility of mass spectrometry in rare disease research and diagnosis. npj Genom. Med. 10, 29 (2025). https://doi.org/10.1038/s41525-025-00487-3
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41525-025-00487-3





