Proteomics and Human Diseases

Rosa Lippolis; Maria De Angelis

doi:10.4172/jpb.1000391

Review Article - (2016) Volume 9, Issue 3

View PDF Download PDF

Proteomics and Human Diseases

Rosa Lippolis¹^* and Maria De Angelis²: ¹Institute of Biomembranes and Bioenergetics, Italian National Research Council, (CNR) Bari, Italy; ²Department of Soil, Plant and Food Sciences, University "A Moro", Bari, Italy

^*Corresponding Author: Rosa Lippolis, Institute of Biomembranes and Bioenergetics (IBBE), National Research Council (CNR), 70124 Bari, Italy, Tel: +39 080 5448509-31, Fax: +39 080 5448538

Abstract

Proteomics, the large scale study of proteins and protein variations, contributes to a better understanding of the molecular basis of variability in susceptibility to diseases that are associated with genetic diversity and environmental factors. The development of proteomic technologies has permitted the unprecedented large-scale identification of proteins in any biological system. Human proteomics point out physiological conditions that could greatly impact on medicine. This knowledge has the potential to decode the pathogenic mechanisms underlying diseases, elucidate potential risk factors and molecular targets for drug development and therapeutic interventions and identify promising biomolecules that could be developed for diagnostic and prognostic purposes and for improved disease management strategies. Thus proteomics can translate basic scientific discoveries into the clinical practice for precision medicine. This report provides an overview of targeted proteomics in biomedical clinical science by focusing on current stateof- the-art relatively to advances in biomedical proteomic applications. Future prospective metaproteomics and proteogenomics studies are highlighted.

Keywords: Cancer proteomics, Heart proteomics, Proteomic of neurodegenerative diseases, Metaproteomics, Protogenomics

Abbreviations

PTMs: Post-Translational Modifications; DRE: Digital Rectal Examination; TRUS: Trans Rectal Ultrasound; HCC: Hepatocellular Carcinoma; 2DE: Two Dimensional Electrophoresis; MALDI-TOF: Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry; DCMP: Dilated Cardiomyopathy; PCTs: Primary Cardiac Tumors; AD: Alzheimer Disease; FTD: Frontotemporal Dementia; ALS: Amyotrophic Lateral Sclerosis; VD: Vascular Dementia; PD: Parkinson’s Disease; CJD: Creutzfeldt- Jakob Disease; SNpc: Central and Peripheral Nervous System; CSF: Cerebrospinal Fluid; GI: The Human Gastro-intestinal; IBDs: Inflammatory Bowel Diseases; CD: Crohn’s Disease; UC: Ulcerative Colitis; TCGA: Cancer Genomic Atlas; CPTAC 2: Clinical Proteomics Tumor Analysis Consortium

Introduction

During the last two decades, genome analyses have resulted in greater knowledge [1,2]. Particular interest is assigned to the genetics of human disease and specific aberrations in the cancer cells genome and other diseases have been identified. Despite tremendous advances in the understanding of the molecular basis of diseases, substantial gaps remain in the knowledge of the molecular processes involved in pathogenesis and in the development of diagnostic tools for early disease diagnosis and effective strategies for therapeutic interventions. In other words, understanding of the genome alone, i.e., without the necessary knowledge of the encoded gene products, does not permit clear extrapolations regarding the complex biological processes that occur in the cells. It is clear that there is a large gap in predicting phenotype from genotype. Proteins functionally govern cellular processes and execute activities dictated by genes to determine the cellular phenotype and the targets of natural selection. They are involved in virtually every cellular function and processes and perform all the work of biological systems. Post-translational modifications (PTMs) of proteins, such as phosphorylation or glycosylation, are very important for determining protein function. Similarly, the effects of environmental factors or multigenic processes, such as aging or disease, cannot be assessed simply by examining the genome alone. Moreover, variations in the levels of DNA or transcripts do not correlate well with the protein abundance [3]. Genomic sequencing has been a major focus in recent decades [4]; however, genomics alone cannot provide information regarding the dynamics of proteinprotein interactions, which are the key to delineating the biological mechanisms and functional consequences of the expression of the diverse proteins associated with diseases. Proteomics and the socalled ‘omics’ sciences (metabolome, expressome and interactome, etc.) are aimed at decoding the information contained in the genomic sequences in terms of protein structures and functions. The goal of proteomics is to bridge the gap between genomic information and functional proteins to lay the foundations for an integrated knowledge to understand the biology system dynamics (Figure 1). The emerging proteomic technologies [5] (Figure 2) that are capable of comprehensive measurements of gene products at a systems level [6,7] offer considerable opportunities for an improved understanding of cell biology. Proteomic analyses provide insight that enables the molecular characterization of various human diseases and the ability to decipher the molecular mechanisms dictating the initiation and progression of the disease. Disease involves alterations in protein expression levels. Hence, by studying the change in the abundance of proteins between different cellular states (normal vs. diseased states), proteomics provide insights into the pathophysiological basis of protein target identification and validation for intervention and treatment of disease. Specific biomarkers identified from proteomics may be used as protein signatures to screen new chemical entities for target organ toxicity in preclinical trials and in the development of clinical trials, ensuring their usefulness in the diagnosis and prognosis of diseases. By holistically examining the entire protein profile of cellular tissues treated with drugs or drug candidates, proteomics establishes a comprehensive protein interaction map (‘interactomes’) related to disease pathways, hence optimizing the drug development process [8]. Results of these proteomic investigations to disease will overcome some of the limitations the current diagnostic and therapeutic methods. The applications of clinical proteomics are direct especially to human cancers, cardiac and neurological diseases that are the most widely occurring pathologies in throughout the world. Brief overviews of the results obtained by proteomic applications in these biomedical areas are provided below. Metaproteomics and proteogenomics studies are also highlighted as new emergenting field of clinical research. The metaproteomics provides new insights into the importance of the gut microbiome in human health. This has established the importance of the gut microbiome in the disease pathogenesis for numerous systemic disease states, such as obesity [9] and cardiovascular disease [10], and intestinal conditions, such as inflammatory bowel disease [11] witch can be associated with an increased risk of colorectal carcinoma [12]. Protogenomic integrate and complete proteomic study by correlating genomic and proteomic data and his directed to a better understanding of the processes that underlie the onset of the disease. In this way, proteomic study of cardiovascular, neurodegenerative and cancer diseases, metagenomics and protogenomic are not so far away.

proteomics-bioinformatics-proteomics-clinical-research

Figure 1: Potential applications of proteomics in clinical research and advantages of integrating proteomic and genomic data.

proteomics-bioinformatics-proteomics-technologies-polyacrylamide

Figure 2: Schematic representation of advanced proteomics technologies.

Cancer Proteomics

Cancer is a class of complex diseases in which the cells in a specific tissue (or) organ are no longer fully responsive to the signals that regulate cellular differentiation, survival, proliferation and death. The diagnosis and classification of cancer are based on the cellular morphology and histological architecture. However, the applied methods are subjective and prone to variable interpretations, and patients with a similar histopathology, cancer staging, and treatments regimen, have demonstrated variable clinical outcomes. Therefore the diagnostic tools for cancers have evolved from histology to methods such as genomic testing and chromosome karyotype analyses. Gene expression profiles can be used as unique molecular signatures to facilitate diagnosis, classify histopathologically similar tumors into biologically distinct subtypes [13] and identify patients with a high risk for occurrence and poor survival. Current cancer research is directed at deciphering the molecular mechanisms that dictate the pathogenesis and progression of the disease by translating basic science discoveries into the clinical practice of personalized medicine [14]. By bridging genomics and cellular behavior, proteomics and functional proteomic, are important tools in cancer research; comparative proteomics approaches are used to study the (i) tumor cell biology; (ii) identify the proteins, proteins complexes and specific cellular process related to cancer states and (iii) screen each biochemical, antigenic, and epigenetic cancer biomarkers for early disease diagnosis, tumor staging, tumor subtyping and determining the prognosis and response of patient to therapeutic interventions (Table 1). A brief overview of the contribution of proteomics to cancer research is provided below.

Type of Biomarker	Use of Biomarker	Clinical Goal
Prognostic Biomarker	Risk Assessment, Prevention	To determine the cellular and molecular mechanisms of carcinogenesis in pre-neoplastic tissue
Diagnostic Biomarker	Diagnosis	To determine the presence of cancer
	Classification	To distinguish the specific type of cancer
	Screening	To recognize early-stage cancer and administer early treatment.
Predictive Biomarker	Prognosis	Helps in estimating the likely outcome of the disease
	Prediction of Treatment	Anticipate the response to specific treatment
	Risk Stratification	Used in evaluating the probability of occurrence or recurrence of cancer
	Personalized Medicine	To select the therapy with the highest probability of being effective in a particular patient.
Biomarker for Post-treatment Surveillance	Monitoring Treatment Response	Used in assessing the effectiveness and adverse effects of a treatment.
Table 1B: Biomarkers in drug development.
	Target Verification	Discovery new drug target molecules
	Early Compound Selection	Determine the most favorable compounds in terms of safety and efficacy
	Pharmacodynamics Assays	Ascertain the drug’s effect and establish a dosing regimen.
	Patient Selection for Clinical Trials	Aids in patient selection based on disease subtype
	Surrogate Endpoint in Drug Approval	Used for a quick assessment of the safety and efficacy of the therapy by using a short-term outcome (biomarker) instead of a long-term primary endpoint.

Table 1: A biomarker is a biological molecule of body fluids or tissues that is a sign of a normal or pathogenic processes, or pharmacologic responses to therapeutic treatment. Biomarkers are used as analytical tools to early diagnosis, for a rapid and comprehensive therapeutic analysis and the development and evaluation of new therapies. In the table is schematically shown the use of biomarkers for cancer medicine) and for the development of new drugs [30].

Ovarian cancer

Ovarian cancer is the most lethal gynecological malignancy due to the lack of highly sensitive and specific screening tools for detection of early-stage disease. Recent developments in the identification of potential biomarkers using proteomic technologies have resulted in improved positive predictive value. The glycoprotein antigen 125 (CA-125) is the most commonly measured tumor marker for epithelial ovarian tumors which account for 85–90% of ovarian cancers, and it is used in the clinic to provide a prognosis, monitor progression, and optimize the care of women diagnosed with ovarian cancer [15]. However, the protein levels of CA-125 in the blood have been found to be elevated in only approximately 50% of the patients [16].

The pleomorphic cytokines interleukin 6 and 8 (IL-6 and IL8) are implicated in features of tumor growth, disease progression and / or treatment [17]. Compared with healthy individuals, elevated levels of mesothelin (MSLN) [18] and vascular endothelial growth factor (VEGF) [19] are detected in the sera of patients with ovarian carcinoma. VEGF inhibition decreases tumor growth and suppresses tumor invasion and metastasis [20]. Conversely, serum levels of full-length transthyretin (TTR), apolipoprotein A1 (Apo-A1) and transferrin (TF), an iron-binding transport protein, are reduced in the serum of patients with ovarian cancer [21]. Recently, a large number of combinations of biomarkers have been investigated to improve the sensitivity and specificity for the early detection of ovarian cancer. When combined with ApoA1 and TF, CA-125 yields a sensitivity and specificity of 96% and 98%, respectively, for the detection of early-stage ovarian cancer [21]. Gorelik et al. identified a panel of biomarkers that combined CA-125 with epidermal growth factor (EGF), VEGF, IL-6 and IL-8 and achieved a sensitivity of 84% and a specificity of 95% for a total of 126 serum specimens [22]. CA-125, human epididymis protein 4 (HE4), carcinoembryonic antigen (CEA) and vascular cell adhesion molecule 1 (VCAM-1) have also been identified as a biomarker panel and tested using the sera from 139 patients with early-stage ovarian cancer, 149 patients with late-stage ovarian cancer and 1102 healthy women. This four-biomarker panel provided the highest diagnostic power of 86% sensitivity for early-stage and 93% sensitivity for late-stage ovarian cancer, both with a specificity of 98% [23]. A different panel of biomarkers, consisting of IL-8 and anti-IL-8, has been proposed by Lokshin [24]. This panel was tested using 211 serum specimens and was able to predict early ovarian cancer with 98% specificity and 65.5% sensitivity. HE4 also correlates with adenocarcinoma of the endometrium [25]; carbonic anhydrase 12 (CA-12), MSLN and HE4 have been proposed as a panel of ovarian cancer biomarkers [26]. These results, suggest that combinations of biomarkers may provide improved detection of disease as the first step of disease in a multimodal screening protocol.

Prostate cancer

Prostate cancer is one of the most commonly diagnosed malignancies and generally does not present any symptoms until it becomes locally advanced or metastatic disease; it is characterized by an unusual biological heterogeneity and demands distinctive classification [27]. Nonetheless, a greater number of patients are now diagnosed at an earlier stage due to recent improvements in advanced tools for the diagnosis of prostate cancer. Some of the most commonly used techniques to assess early stage prostate cancer are the DRE, TRUS [28] and prostate-specific antigen (PSA) blood test [29]. Current advancements in proteomics and other technologies have significantly accelerated the discovery and development of biomarkers for the diagnosis and prognosis of prostate cancer and include DNA-based markers, RNA-based biomarkers, and, particulary, protein biomarkers. Human prostatic acid phosphatase (PAP) and serum acid phosphatase (AP) were reportedly the first serum protein biomarkers for prostate cancer [30]. PSA is considered the most important test for detecting, staging, and monitoring early-stage prostate cancer [31,32]. PSA is a kallikrein protease with chymotryptic-like activity that was initially thought to be synthesized by the epithelial cells of the prostate and thus was used as a biomarker for the diagnosis and management of prostate cancer [33]. PSA was quantified in serum, and the highest levels were detected in men with prostate cancer. The main disadvantage of the PSA test is its lack of specificity; common pathological conditions such as benign prostatic hyperplasia and prostatitis can provide false-positive results [34]. Overall, 70–75% of men who undergo biopsies due to abnormal PSA levels do not have cancer. To improve specificity, several variations of the basic PSA test have been proposed, such as the free PSA ratio, which compares the amount of free PSA circulating in the blood (unbound) to the amount attached to other blood proteins [30]. Methylacyl coenzyme A racemase (AMACR) is an enzyme localized in peroxisomes that is involved in fat metabolism and has been shown to function as a growth promoter in prostate cancer, independent of androgens [35,36]. Using various experimental methods and different prostate cancer specimens, the AMACR gene has been shown to be overexpressed in prostate cancer tissue at the mRNA and protein levels, making it a highly specific tissue biomarker that is currently used to facilitate in the pathological diagnosis [37,38]. Glutathione S-trasferase (GST) is an example of a biomarker that has been extensively studied in prostate cancer, primarily as a tissue marker. GSTs are a ubiquitous family of multifunctional enzymes that conjugate reactive substrates to reduced glutathione (GSH) and are involved in detoxification [39,40]. Because cancer cells have high levels of free radicals, GST and GSH are up regulated under pathogenic conditions and play key roles in protecting cells against oxidative stress [41]. Chromogranin A (CRN-A), a member of the granin family of proteins, is an acidic protein that has been identified in all neuroendocrine cell types studied and is produced in larger amounts than other proteins secreted by the cells. The growth of prostate cells is regulated by peptides derived from CRN-A. Because it is produced and secreted by prostate cells, CRN-A has been assessed for its diagnostic and prognostic values, as a biomarker for prostate cancer [42].

Hepatocellular carcinoma

Hepatocellular carcinoma (HCC) is one of the most frequently diagnosed cancers worldwide. The majority of HCC develops in patients with a history of chronic hepatitis or cirrhosis, and the coexistence of inflammation and cirrhosis makes it much more difficult to provide an early diagnosis and prognostic assessment of HCC. Different genes have been studied with promising results, including telomerase reverse transcriptase (TERT), glypican 3 (GPC3), survivin (BIRC5), lymphatic vessel endothelial hyaluronan receptor 1 (LYVE1), 70 KDa heat shock protein (HSP70) and insulin-like growth factors 1 (IGF-1)-associated genes. In recent years the gene-expression patterns associated with hepato-carcinogenesis have enabled clinicians to reach a conclusive differential diagnosis between HCC and the various pre-neoplastic stages. Other tumor markers, including protein induced by vitamin K absence (PIVKA), fraction L3 of alpha-fetoprotein (L3–AFP) and human hepatocyte growth factor (HGF), have been suggested to be the diagnostic markers of HCC [43]. Jia et al. [44] reported a significant increase in the level of five proteins, GPC3, paternally expressed 10, (PEG10), midkine (MDK), neuroserpin (SERPINI1) and the small molecular mass ubiquinone-binding protein (QP-C) in patients with HCC compared with healthy subjects. The combined score for these proteins can be used to accurately classify noncancerous hepatic tissues (100%) and HCC (71%). Consequently, these proteins may serve as biomarkers to aid in the diagnosis of HCC. A highly significant difference was observed in the serum level of calreticulin (CARL) and protein disulfide-isomerase A3 (PDIA3) fragments between patients with HCC and healthy individuals, as well as between patients with HCC and at-risk patients [45]. Altered glycosylation of several glycoproteins has been identified in patients with HCC. Among these, Golgi protein 73 (GP73) is overexpressed in patients with HCC, compared with healthy controls. Level of GP73 decreased following surgical resection of HCC lesions and increased with tumor recurrence [46]. The sensitivity and specificity of the serum level of GP73 for the diagnosis of HCC are 74.6% and 97.4%, respectively. In addition, increased levels of GP73 have been detected in the sera of patients with HCC [47,48]. Charkady et al. [49] quantitated more than 600 proteins, among which 59 were upregulated and 92 proteins were downregulated in HCC compared with adjacent normal tissue. Some of the upregulated proteins included fibroleukin or fibrinogen-like protein 2 (FGL2), interferon induced 56-kDa protein (IFN), milk fat globule-factor 8 (MFGE8) and myeloid-associated differentiation marker (MYADM). Another study showed the differential expression of s six proteins in the SMCC-7721 human HCC cell line: mitofilin (Mt-IM), endoplasmic reticulum protein 29 (ERp29), ubiquinolcytochrome C reductase complex core protein, (UQCRC1), peroxisomal enoyl CoA hydratase (ECH2), peroxiredoxin-4 (PRDX4), and probable 3- oxoacid CoA transferase 1 precursor (OXCT1) [50]. These proteins might promote tumorigenesis by (i) inhibiting cell death induced by several cancer-related stresses and/or (ii) inhibiting apoptosis at multiple points in the apoptotic signaling pathway. Recently, Shen et al. [51] reported that the serum level of dickkopf-related protein 1 (DKK1) is a promising candidate for the diagnosis of HCC. DKK1 belongs to a family of secreted proteins that play an important role in HCC progression by promoting the cytoplasmic / nuclear accumulation of beta-catenin in HCC cells via the Wnt / beta-catenin signaling pathway [52]. Members of the heat shock proteins (HSPs) families, HSP70, HSP27 and HSP90, the nuclear heterogeneous ribonucleoproteins C1/C2 (HNRNPC) and the metastases suppressor nm23-H2 (NM323-H2) were up regulated in HCC samples [53]. Overexpression of these proteins was confirmed by western blot analysis and immunohistochemistry, suggesting their potential use as tumor markers. The HSPs are ubiquitous molecules that are induced in cells in response to various stress conditions. HSP70 and HSP27 are of special relevance in human cancer because they inhibit apoptosis. HSP70 and HSP27expression level might play an important role in hepatocarcinogenesis, and are closely related to tumor progression by promoting tumor cell proliferation. The use of a three marker panel consisting of HSP70, GPC3 and glutamine synthetase (GS), was found to increase the sensitivity and specificity (72 and 100%, respectively) [54]. Using 2-DE image analysis and mass spectrometry identification eight proteins including HSP27, l’alfa-feto proteina (AFP), alfa 1 antitrypsin (A1AT) clusterin (CLU), caeruloplasmin (CP), haptoglobin α2 chain (HP2), TF and TTR showed significantly difference in the expression among healthy, and HCC groups. The hepatoma model displayed dynamic expression of hepatic TERT during HCC development and peaked at the stage of HCC formation. Abnormal expression of telomerase in liver tissues or in the peripheral blood could be a useful marker for the diagnosis of HCC [53,55]. The recent discovery of new therapeutic targets based on the molecular pathways that are involved in carcinogenesis has led to exciting discovering for targeted treatment strategies in tumor patients. Investigators have attempted to select individualized therapeutic options for patients based on the molecular profile of their tumor. This treatment modality will pave the way for personalized treatment of HCC.

Proteomics in Heart Diseases

Heart failure arising from systemic or specific heart-muscle diseases is one of the leading causes of morbidity and mortality [56]. The pathogenesis of the cardiac dysfunction remains largely unknown. In the last decade the quantification and characterization of changes in the proteome of cardiac myocytes at the onset of disease, have provided insight regarding the holistic response of injured tissue. A number of biomarkers have been established in clinical practice and reflects the decompensated hemodynamic status, myocardial strain, acute global stress response and other organ-specific injuries [57]. The natriuretic peptides, which include B-type natriuretic peptide (BNP) and the N-terminal fragment of its prohormone (NT-proBNP), as well as atrial natriuretic peptide (ANP), adrenomedullin (ADM) and the mid-regional fragment of the prohormone (MR-proANP), are currently the most widely used markers of myocardial strain and the gold standard for diagnosing heart failure and monitoring therapy. These prohormones are released in response to hemodynamic stress and processed into biologically active natriuretic peptides that can counteract the stress by inducing vasodilation, natriuresis and diuresis [58]. Recently, proteomics has been used to study variations in the proteome of patients with dilated cardiomyopathy (DCM). This pathological state is characterized by impaired of systolic function and is one of the most frequent causes of heart failure [59]. Patients with DCM showed significantly different expression pattern of a large number of proteins. Many of these proteins are classified into three broad functional classes: cytoskeletal and myofibrillar proteins, proteins associated with stress responses and proteins associated with mitochondria and energy production [60]. The levels of pyruvate dehydrogenase E1 (PDHA1) and pyruvate oxidoreductase (PORC) have been shown to be significantly increased in patients with DCM. Significant overexpression of dihydrolipoyl dehydrogenase (DLDH) and ATP synthase has also been observed. Considered together these results lead to the concept that mitochondrial dysfunction plays a critical role in the development of DCM [61]. In addition, a good correlation has been observed between the level of ATP and the mRNA and protein expression of the elongation factor Tu (EFTU) and the translation elongation factot Tu (TUFM), respectively. All these changes correlate well with the enhanced mitochondrial biogenesis that occurs in DMC [62]. The increased levels of the antioxidant protein thioredoxin-dependent peroxide reductase (PRDX3) are also of great importance; the overexpression of PRDX3 in mitochondria protects the heart against post-myocardial infarct remodeling and failure in mice by reducing left ventricular cavity dilation, fibrosis, and apoptosis [61]. Expression of the isoelectric isoform of the HSP27 increases during hypertrophy suggesting an altered stress response [63]. The mRNA and protein levels of the ubiquitin C-terminal hydrolase (UCH) enzyme increase five- and eight-fold respectively, in patients with DCM [64]. Overall, protein ubiquitination increases five-fold in patients with DCM compared with control subjects. These findings strengthens the hypothesis that inappropriate ubiquitin conjugation leads to proteolysis and the depletion of certain proteins in the DCM heart, which may contribute to the loss of normal cellular function in the diseased heart. In addition, cardiac antigens that produce specific antibody responses have been identified in vivo. Several of these antigens may be involved in the processes of acute and chronic rejection after cardiac transplantation [65], and they are currently being investigated as potential noninvasive markers. More recently, the phosphoproteome and protein kinase signal transduction pathways have been extensively studied and characterized in the myocardium [66]. The signal perturbations were mapped in postnatal mouse hearts, and revealed that protein mitogen-activated protein kinase kinase (MKK) promoted the development of cardiomyopathy by interacting wit multiple substrates. Clarification of the mechanisms leading to energetic derangement and, experimental silencing or overexpression of a specific gene in a murine model of DCM nay result in the identification of a promising therapeutic option to restore ventricular function in patients with DMC. Thus proteomics would allow us to improve and to develop etiology-specific therapies. Recently proteomics studies have led to the characterization of different expression patterns in primary cardiac tumors (PCTs). Amog these atrial myxoma, is the most common tumor and exhibits a heterogeneous phenotype, consisting of adult cells expressing specific protein antigens that are specific to various cell lineages, often within the same tumor, including antigen specific to epithelial, endothelial, myogenic, myofibroblast, neural and neuroendocrine cell lineages [67]. The detection of candidate protein biomarkers for cardiac myxoma is a complex because of the known heterogeneity and their relatively rare incidence; moreover, the use of proteomic profiling to discover biomarkers of for myxoma is still in its infancy.

Proteomics in Neurodegenerative Disease

Neurodegenerative diseases are a particularly devastating class of disorders that are characterized by neuronal impairment that leads to neuronal cell death. Diagnosis necessitates a brain biopsy or necropsy sample, although many sporadic cases have a typical clinical picture. Treatments aimed at inhibiting the neurodegenerative processes only focus on symptom management [68] and are likely to be most effective if the treatment is initiated as early as possible. In recent years, there has been a growing interest in applying proteomics to study on clinical diagnostics and predictive medicine for neurodegenerative disorders and to discover proteins that are associated wit pathogenic mechanisms for use as disease biomarkers; more advanced proteomic studies of degenerative diseases have been published [69-71]. Differences in the levels of various proteins in patients with neurodegenerative diseases compared with healthy control subject have been reported. It is interesting that the expression levels of the nerve growth factor (VGF), a neurosecretory protein that is known to regulate metabolism [72] and synaptic plasticity, are reduced in AD [73,74]; in FTD [75] and ALS [76]. Reduced expression of this protein seems to be a general phenomenon in diseases characterized by premature neuronal cells death, and thus may be a general marker for neurodegeneration. Increased levels of the microtubule-associated tau protein (MAPT) have been reported in the CSF of patients with AD, [77], vascular dementia (VD) [78] and Parkinson’s disease (PD) [79] and they reflect the degree of neuronal degeneration and damage [80]. This phenomenon is supported by the demonstration of a marked transient increase in CSF-tau following acute stroke, with a positive correlation between the CSF-tau levels and the infarct size [80,81]. A marked increase in the levels of the 42 amino acid fragment of amyloid-β protein (Aβ42) has been found in the CSF of AD patients [82]. The accumulation of Aβ42 fragment in the brain leads to neuronal damage, which in turn triggers inflammatory responses as the brain attempts to repair itself. This process is also thought to cause the formation of neurofibrillary tangles, consisting of tau protein witch contribute to neuronal damage and cause the symptoms of dementia [83]. Total and hyper-phosphorylated tau and Aβ42 proteins have been established as candidate biomarkers for the early detection of disease before the onset of dementia [84-86]. Apolipoproteins (ApoA-I, ApoD and ApoE) are, also, involved in the pathology of neurodegenerative diseases. Recently, significantly lower levels of ApoA-I were observed in AD compared with controls serum [87]; this result is in agreement with the findings obtained for the CSF [88,89]. Increased ApoD levels have been described in AD [90,91] and more recently, in PD [92]. This protein is regarded as a potential marker of neuronal cell death [90-92]. Many studies note the importance of protein-protein interactions and PTMs in neurological diseases, including ubiquitination, glycosylation, glycation, and, particulary, phosphorylation. A proteomic analysis of the CSF revealed different abundances of two proteins, designated p130 and p131, witch are members of the 14-3-3 family, and can discriminate between Creutzfeldt-Jakob disease (CJD) and other types of dementia with a sensitivity of 88% and specificity of 99% [93]. These proteins are also highly abundant in some patients with other neurological disorders without dementia, and they can reflect neuronal damage rather than disease pathogenesis. Alterations in the expression or PTMs of a number of proteins, including parkin (PARK2) [94], tau [90] and superoxide dismutase (SOD1) [95,96], have been linked to neurodegenerative diseases, such as PD, AS, and ALS. Recent proteomic investigations of the pathogenic mechanisms underlying neurodegenerative disorders have focused mainly on AD and PD witch are the most common neurodegenerative diseases.

Proteomics in Parkinson’s disease (PD)

In 1984, Harrington and colleagues showed for the first time, evidence that the protein expression pattern in the CSF of PD patients differs from healthy controls [97]. The first unbiased profile of the human SNpc identified 44 proteins nine of witch were significantly differentially expressed in PD compared with controls tissues [98]. The L and M neurofilament chains were downregulated in PD specimens. In addition, the levels of mitochondrial complex III, ATP synthase D chain, (ATPD), complexin I (CPLX1), calcium channel delta-subunit (CCHLA2) and fatty-acid binding protein (FABPs) were significantly increased in PD samples compared with control samples [99]. Using proteomics, Werner and colleagues identified 222 differentially expressed proteins in the brains of patients with PD. This work confirmed the above results and identified other differentially expressed proteins including V-type ATPase A1 (V-ATPase A1), ferritin H (FTH), ferritin L (FTL) and mortalin (HSPA9) [100]. Current research investigating the etiology of PD has recognized that mitochondrial impairments, oxidative stress and altered calcium homeostasis are the major idiopathic factors in PD [101-104]. A panel of eight CSF proteins, brain-derived neurotrophic factor (BDNF), IL-8, vitamin D binding protein (VDBP), β2-microglobulin (B2M), APOAII and APOE, tau and Aβ1-42, have been tested as markers in different and larger sets of patients/subjects, and can classify PD patients with sensitivity and specificity values of 95% each [101]. Furthermore, the other differentially expressed proteins includes Glutathione S-Transferase (GST), Glutathione S-transferase P (GSTP1), glutathione S-transferase omega 1 (GSTO1) muscarinic 3 (M3), and SH3-binding glutamic acid-rich like protein (SH3BGRL), all of which were expressed at higher levels in patients with PD than in healthy controls. Overall PD patients showed increased levels of the oxidized DNA, proteins and lipids and decreased levels of reduced GSH. Based on the proteomic data, Jami et al. [105] hypothesized that L-3, 4-dihydroxyphenylalanine (L-DOPA), witch is normally used for PD medications, reverses some of the negative effects of H₂O₂.

In addition to the sporadic forms, of PD, there are dominant and recessive forms of familial PD that are linked to mutations in specific genes. As for autosomal dominant forms, mutations in three genes have been identified: α-synuclein (PARK1 and PARK4) UCL1 (ubiquitin carboxy-terminal hydrolase L1, PARK5) and LRRK2 (leucine-rich repeat kinase 2 PARK8). The autosomal recessive forms of Parkinsonism are linked to mutations in the parkin (PARK2, the most common) PINK1 (PARK6), and DJ-1 (PARK7) genes, and are characterized by early onset of the disease [106]. All of these mutations affect molecular processes that are involved in the pathogenesis of PD, such as mitochondrial activity (DJ-1 and PINK1), oxidative stress (α−synuclein, DJ-1), intracellular signaling (DJ-1, LRRK2) and, in particularly, protein aggregation, cytoskeleton structure–dynamics, and calcium homeostasis (parkin, α−synuclein, UCHL1) [107-111]. Some of the above proteins are emerging candidate biomarkers; α-synuclein and DJ-1 have been identified in the CSF and plasma of PD patients [112-116]. Reduced levels of α- synuclein in the CSF have been reported to be associated with increased severity of Parkinsonism in patients with PD [112-114]. DJ-1 level was found to increase in the CSF and plasma of PD patients, and correlated with the disease severity [112,114]. The glial related proteins (glial fibrillary acidic protein (GFAP), glial maturation factor beta (GMFB), galectin 1 (LGALS1) and sorcin (SRI) are differentially upregulated in PD and it has been shown to serve as biomarker of neurotoxicity [117]. The upregulationn of proteins belonging to other metabolic pathways that are not usually associated with PD have also observed, including S-adenosyl homocysteine hydrolase 1 (SAHcy), aldehyde dehydrogenase A1 (ADH1A1) and cellular retinol-binding protein 1 (CRBP1), annexin V, beta-tubulin cofactor A (TBCA) and coactosin-like protein 1(COLT1) [100]. A recent study showed that CP and Apo-H appeared to differentiate PD patients from healthy controls as well as from patients with other diseases. As an important iron transport protein CP is of great interest because it has been suggested to play a central role in PD the pathogenesis of PD [118,119]. Chromogranin B (CHGB) together with Apo-H provide increased sensitivity to differentiate PD patients from healthy controls. Although CHGB alone cannot differentiate AD or PD from controls, the ratio between CHGA and CHGB may be a correcting factor for the neuropeptides that are present in human CSF [120].

Proteomics in Alzheimer’s disease (AD)

Several proteins that are involved in inflammation and the complement system are significantly up regulated in patients with AD; chemokines and chemokine receptors are also up regulated in AD brain cells and contributed to plaque-associated inflammation and neurodegeneration processes [121]. The increased levels of CSF chemokine ligand 2 (CCL2), witch plays a significant role in the inflammatory processes in AD, correlate positively with the level of p-tau [122]. Consequently, CCL2 may serve as a potential biomarker to monitor the progression of the pathologic state of AD. The expression of the 40-kDa secreted glycoprotein YKL-40, also known as chitinase 3-like 1 (CHI3L1), which participates in a inflammation and tissue remodeling, is elevated in early AD but its physiological functions remain unclear. The ratio of YKL-40 to Aβ42 predicts cognitive impairment as well as the best CSF biomarkers (Aβ42, t-tau, and p-tau) [123], suggesting that it may be a potential biomarker for preclinical AD. Visinin-like protein-1 (VILIP-1) is a potential marker of neuronal injury. Investigation of CSF VILIP-1 as a diagnostic and prognostic marker in early AD indicates that VILIP-1 levels in the CSF can differentiate individuals with AD from cognitively normal controls and from patients with other dementias, and that CSF VILIP-1 / Aβ42 as well as tau / Aβ42 and p-tau181 / Aβ42 predicts cognitive impairment [122,123]. S100A7, a small calcium-binding protein that is involved in immune responses was recently found to be elevated in the CSF and brain of patients with AD and the levels of this protein correlated with disease severity [124] S100A7 has the potential to serve as another biomarker of AD.

Metaproteomics

The human gastro-intestinal (GI) tract contains a vast and diverse microbial ecosystem: the microbiome that has co-evolved with our species in terms of our metabolic and nutritional requirements [125,126] and is essential for human health [127,128]. The GI microbiota perform a large number of important functions that define the physiology of the host, such as immune system maturation [129] conditioning and response [130]; the intestinal response to epithelial cell injury [131]; and xenobiotic [132] and energy metabolism [133]. The microbiota of the GI tract also influences drug metabolism and toxicity [134], dietary calorific bioavailability [135], and post-surgical recovery [136]. Perturbations of the composition of the microbiota, also known as dysbiosis, have been recognized in various diseases. The microbiota have been directly implicated in the etiopathogenesis of a number of pathological states such as cardiovascular disease, [137], inflammatory bowel diseases (IBDs) [138], autism [9] and obesity [10], (Figure 3). The phylogenetic characterization of the microbiota of individuals with disease, compared with apparently healthy individuals, is the main approach to examining changes in the composition of the intestinal microbiota [139]. Systems biology approaches based on next generation ‘omics’ technologies (summarized in Figure 2), have stimulated the field of metaproteomics [139-142] and can now describe the gut microbiome at detailed genetic and functional (genomic, transcriptomic, proteomic and metabolic) levels, providing new insights into their importance in human health, and they can be used to map microbiome variability between species, individuals and populations [139]. This has led to the concept of ‘global metabolic profiling’, which provides a unique overview of the metabolic state of an individual because it is able to indirectly measure complex transgenomic co-metabolic interactions that are vital for human health, and which are often modulated by disease [143-146].

proteomics-bioinformatics-microbiota-human-microbial

Figure 3: Systemic diseases directly influenced by gut microbiota.

Along with the reconstruction of the metabolic pathways associated with the GI tract, metaproteomic approach (Table 2) provides information regarding the role of the gut microbiota in metabolic syndromes, inflammation and obesity [10]. The absence of intestinal microorganisms has been shown to protect germ-free mice against the development of obesity after being fed a high-fat, sugar-rich diet [145]. Furtemore, colonization of germ-free mice with the microbiota of obese mice induced a significantly greater increase in body fat weight compared with germ-free mice colonized with the microbiota of lean mice [10]. These experiments demonstrated that the intestinal microbiota is involved in the regulation of fat storage. Introduction of the intestinal microbiota resulted in an increase in the metabolic rate, modulation of de novo lipogenesis and an increase in the uptake of monosaccharides from the intestine [146]. Based on these results, it has been hypothesized that obese individuals convert food into usable and stored fat energy more efficiently than learn individuals [147]. There is increasing evidence that dysbiosis of the intestinal microbiota plays a role in the pathogenesis of IBD [11,148]. Chronic IBDs affect the entire intestinal tract and are characterized by discontinuous inflammation of the epithelial lining and deep ulcers. Crohn’s disease (CD) and ulcerative colitis (UC) are the most form of prevalent IBD witch can be associated with an increased risk of colorectal carcinoma [12]. Numerous studies have compared the composition of the intestinal microbiota of IBD patients with that of healthy individuals. The observed compositional and functional changes in patients with IBD suggest that the intestinal microbiota plays an important role in the etiology and pathogenesis of IBD.

➢ Comparative analysis of different microbioma strains

➢ Comparative analysis of different physiological states

➢ Characterization of sub-microbial proteomes (surface proteins, secreted proteins, and immunogenic proteins)

➢ Identification of proteins related to pathogenicity.

➢ Identification of proteins involved in host-pathogens interactions

➢ Evaluation of mechanisms of action of antimicrobials

Table 2: Targeted proteomic approaches to microbial pathogens.

However, metaproteomics studies are just emerging as there are various practical challenges including the high complexity of the microbial ecosystem, difficulties in separating human and prokaryotic proteins, and the fact that the databases have just been developed due to the advances in metagenomic sequencing. In the near future, metagenomic and metaproteomic studies will provide more insight regarding the role of the GI microbiota in diseases.

Proteogenomics

Proteogenomics integrates proteomic and genomic data and is an emerging and promising area of study in omics research [6]. The goal is to develop novel methods to integrate and visualize the proteomic and genomic data to better understand biological processes. Proteogenomics originates from advances in proteomics that have made it feasible to obtain coverage comparable to that achieved using genomics and transcriptomics and it is a direct method for effective systems biology studies. The base concept is that protein data can shed light on the consequences of various genomic features, such as the determination of whether a specific genetic variant may become a functional protein. By matching deep MS-based proteomics to a personalized database constructed from a sample-specific genome and transcriptome, thousands of peptides that would otherwise escape identification can be identified. One requirement for effective systems biology studies is sufficient proteomics coverage. In the last few years, it has become routine in some laboratories to obtain coverage in the range of 10,000 to 12,000 proteins [149,150]. Using integrated proteogenomics technologies, Munoz and colleagues identified 13,078 human and 10,637 mouse proteins, including 39,941 peptides that were not previously described in the human dataset of the Peptide Atlas and 224 novel human and 122 novel mouse peptides, which mapped to 164 and 101 genomic loci, respectively [151]. Using high resolution isoelectric focusing, they also probed a six-reading-frame translation of the human and mouse genomes and identified 98 and 52 previously undiscovered protein-coding loci, respectively [151]. Heck et al. [152], have completed a proteogenomics study of rat liver tissue, and identified 13,088 proteins by integrating whole genome sequencing, RNA-seq, and mass spectrometry-based proteomics resulting in one of the most comprehensive proteome analyses performed to date. By integrating the genomics data, they validate 1,195 gene predictions, 83 splice events, 120 proteins with nonsynonymous variants, and 20 protein isoforms with nonsynonymous RNA editing, providing a greater understanding of the RNA editing process. The effort also provided biological insights including information concerning outstanding questions regarding RNA editing-a process in which an RNA sequence is modified after it has been generated. To highlight the relationship between genomic (DNA and RNA) and proteomic (protein) abnormalities, a number of researchers, independently or as part of large-scale initiatives, have systematically identified proteins from geomically characterized tumors, such as those from TCGA program, to decipher the complex proteogenomic relationship between genomic and proteomic abnormalities and to provide a deeper understanding of cancer biology. The CPTAC program is currently analyzing more than 300 samples from colorectal, breast, and ovarian cancer. The first set of proteogenomic data was made public in September 2013 (http://proteomics.cancer.gov/). Integrative proteogenomic analyses using genomic, transcriptomic, and proteomic technologies, appears to be the future techniques of choice for understanding the genetic control of molecular dynamics and phenotypic diversity in a system-wide manner [153]. The integration and interrogation of the proteomic and genomic data will provide potential biomarker candidates, which will be prioritized for downstream-targeted proteomic analyses. These biomarker targets will be used to create quantitative multiplex assays to verify and prescreen the relevance of the targets in clinically relevant and unbiased samples. The outcomes from this approach will provide the community with verified biomarkers that can be used for clinical qualification studies; high quality and publicly accessible datasets, and analytically validated, multiplex, quantitative protein / peptide assays and their associated high quality reagents for the research and clinical communities.

Concluding Remarks

Integrative analyses of different data types can to correctly delineate the effects of genomic and transcriptomic variations on molecular processes and cellular functioning that delineate the proteomic phenotype in specific pathophysiological states. Due the potential to decode the pathogenic mechanisms underlying diseases, human proteomics greatly impact on medicine with the prospects that results of proteomic disease investigations will overcome some of the limitations the current diagnostic and therapeutic methods.

Future proteomics research, by supporting technology optimization, development, and data standardization, can successfully advancing the translation of proteomic findings into clinically relevant applications along with a rigorous assessment of the biospecimens and the generation of quality data through the implementation of quality assessment criteria at each step of biomarker development and will expedite the development of individualized medicine.

References

Citation: Lippolis R, De Angelis M (2016) Proteomics and Human Diseases. J Proteomics Bioinform 9:063-074.

Copyright: © 2016 Lippolis R, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Journal of Proteomics & BioinformaticsOpen Access

Proteomics and Human Diseases

Abstract

Abbreviations

Introduction

Cancer Proteomics

Proteomics in Heart Diseases

Proteomics in Neurodegenerative Disease

Proteomics in Parkinson’s disease (PD)

Proteomics in Alzheimer’s disease (AD)

Metaproteomics

Proteogenomics

Concluding Remarks

References

Journal of Proteomics & Bioinformatics
Open Access