Objective: The objective of this study was to determine if protein-by-products secondary to cancer related oncogenes appear in the saliva of breast cancer patients. Methods: Three pooled (n = 10 subjects/pool) stimulated whole saliva specimens from women were analyzed. One pooled specimen was from healthy women, another pooled specimen from women diagnosed with a benign breast tumor and the other one pooled specimen was from women diagnosed with ductal carcinoma in situ (DCIS). Differential expression of proteins was measured by isotopically tagging proteins in the tumor groups and comparing them to the healthy control group. Experimentally, saliva from each of the pooled samples was trypsinized and the peptide digests labeled with the appropriate iTRAQ reagent. Labeled peptides from each of the digests were combined and analyzed by reverse phase (C18) capillary chromatography on an Applied Biosystems QStar LC-MS/MS mass spectrometer equipped with an LC-Packings HPLC. Results: The results of the salivary analyses in this population of patients yielded approximately 130 proteins in the saliva specimens. Forty-nine proteins were differentially expressed between the healthy control pool and the benign and cancer patient groups. Conclusions: The study suggests that saliva is a fluid suffused with solubilized by-products of oncogenic expression and that these proteins may be modulated secondary to DCIS. Additionally, there may be salivary protein profiles that are unique to both DCIS and fibroadenoma tumors.
Keywords: LC-MS/MS; isotope labeling; breast cancer; Saliva
Proteomics was originally defined to represent the analysis of the entire protein component of a cell or tissue ([
Protein expression and function are subject to modulation through transcription as well as through posttranscriptional and translational events. Multiple RNA species can result from one gene through a process of differential splicing. Additionally, there are more than 200 post-translation modifications that proteins could undergo that affect function, protein-protein and nuclide-protein interaction, stability, targeting half-life, and so on ([
Technological advancements have benefited proteomic research to the point where saliva is now being assayed for protein content using the latest available proteomic technology ([
As this is a newly recognized endeavor, there is but a paucity of information regarding the salivary proteome and its constituents in the presence of disease such as carcinoma. Previous studies using immunological techniques have demonstrated that saliva from breast cancer patients exhibited elevated levels of CerbB-2, CA 15-3, EGFR, cathepsin D and p53, suggesting that there is communication between the breast tumor and the salivary gland ([
The investigators protein profiled three pooled, stimulated whole saliva specimens. One specimen consisted of pooled saliva from 10 healthy subjects, another specimen was a pooled saliva specimen from 10 benign tumor patients (fibroadenomas), and the third specimen was from 10 subjects diagnosed with ductal carcinoma in situ (DCIS). Fibroadenomas were selected due to its high prevalence among benign breast tumors. DCIS was selected as this represents the minimal, detectable tumor load ([
All participating subjects were explained their participation rights and signed an IRB consent form. The saliva specimens and related patient data were non-linked and bar coded in order to protect patient confidentiality. This study was performed under the UTHSC IRB approved protocol# HSC-DB-05-0394.
Stimulated whole salivary gland secretion is based on the reflex response occurring during the mastication of a bolus of food. Usually, a standardized bolus (1 gram) of paraffin or a gum base (generously provided by the Wrigley Co., Peoria, Illinois, USA) is given to the subject to chew at a regular rate. The individual, upon sufficient accumulation of saliva in the oral cavity, expectorates periodically into a preweighed disposable plastic cup. This procedure is continued for a period of five minutes. The volume and flow rate is then recorded along with a brief description of the specimen's physical appearance ([
Recent advances in mass spectrometry, liquid chromatography, analytical software and bioinformatics have enabled the researchers to analyze complex peptide mixtures with the ability to detect proteins differing in abundance by over 8 orders of magnitude ([
Isotopic labeling of protein mixtures has proven to be a useful technique for the analysis of relative expression levels of proteins in complex protein mixtures such as plasma, saliva, urine or cell extracts. There are numerous methods that are based on isotopically labeled protein modifying reagents to label or tag proteins to determine relative or absolute concentrations in complex mixtures. The higher resolution offered by the tandem Qq-TOF mass spectrometer is ideally suited to isotopically labeled applications ([
Applied Biosystems recently introduced iTRAQ reagents ([
Briefly, the saliva samples were thawed and immediately centrifuged to remove insoluble materials. The supernatant was assayed for protein using the Bio-Rad protein assay (Hercules, California, USA), and an aliquot containing 100 μ g of each specimen was precipitated with 6 volumes of −20°C acetone. The precipitate was resuspended and treated according to the manufacturers instructions. Protein digestion and reaction with iTRAQ labels was carried out as previously described and according to the manufacturer's instructions (Applied Biosystems, Foster City, California, USA). Briefly, the acetone precipitable protein was centrifuged in a table top centrifuge at 15,000 × g for 20 minutes. The acetone supernatant was removed and the pellet resuspended in 20L dissolution buffer. The soluble fraction was denatured and disulfides reduced by incubation in the presence of 0.1% SDS and 5 mM TCEP (tris-(2-carboxyethyl)phosphine)) at 60°C for one hour. Cysteine residues were blocked by incubation at room temperature for 10 minutes with MMTS (methyl methane-thiosulfonate). Trypsin was added to the mixture to a protein:trypsin ratio of 10:1. The mixture was incubated overnight at 37°C. The protein digests were labeled by mixing with the appropriate iTRAQ reagent and incubating at room temperature for one hour. On completion of the labeling reaction, the four separate iTRAQ reaction mixtures were combined. Since there are a number of components that can interfere with the LCMSMS analysis, the labeled peptides are partially purified by a combination of strong cation exchange followed by reverse phase chromatography on preparative columns. The combined peptide mixture is diluted 10 fold with loading buffer (10 mM KH
The desalted and concentrated peptide mixtures were quantified and identified by nano-LCMS/MS on an API QSTAR XL mass spectrometer (ABS Sciex Instruments Ontario, Canada) operating in positive ion mode. The chromatographic system consists of an UltiMate nano-HPLC and FAMOS autosampler (Dionex LC Packings Sunnyvale, California, USA). Peptides were loaded on a 75 cm × 10 cm, 3 mm fused silica C18 capillary column, followed by mobile phase elution: buffer (A) 0.1% formic acid in 2% acetonitrile/98% Milli-Q water and buffer (B): 0.1% formic acid in 98% acetonitrile/2% Milli-Q water. The peptides were eluted from 2% buffer B to 30% buffer B over 180 minutes at a flow rate 220 nL/min. The LC eluent was directed to a NanoES source for ESI/MS/MS analysis. Using information-dependent acquisition, peptides were selected for collision induced dissociation (CID) by alternating between an MS (1 sec) survey scan and MS/MS (3 sec) scans. The mass spectrometer automatically chooses the top two ions for fragmentation with a 60 s dynamic exclusion time. The IDA collision energy parameters were optimized based upon the charge state and mass value of the precursor ions. Each saliva sample set there are three separate LCMSMS analyses.
The accumulated MSMS spectra are analyzed by ProQuant and ProGroup software packages (Applied Biosystems) using the SwissProt fasta database for protein identification. The ProQuant analysis was carried out with a 75% confidence cutoff with a mass deviation of 0.15 Da for the precursor and 0.1 Da for the fragment ions. The ProGroup reports were generated with a 95% confidence level for protein identification.
The Swiss-Prot database was employed for protein identification while the PathwayStudio bioinformatics software package (Ariadne Genomics, Inc., Rockville, Maryland, USA) was used to determine Venn diagrams were also constructed using the NIH software program (
Table 1 summarizes the results of the iTRAQ analysis and illustrates protein comparisons between benign vs. healthy, cancer vs. benign and cancer vs. healthy subjects. In total, 130 proteins were identified at a confidence level > 95 and 72 at > 99. Of these, there were 40 proteins that were determined to be expressed significantly different (p < 0.05) in the benign or tumor saliva compared to healthy control. Fig. 1 represents a Venn diagram of the overlapping proteins between the three groups of women.
Table 1 Comparative Protein Counts Between Healthy, Benign, and Cancer Subjects
Comparison Up Regulated Down Regulated Total Markers Benign vs. Healthy 14 9 23 Cancer vs. Healthy 20 12 32 Cancer vs. Benign 17 11 28 Totals 51 32 83
Graph: Figure 1 Represents a Venn diagram of the overlapping proteins between the three groups of women.
Table 2 represents the up (n = 14) and down (n = 9) regulated proteins for the pooled saliva sample composed of individuals diagnosed with a fibroadenoma. The fold-increase of protein and p values are also presented. As shown in Table 2, 9 of the 29 proteins were significant at the p < 0.001 to p < 0.0001 levels, and 7 proteins had a greater that 50% change in concentration.
Table 2 Benign vs. Healthy
Accession Protein Name Ratio p Value Gene ID Up-Regulated Proteins in Benign P06733 Alpha enolase 1.4204 0.0006 ENOA P04083 Annexin A1 1.6282 0.0047 ANXA1 P05109 Calgranulin A 1.9393 0.0001 S10A8 P06702 Calgranulin B 1.6297 0.0002 S10A9 Q9UBC9 Cornifin beta 2.1353 0.0000 SPRR3 P01036 Cystatin S precursor 1.2584 0.0027 CYTS P01877 Ig alpha-2 chain C region 1.2781 0.0213 IGHA2 P01871 Ig mu chain C region 1.256 0.0196 MUC P13646 Keratin, type I cytoskeletal 13 1.3184 0.0180 K1CM Q9QWL7 Keratin, type I cytoskeletal 17 2.6008 0.0018 K1CQ P04264 Keratin, type II cytoskeletal 1 1.4504 0.0002 K2C1 P48666 Keratin, type II cytoskeletal 6C 2.0979 0.0003 K2C6C Q9HC84 Mucin 5B precursor 1.4306 0.0001 MUC5B P05164 Myeloperoxidase precursor 1.8949 0.0015 PERM Down-Regulated Proteins in Benign P28325 Cystatin D precursor 0.817 0.0455 CYTD P18510 Interleukin-1 receptor antagonist protein precursor 0.7484 0.0312 IL1RA P22079 Lactoperoxidase precursor 0.7408 0.0137 PERL P80188 Neutrophil gelatinase-associated lipocalin precursor 0.7971 0.0289 NGAL P31151 S100 calcium-binding protein A7 0.4737 0.0054 S10A7 P04745 Salivary alpha-amylase precursor 0.8245 0.0023 AMYS P02787 Serotransferrin precursor 0.6968 0.0000 TRFE P02768 Serum albumin precursor 0.6922 0.0000 ALBU Q96DR5 Short palate, lung and nasal epith. Ca assoc.protein 2 0.7798 0.0170 SPLC2
Table 3 is a list of the up (n = 20) and down (n = 12) regulated proteins observed in the Stage 0 cancer saliva compared to controls. There were 15 proteins that showed a 1.5 fold increase in levels in the cancer compared to control subjects. Of these 15 differentially expressed proteins, 12 were significant at the p < 0.001 to p < 0.0001 levels. Additionally, the Table is referenced in the literature for the presence of these proteins in both blood from cancer subjects and cell supernatants from cancer cell lines. Of the 32 proteins that were up or down regulated secondary to carcinoma of the breast, 79% of these proteins were cited in the literature as being involved, molecularly, with the breast cancer ([
Table 3 Cancer vs. Healthy
Accession Number Protein Name Ratio P Value Gene ID Reported Function Blood ( Tissue ( Up-Regulated Proteins in Cancer Saliva Q9DCT1 Aldo-keto reductase 1.44 0.0264 AK1E1 Detox & reduction — 25 P04083 Annexin A1 3.06 0.0001 ANXA1 Membrane associated protein 30 25 P05109 Calgranulin A 2.18 0.0001 S10A8 Cell adhesion & communication 30 — P06702 Calgranulin B 1.87 0.0001 S10A9 Cell adhesion & communication 30 — P23280 Carbonic anhydrase VI 1.52 0.0003 CAH6 Energy/metabolism 30 27 Q9UBC9 Cornifin beta 1.82 0.0001 SPRR3 Indicator of tissue damage — — P13646 Cytokeratin 13 6.56 0.0001 K1CM Intracytoplasmatic cytoskeleton protein 30 — P19013 Cytokeratin 4 6.50 0.0019 K2C4 Intracytoplasmatic cytoskeleton protein 30 25 P48666 Cytokeratin 6C 4.41 0.0001 K2C6C Intracytoplasmatic cytoskeleton protein — — P01040 Cystatin Ad 2.00 0.0014 CYTA Protein degradation & inhibitor 30 25 P01036 Cystatin SA-III 1.20 0.0115 CYTS Protein degradation & inhibitor — — Q01469 Epid. Fatty acid-binding prot. 2.1 0.0362 FABP4 Protein with binding functions 30 — P01857 Ig gamma-1 chain C region 1.44 0.0034 IGHG1 Immunoresponse — — P01871 Ig mu chain C region 1.51 0.0011 MUC Immunoresponse — — P06870 Kallikrein 1 precursor 1.23 0.0425 KLK1 Serine protease P02788 Lactoferrin 1.58 0.0001 TRFL Inhibits G1 CDK's, mod. NK activity 30 26 Q9HC84 Mucin 5B 1.68 0.0001 MUC5B Cell adhesion & communication — 36 P05164 Myeloperoxidase precursor 2.72 0.0005 PERM Defense Immunoresponse 30 — P31151 S100 calcium-binding protein 2.05 0.0001 S100P Calcium binding protein 30 25 P31025 Von Ebner's gland protein (lipocalin) 1.26 0.0043 VEGP Inflamation — 25 Down–Regulated Proteins in Cancer Saliva Q8N4F0 Bact. Perm.-increasing prot.-1 0.80 0.0004 BPIL1 Transport 30 — P04264 Cytokeratin 1 0.61 0.0001 K2C1 Intracytoplasmatic cytoskeleton protein — 25 P01034 Cystatin C 0.72 0.0187 CYTC Inhibitor of cysteine proteases — 31 P28325 Cystatin D precursor 0.68 0.001 CYTD Protein degradation & inhibitor — — P00738 Haptoglobin 0.83 0.0023 HPT Indicator of tissue damage and necrosis 30, 34 — P22079 Lactoperoxidase 0.82 0.0388 PERL Transport — 33 P01833 Poly-IG receptor protein 0.86 0.0234 PIGR Immunoresponse P07737 Profilin-1 0.68 0.0135 PROF1 Cytoskeleton associated — 25 P02768 Serum albumin precursor 0.73 0.0001 ALBU Transport 30 27 Q96DR5 Short palate, lung and nasal epith. carc. assoc. protein 2 0.61 0.0001 SPLC2 Immune response & detox. — 32 P02787 Transferrin 0.72 0.0001 TRFE Surface antigen assoc. with growth 34 — P25311 Zinc-alpha-2-glycoprotein 0.84 0.0009 ZA2G Signalling — 29
Graph: Figure 2 Represents protein functions.
A comparison of the differentially expressed proteins is shown in graphical form in Fig. 3. In this Figure, the log of the ratios for benign vs. control and cancer vs. control is plotted for each of the proteins. The error bars are the log of the error factor calculated by the ProQuant software. This comparison illustrates that there are a number of significant differences in the expression levels of several proteins between the cancer and benign saliva samples. A direct comparison of protein expression ratios between the benign and cancer pooled specimens that exhibited overlap or commonality among the proteins is shown in Table 4. Among the comparison of the overlapping proteins, 10 fell at or below a p value of 0.001, and 9 proteins were greater than 50% difference in the cancer compared to benign. These data are replotted in the log form in Figure 4.
Table 4 Cancer vs. Benign
Accession Protein name Ratio p Value Gene ID Up-Regulated Proteins in Cancer Q9HC84 Mucin 5B precursor 1.16 0.0046 MUC5B P80188 lipocalin precursor 1.18 0.0443 NGAL P01871 Ig mu chain C region 1.19 0.0231 MUC P04745 Salivary alpha-amylase precursor 1.19 0.001 AMYS P23280 Carbonic anhydrase VI precursor 1.29 0.0293 CAH6 P02788 Lactotransferrin precursor 1.30 0.0113 TRFL P05164 Myeloperoxidase precursor 1.41 0.0487 PERM P01857 Ig gamma-1 chain C region 1.43 0.0004 IGHG1 Q9DCT1 Aldo-keto reductase 1.47 0.018 AK1E1 P31025 Von Ebner's gland protein 1.55 0.0005 VEGP P04083 Annexin A1 1.86 0 ANXA1 P10599 Thioredoxin 1.93 0.03 THIO P01040 Cystatin A 1.95 0.0056 CYTA P48666 Keratin, type II cytoskeletal 6C 2.07 0 K2C6C P31151 S100 calcium-binding protein A7 4.08 0.0005 S10A7 P13646 Keratin, type I cytoskeletal 13 4.76 0 K1CM P19013 Keratin, type II cytoskeletal 4 5.59 0.0013 K2C4 Down-Regulated Proteins in Cancer P04264 Keratin, type II cytoskeletal 1 0.42 0 K2C1 Q9QWL7 Keratin, type I cytoskeletal 17 0.58 0.001 K1CQ P01034 Cystatin C precursor 0.67 0.0246 CYTC P13796 L-plastin 0.69 0.0145 PLSL P06733 Alpha enolase 0.73 0.0106 ENOA P01833 Polymeric-Ig receptor 0.76 0.0002 PIGR P12273 Prolactin-inducible protein precursor 0.77 0.0012 PIP Q96DR5 Short palate, lung and nasal epithelium carcinoma associated protein 2 precursor 0.78 0.0219 SPLC2 P07477 Trypsin I precursor 0.82 0.0196 TRY1 P28325 Cystatin D precursor 0.83 0.0155 CYTD Q9UBC9 Small proline-rich protein 3 0.84 0.0228 SPRR3
Graph: Figure 3 Differential protein expression in cancer saliva or benign saliva versus normal control.
To the best of our knowledge, this is the first attempt to determine numerous cancer related proteins in saliva. As a consequence, we have only a few references by which to compare our data.
With respect to the overall analyses, the total number of salivary proteins reported in healthy individuals at the 95% confidence levels was 130. In comparing our findings to other studies, which used 2D gel and mass spectrometry, Wilmarth reported 102 proteins ([
Table 3 and Table 4 list the proteins for the healthy pool vs. benign tumor pool and healthy pool vs. cancer tumor pool respectively. As illustrated in Table 4, many of these proteins have been reported as have been either up or down regulated in blood and cancer tissue. There is also an overlap of 13 up regulated proteins and five down regulated proteins between the protein profiles leaving the benign group with five proteins that are unique to fibroadenomas (ENOA, IGHA2, IL-1ra, S10A7, SPLC2) and 11 proteins unique to DCIS (CAH6, K2C4, CYTA, FABP4, IGHGI, TRFL, BPIL1, CYTC, HPT, PROF1, ZA2G). Figure 1 represents a Venn diagram of the overlapping proteins between the three groups of women. For reasons which the authors cannot explain, two-thirds of the total "overlap" proteins were up-regulated. One could speculate that a portion of the up-regulated proteins that exhibited overlap can be associated with pathways which are common to both disorders. This would include the proteins associated with cytoskeleton and cell growth. The investigators targeted the benign tumor in order to increase the specificity of the panel of markers. If there are markers specific to a benign tumor and specific only to the malignancy, then probability of making the correct clinical assessment is further increased.
Table 4 represents a comparison of the benign and cancer proteins that overlapped each group of pooled subjects. In this comparison, only seven proteins remained significantly different in the presence of carcinoma (p < 0.005). This would include the proteins associated with exocytosis, cytoskeleton and immuno-response. As the cell proliferation process is further enhanced in the presence of carcinoma, it stands to reason that these proteins should be significantly up-regulated in the presence of carcinoma. Fig. 3 and Fig. 4 provide further illustration of the protein comparisons.
Graph: Figure 4 Differential protein expression in cancer versus benign saliva.
The authors cannot explain the mechanism by which these proteins are altered in the presence of carcinoma of the breast. The authors, in single analyte reports, have found additional low abundance proteins, such as HER2/neu, Waf-1, pantropic p53, EGFR and cathepsin D, to be altered in the presence in addition to those cited in this manuscript ([
The authors have examined the salivary proteome that is altered in the presence of carcinoma of the breast; however, further study is required to determine the sensitivity and specificity of these proteins with respect to their diagnostic utility. We do not want to over emphasize the findings at this point, but we are encouraged to find that these protein dispositions have also been found to be altered in blood, cancer tissues and nipple aspirates, which provide further support of our findings.
The authors further encourage the exploration of saliva as a diagnostic media for several reasons. The fluid contains numerous proteins and protein fragments, which may have analytical value. The salivary proteome is in its infancy. Additionally, saliva can also be described as a media which provides "real time" results ([
By Charles F. Streckfus; Otilia Mayorga-Wark; Daniel Arreola; Cynthia Edwards; Lenora Bigler and William P. Dubinsky
Reported by Author; Author; Author; Author; Author; Author