Abstract
OBJECTIVE To establish a serum protein pattern model for screening pancreatic cancer.
METHODS Twenty-nine serum samples from patients with pancreatic cancer were collected before surgery, and an additional 57 serum samples from age and sex-matched individuals without cancer were used as controls. WCX magnetic beans and a PBS II-C protein chip reader (Ciphergen Biosystems Inc) were employed to detect the protein fingerprint expression of all serum samples. The resulting profiles comparing serum from cancer and normal patients were analyzed with the Biomarker Wizard system, to establish a model using the Biomarker Pattern system software. A double-blind test was used to determine the sensitivity and specificity of the model.
RESULTS A group of 4 biomarkers (relative molecular weights were 5,705 Da, 4,935 Da, 5,318 Da, 3,243 Da) were selected to set up a decision tree to produce the classification model to effectively screen pancreatic cancer patients. The results yielded a sensitivity of 100% (20/20), specificity of 97.4% (37/38). The ROC curve was 99.7%. A double-blind test used to challenge the model resulted in a sensitivity of 88.9% and a specificity of 89.5%.
CONCLUSION New serum biomarkers of pancreatic cancer have been identified. The pattern of combined markers provides a powerful and reliable diagnostic method for pancreatic cancer with high sensitivity and specificity.
keywords
Introduction
The majority of patients with pancreatic cancer die with in one year after a final diagnosis, with a surgical success rate of only 10~20%. The 5-year survival rate is only 3%~4%, a level which is similar to the rate 15 years ago[1,2]. Following a radical operation, the 5-year survival may reach 19%~41%, if the tumor is ≤2 cm, and if the diameter is ≤1.0 cm it may attain 67%[3], suggesting a remarkable difference. So an important way to increase the survival rate after pancreaticoduodenectomy and to improve patient prognosis is early diagnosis[4]. But at the present time the sensitivity and specificity of clinical diagnostic methods are low, which presents the biggest obstacle for the effective diagnosis and treatment of these patients[5-9].
Recently, it has become possible to identify new tumor markers to diagnose and monitor development of tumors as proteomics research develops[10-12]. Surface-enhanced laser desorption and ionization time of fight mass spectrometry (SELDI-TOF-MS) is a new proteomics research technique developed over the past few years[13-15]. This technique is better than 2-DIMENSIONAL electrophoresis and other mass-spectrum analysis methods, and it has been extensively applied for tumor marker screening[16,17] of prostate cancer[18-20], breast carcinoma[21] bladder cancer [22] hepatocellular carcinoma[23], etc.[24] However there have been only a few reports related to pancreatic cancer[25-29]. The purpose of our study was to screen a series of specific marker proteins, in order to establish early diagnostic criteria for pancreatic cancer patients, and to provide more sensitive and more reliable indexes by finger printing of serum proteins from patients with pancreatic cancer and healthy controls.
Materials and Methods
Patients and serum samples
Preoperative blood was collected from 29 pancreatic cancer patients (17 male and 12 female, aged 46~70 years) who had been diagnosed by histopathology analysis. The clinical characteristics of the patients are summarized in Table 1. Patients were classified as having clinical disease Stage I, II, III, or IV according to the UICC classification. Fifty-seven control blood samples were collected from sex- and age-matched individuals, including 30 healthy controls, 20 cases with chronic pancreatitis and 7 cases with pancreatic cystic tumors, as a composite control group. All sera were obtained according to a standardized sampling and coding protocol. After sample collection, the blood samples were collected in 5 mL BD Vacationers without anticoagulant, and allowed to clot at room temperature up to 1 h, followed by centrifugation at 4°C for 30 min at 3,000 rpm. The sera were frozen and stored at-80°C for future analysis. All serum samples were obtained between June 2006 and July 2007 in the department of General Surgery, The First Affiliated Hospital of China Medical University, Shenyang, Liaoning province, China. The study was approved by the ethics committee of the China Medical University, and written informed consent was obtained from all individuals from whom serum samples were collected.
Clinicopathologic characteristics of the 29 pancreatic cancers.
Serum pretreatments and magnetic bead binding
Serum samples were pretreated with WCX magnetic beads (Beijing SED Science and Technology, Ltd.). In brief, 10 μl of each serum sample was mixed with 20 μl of U9 solution (9 mol/L urea, 2% CHAPS) in a 0.5 ml centrifuge tube. After incubating for 30 min at 4°C, the samples were diluted 40 fold by adding 370 μl of binding buffer (50 mmol/L sodium acetate, pH 4.0~4.3). Then, 50 μl of WCX magnetic beads (Beijing SED Science and Technology, Ltd.) was added to a PCR tube, and the tube placed in a magnet separator for 1 min, after which the supernatant was removed carefully using a pipette. The magnetic beads were then washed twice with 100 μl of binding buffer. The diluted serum sample (100 μl) was added and mixed with the activated magnetic beads carefully by pipetting up and down several times, followed by incubation for 1 h at 4°C and washing twice with 100 μl binding buffer. Following binding and washing, the bound proteins were eluted from the magnetic beads using 10 μl of 0.5% TFA. Then 5 μl of the eluted sample was diluted 1:2 with 5 μl of SPA (50% CAN + 0.5% TFA). One μl of the resulting mixture was aspirated and spotted onto an 8 spot pre-structured sample chip (Au-chip, Ciphergen Biosystems Inc). After air-drying for approximately 5 min at room temperature, the protein crystal on the chip was examined by SELDI-TOF-MS (Ciphergen, PBS-IIc).
Detection and statistical analysis
Data were collected by averaging 80 laser shots per spot with an intensity setting of 205, and a detector sensitivity setting of 8. The spectra from all samples were processed first using Biomarker Wizard Version 3.1 as follows: selection mass 5901 peak to normalize the dimension, subtraction of the baseline, auto-detect peaks to cluster (First 5, Min Peak 10%, Cluster Mass 0.3%, and Second Pass 2), sample group statistics. The PBS-II (c) Protein Chip reader was calibrated with the ‘ALL-IN-ONE’ peptide standard (Ciphergen Biosystems). The CV was under 10% for all the selected mass peaks.
Bioinformatics and biostatistics
Fifty-eight serum samples were used to build up a classification model, based on 20 from patients with pancreatic cancer and 38 from age- and sex-matched controls, including 20 healthy controls, 14 cases of chronic pancreatitis and 4 cases of pancreatic cystic tumors, an important disease control group. The whole protein profiling spectra obtained from serum samples were first normalized with total ion current normalization, a feature of Ciphergen’s ProteinChip Software 3.1. Peak labeling was performed with the Biomarker Wizard feature of the Software. The data were processed with the t-test. A P value < 0.01 was considered statistically significant. The intensities of the selected peaks were then transferred to Biomarker Patterns Software Version 5.0 (BPS). The classification model was built up with BPS. A classification tree was set up to divide the training dataset into either the cancer group or the control group through multiple rounds of decision-making.
Blind test
Twenty-eight serum samples (patients with pancreatic cancer, chronic pancreatitis, pancreatic cystic tumors and healthy controls) were analyzed with blind prediction by using the same flowchart and procedure of data collection. The results were compared to clinical data for model evaluation.
Results
We obtained mass spectra for all the serum samples from SELDI-TOF-MS analysis using WCX magnetic beads. As depicted in Fig.1, up to 150 protein peaks per spot were detected between m/z 1,000 and m/z 200,000, showing that the SELDI technology separation of low molecular weight proteins (<10,000) was particularly effective.
Representative spectra of a plasma sample in the range of 2,000 to 10,000 m/z obtained using SELDI-TOF-MS.
The comparability among different samples showed that the serum profiles from cancer patients and control individuals were very similar in spite of a few inter sample variations. Therefore, the few variations that consistently differentiated these 2 different groups could be considered as potential disease biomarkers. Here a training serum sample group set, including 20 cases of cancer patients and 38 cases of age- and sex-matched controls were used. All protein profiling spectra were combined and normalized with total ion current normalization, a feature of Ciphergen’s ProteinChip Software 3.1. Peak labeling was performed with the Biomarker Wizard feature of the Software. The intensities of selected peaks were then transferred to BPS to build up a classification model.
There were 26 peaks from mass/charge ratio (m/z), in the 2,000~20,000 range where the protein content differences showed a statistical significance (P ≤ 0.01). Then the intensities of these peaks were transferred to the Biomarker Patterns Software Version 5.0 (BPS) to construct the classification model. Table 2 lists the most important peaks determined by BPS when the software built up the decision tree model. To set up the decision model, four peaks with m/z of 5,705 Da, 4,935 Da, 5,318 Da, 3,243 Da, respectively, were chosen (Figs.2 and 3). Based on the model, peak 5,705 was the most important peak and was allocated to Node 1. If the intensity of peak 5,705 was lower than or equal to 0.897, 31 samples went to Node 2, which may be pancreatic cancer patients. Otherwise, samples entered Terminal Node 5, which were diagnosed for health. At Node 2, samples with intensities of peak 4,935 lower than or equal to 1.734 went to node 3, while the other samples of Node 2 entered Node 4. At Node 3, if the intensity of the peak 5,318 was higher than 0.654, five cases of samples went to the left side of Terminal Node 1, which were diagnosed for pancreatic cancer patients, the other samples went to the right side of the Terminal Node 2 in the diagnosis of health. At nodes Node 4, samples with intensities of peak 3,243 lower than or equal to 0.797, one case of samples went to the left side of the Terminal Node 3, which was diagnosed for health, while the other 16 samples entered Terminal Node 4 in the diagnosis of pancreatic cancer patients. When all samples reached Terminal Nodes through this decision-making process, the model yielded a sensitivity of 100.0% (20/20) and a specificity of 97.4% (37/38) with a ROC curve of 99.7% (Fig.4).
Important protein peaks that can be used for comparison of patients with pancreatic cancer and controls selected by BPS.
Differential expression of surface-enhanced laser desorption and ionization peak m/z 5705 in the comparison of pancreatic cancer (top 4) and control (bottom 4) sera. Profile shown in peak display (top) and gel view (bottom) mode (Biomarker Wizard 3.0).
The decision tree detailed the decision-making procedure and sample distribution of the BPS model. Each square is a node, which was labeled by a node number. Blue frames represented parent nodes and red frames represented terminal nodes.
ROC curve of the decision tree model.
To test the model, another 28 serum samples (9 of pancreatic cancer patients and 19 of controls) were selected to perform the double-blind test. All samples were analyzed for the protein fingerprint with the preset method. The data from each sample were input into the previously built diagnostic model to obtain the prediction value. When the double-blind sample data set was used to challenge the model, the model predicted a sensitivity of 88.9% and a specificity of 89.5%.
Discussion
At present, the most commonly applied serum tumor marker used in the clinic for pancreatic cancer is CA19-9, but its sensitivity and specificity are not very satisfactory. False positives are particularly prevalent in obstructive jaundice, liver cirrhosis, pancreatitis and other malignant tumor patients. The sensitivity of CA19-9 for early pancreatic cancer is low, with a positive rate for detection of a tumor with a diameter of less than 2 cm being only 37.5%, so the value of serum CA19-9 in the diagnosis of early pancreatic cancer alone is limited[9]. Therefore, new early diagnostic methods with high sensitivity and specificity for pancreatic cancer are needed.
The occurrence and development of pancreatic cancer is a multi-gene, multi-step complex progress. In the view of the function of proteins as the determinants of the phonotype of cells, there must be some subtle but important changes in the level of proteins in the early steps of pancreatic cancer formation. Such changes can not be in one kind of protein, but perhaps in a few or a group of related proteins or polypeptides, or in their metabolites or soluble membrane antigens that are released into the blood[30,31]. Therefore if we apply some separate biological marker for the early diagnosis of pancreatic cancer, there will inevitably be problems with the sensitivity and specificity. Proteomics research can identify cells, tissues or the whole proteins of the body, can provide information concerning the function of a set of proteins, and can simultaneously reflect the results determined by the genetic characteristics and the external factors[15].
SELDI-TOF-MS is a new proteomics research method developed in recent years. It can rapidly analyze and compare a protein atlas expressed by a tumor and normal cells, to find some unique combination pattern of protein markers for some type of tumor, and to build a model for early cancer diagnosis. SELDI-TOF-MS reduces the reliance on a single indicator, is convenient, can directly analyze the original biological samples (blood, urine, cerebrospinal fluid, etc.), requires only a small amount of samples, and has the characteristic of high-throughput. In particular it can effectively capture small molecule proteins with low abundance, reflect the real in vivo protein and permit an overall display of protein[32].
At present, by the clinical use of SELDI technology, more results have been forthcoming in studies of early detection and early diagnosis for a variety of tumors, including prostate[18,19], breast[21] and bladder cancers[22], hepatocellular carcinoma[23], etc. In our study we use the SELDI technique, setup various parameters, and analyzed the sera from patients with pancreatic cancer and controls. We found 66 discrepancy protein peaks, in which there were 26 protein content differences showing a statistical significance (P ≤ 0.01). We classified, distinguished, analyzed and screened 4 protein peaks to produce the most excellent diagnosis, used the Bio-marker Wizard, Biomarker Patterns system software, to establish a diagnostic model. By blind test verification, the results were satisfactory, resulting in a sensitivity and specificity which were much higher than the existing kinds of tumor markers[7].
But if we only use the m/z 5,705 Da in the experimental results, the correct resolution rate was only 71.1%, which precisely illustrates that the SELDI detection technique combined with an artificial intelligence classification algorithm method to select the best effective combination of the protein peaks, rather than evaluate the diagnosis efficacy of a single protein peak. The combination we obtain is not a simple superposition of the peaks whose diagnostic weights are higher, but the optimal selection on the bases of the multi-analysis of the proteins differently expressed. So the overall diagnosis of proteomics is better than the conventional single tumor marker detection.
In order to reduce the confusion with other potential diseases, in the process of collecting specimens we strictly controlled standards, reduced biases to the greatest extent, to ensure the reliability of the experimental results. In this study our sample number was relatively limited, so the clinical significance of the results needs to be confirmed by increasing the number of cases, in particular cases of pancreatitis and pancreatic benign tumors as controls, In addition follow-up studies and verification of the clinical condition of the patients should be conducted, to reduce possible misdiagnosis or missed diagnosis. An objective evaluation for the application of SELDI technology needs to be made as well as to do research to provide new ideas and methods for the clinical molecular diagnosis of pancreatic cancer.
Acknowledgment
We thank our medical staff for help in collecting and processing blood samples and in documenting clinical data. We greatly appreciate the help from the Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, for providing access to the SELDI-TOF-MS laboratory and sharing of their expertise and for helpful comments.
Footnotes
This work was supported by a grant from the Key Project of Science and Technology Commission of Liaoning Province, China (No. 2005225003-3).
- Received January 17, 2008.
- Accepted May 19, 2008.
- Copyright © 2008 by Tianjin Medical University Cancer Institute & Hospital and Springer











