ISSN: 0974-276X
Research Article - (2011) Volume 4, Issue 12
Thyroid cancer is one of the major types of cancers worldwide. A large amount of effort has been put in order to find the genetic basis of this cancer. Many genes have been reported before out of which claudin family is one of the example. The claudin protein family consists of more than 20 members which are made up of key structural rudiments inside the tight junction. The association of claudin-1 with the thyroid cancer has already been predicted experimentally. In order to investigate the genetic reason of human thyroid cancer computationally, the systematic analysis of claudin gene family has been carried out. To fulfill this task the bioinformatics methodologies are combined for the assessment of claudin gene family expressions. Results obtained showed and verified the association of CLDN1 member with the thyroid cancer.
Keywords: PTC: Papillary Thyroid Carcinoma, SAGE: Serial Analysis of Gene Expression, TPM: Tags per Million, B-RAF: v-Raf murine sarcoma viral oncogene homolog B1, RAS Rat Sarcoma.
Thyroid cancer is the most common tumor of endocrine system. It has three major subtypes out of which papillary thyroid carcinoma accounts as the major type. The molecular genetics analysis of thyroid cancer has revealed the presence of B-RAF and RAS point mutations [1]
Papillary thyroid carcinoma is well differentiated thyroid cancer. Experimentally it has been shown that constitutive activation of MAP (Mitogen Activating Protein) kinase effectors play a major role in causing papillary thyroid carcinoma. In addition to that the activating mutations of the tyrosine receptor kinases RET and NTRK and point mutations of the intracellular signaling effectors RAS and BRAF are found to be associated with this. The chromosomal recombination is the cause of RET activation that ultimately fall out with a fusion protein made up of the intracellular tyrosine kinase domain of RET coupled to the N-terminal fragment of a heterologous gene, giving rise to the RET/PTC oncoproteins [2]. In comparison with the childhood and adult onest papillary thyroid carcinoma it has been reported that in childhood papillary thyroid cancer, despite of history of radiation exposure, RET/PTC rearrangements are a key event, while in adultonset papillary thyroid cancer, the most common molecular event is BRAF point mutation, instead of the RET/PTC rearrangements [3].
The previous analysis on BRAF gene shows that BRAF mutation may be a key genetic factor for the metastatic progression of papillary thyroid carcinoma. In addition to that it has also been reported that this gene mutation is a major risk factor for loco regional lymph node metastasis and has potential utility as a surrogate marker [4]. As 80% of all the thyroid cancers diagnosed are of papillary carcinoma type ,this study will mainly focus on this sub type of human thyroid cancer.
The claudin protein family consists of more than 20 members which are made up of key structural rudiments inside the tight junction [5]. Tight Junctions have the capability to maintain the cell polarity which may regulate cell proliferation and differentiation [6]. The claudin (CLDN) genes encode a family of proteins which play a major role in the configuration of tight junction as well as associated with the proper functioning of these junctions. In the past decades, it has become evident that CLDN gene expression is commonly mutated in several human cancers [7,8].
There is a likelihood that along with the activation of MAP kinase effectors, there are other gene family members as well that might be involved in the pathogenesis of papillary thyroid cancer. For that purpose a high throughput in silico analysis has been made to systematically evaluate the gene expression of claudin gene family members. It has been evident that mutations in claudin are associated with various human cancers. For this the whole claudin family is systematically analyzed to check its association with human thyroid cancer. The wet lab analysis has previously been done which shows the involvement of claudin-1 as a genetic change in causing papillary thyroid cancer [9-11].
With the help of this gene finder tool a particular gene or more than one gene can be found. This tool is available at CGAP http://cgap.nci.nih.gov/Genes/GeneFinder. By selecting the papillary thyroid cancer and claudin gene family as search criteria the tool will provide the list of claudin gene family members which are thought to be involved in papillary thyroid cancer.
SAGE and Virtual Northern Blot Analysis
SAGE is a web based tool that is freely available at NCBI and on CGAP database. SAGE provides the quantitative and concurrent analysis of a large number of transcripts and also makes available applicable means for the evaluation of expressed genes in different tissue states. SAGE measures the number of tags that contains the adequate information to distinctively identify a transcript. There are two SAGE libraries available, one for normal tissue and one for papillary thyroid tissue. The chosen tumor tissue was follicular variant as its library is available at SAGE genie [11]. SAGE Anatomic Viewer which is available at http://cgap.nci.nih.gov/SAGE/AnatomicViewer [12] was used for obtaining the tags. The reliable tags obtained from SAGE will then mapped to the unigene clusters. http://www.ncbi.nlm.nih.gov/unigene/. The authentic unigene clusters which were matched to SAGE tags were then picked. These tags were then used for the analysis of gene expression in normal and malignant tissue.
Microarray analysis
The two microarray datasets GDS1732 and GDS1665 available at GEO website is taken http://www.ncbi.nlm.nih.gov/geo/ [13-16].The first data set contains the expression profiling of 7 classical papillary thyroid carcinoma (PTC) samples and 7 paired normal thyroid tissue samples (normal group) while the second data set contains classical papillary thyroid carcinoma (PTC) tumors from 9 patients and 9 control samples. This analysis includes the statistical parameter t-test, and the clustering parameters to confirm the clauidin1 association with papillary thyroid cancer.
T-Test: It separates the significant and non-significant genes on the basis of P-value .The P-value less than 0.005 (p<0.005) are considered to be noteworthy. Further volcano plot marked the position of the susceptible gene as either significant or non-significant.
Clustering parameters: The genes which are found to be significant from the T-test analysis are further employed to clustering parameters. These parameters include the three algorithms which are Hierarchical clustering algorithm, K-Means clustering and Self Organizing Tree Algorithm.
Gene finder
The gene finder tool categorized those family members which are more recurrently found to be linked with the papillary thyroid cancer on the basis of evaluation of gene expression between normal and malignant thyroid tissue. The gene members found by the gene finder tool are then chosen for further analysis. It is summarized in Figure 1.
SAGE and virtual northern blot analysis
The reliable tags of 7 claudin genes were extracted as recognized by the gene finder tool. The Tags per million counts (TPM) and the fold change was computed for each gene respectively. The results are summarized in Table 1.
Gene ID | SAGE Anatomic Viewer | Virtual Northern Blot Analysis | ||||
---|---|---|---|---|---|---|
Sequence ID | SAGE tags | Unigene Clusters | Normal (TPM) | Cancer (TPM) | Fold Change/Fold Down | |
CLDN1 | NM_021101 | TGGATATCAG | Hs.439060 | 12.075 | 43.31 | 3.58 |
CLDND1 | NM_001040199 | AGCAAGAAAC | Hs.531371 | 8 | 1 | -8 /8 |
CLDND2 | NM_152353 | CAGAGCACCG | Hs.720536 | 0 | 0 | 0 |
CLDN8 | NM_199328 | GATAGGGTTG | Hs.162209 | 1 | 0 | 0 |
CLDN4 | NM_001305 | ATCGTGGCGG | Hs.647036 | 20 | 6 | -3.33/3.33 |
CLDN3 | NM_001306 | CTCGCGCTGG | Hs.647023 | 156 | 28 | -5.58/5.58 |
CLDN15 | NM_001185080 | GCCCCTCCAG | Hs.38738 | 1 | 1 | 1 |
Table 1: SAGE and Virtual Northern Blot Analysis.
Microarray analysis of claudin gene family
For the Micro array analysis the comparison is carried out between the two phenotypes in both the data sets. The two data sets GDS1732 and GDS1665 available at GEO website is taken. The Table 2 listed down the results obtained from microarray analysis which clearly shows CLDN1 as the most significant gene according to the two fold change difference. Further the results obtained from t-test and clustering parameters also confirmed the above mentioned outcomes (Table 3, Figure 2, 3, 4 and 5 respectively).
GDS1732 | GDS1665 | ||||||
---|---|---|---|---|---|---|---|
Gene ID | Normal | Cancer | Fold Change/Fold Down | Gene ID | Normal | Cancer | Fold Change/Fold Down |
CLDN1 | 188.83 | 654.2 | 3.46 | CLDN1 | 6.22 | 16.89 | 2.71 |
CLDND1 | 892.802 | 664.034 | -1.345/1.345 | CLDND1 | 8.81 | 8.1 | -1.08/1.08 |
CLND8 | 990.2 | 696.85 | -1.422/1.422 | CLND8 | 8.83 | 9 | 1.01 |
CLDND2 | 190.36 | 134.018 | -1.42/1.42 | CLDND2 | 6.51 | 6.47 | -1.01/1.01 |
CLDN4 | 123.791 | 150.692 | 1.21 | CLDN4 | 7.7 | 7.8 | -1.01/1.01 |
CLDN3 | 264.2 | 470.13 | 1.779 | CLDN3 | 10.1 | 9.83 | -1.02/1.02 |
CLDN15 | 0.0 | 175.615 | 0 | CLDN15 | 7.01 | 6.89 | -1.02/1.02 |
Table 2: Microarray Analysis in normal and papillary thyroid tissue cells.
Probe ID | Gene Symbol | P-value | Expression Level | |
---|---|---|---|---|
218182_s_at | CLDN1 | 0.001 | 0.003 | High (Up Regulated) |
Table 3: Significant P-values in both the datasets.
In the past few years a large amount of effort has been done in finding the genetic cause of thyroid cancer. All the practice done was based on experimental analysis. These experimental technologies had revealed some gene family members which are deemed to be associated with the human thyroid cancer.
In this study instead of experimental analysis the in silico analysis approach has been adopted for the assessment of gene expression analysis in human thyroid carcinoma. To implement this approach practically the claudin gene family members has been acquired. Up to our information for the very first time, the systematic investigation of claudin family members has been carried out for gene expression analysis in papillary thyroid cancer a major type of thyroid carcinoma. The methodology adopted will help in scanning the gene expression patterns in less time and money. The results obtained from SAGE and virtual Northern Blot analysis showed clearly that out of these 7 genes only CLDN1 is found to be over expressed by taking the fold change value > 2 fold as significant. The other genes CLDND1, CLDN3, CLDN4, are found to be expressed in normal thyroid tissue. While there is no significant difference of expression level found in CLDND2, CLDN4, CLDN15 genes.
In addition to this the microarray analysis also determined the over expression of CLDN1 member in papillary thyroid carcinoma. The claudin gene whose expression was high in both the data sets is considered to be as the valid results and this could be judged by comparing the fold change value. As mentioned above fold change value which is greater than two fold is considered to be noteworthy. This fold change difference is the minimum threshold value to compute the difference in expression level. In accordance that the t-test also shows CLDN1 as positive significant gene, which in turn means that it is up regulating in papillary thyroid carcinoma. Furthermore all the three means of clustering analysis predicted that CLDN1 is over expressed in case of papillary thyroid cancer as its expression status is higher in cancer datasets as compared to normal one.
As stated above the association of CLDN1 with papillary thyroid carcinoma has been predicted experimentally before. Our computational approach has verified that CLDN1 act as a genetic change in causing papillary thyroid cancer.
CLDND1, CLDN3, and CLDN4 these three genes are found to be down regulated by the SAGE analysis but in microarray analysis no clear significance is found in both of the datasets.
The in silico bioinformatics approaches are thought to be reliable and accurate as it involved the data for analysis from reliable sources and these are based upon the DNA sequencing. Combined serial analysis can act as good starting point in disease gene discovery. The use of in silico gene mining approaches provides an excellent scaffold for the initial identification of key genes and gene clusters whose expression is altered in disease tissue which provides a road map to assist the biologists.
This study verified the association of claudin-1 as a genetic cause in causing papillary thyroid cancer. By this it can be inferred that bioinformatics methodologies are convincing approaches for the evaluation of gene expression. The results will further be verified by experimental approaches but this study provides an initial point for the biologists to find the new cancer insights.