- Open Access
Impairment of organ-specific T cell negative selection by diabetes susceptibility genes: genomic analysis by mRNA profiling
Genome Biology volume 8, Article number: R12 (2007)
T cells in the thymus undergo opposing positive and negative selection processes so that the only T cells entering circulation are those bearing a T cell receptor (TCR) with a low affinity for self. The mechanism differentiating negative from positive selection is poorly understood, despite the fact that inherited defects in negative selection underlie organ-specific autoimmune disease in AIRE-deficient people and the non-obese diabetic (NOD) mouse strain
Here we use homogeneous populations of T cells undergoing either positive or negative selection in vivo together with genome-wide transcription profiling on microarrays to identify the gene expression differences underlying negative selection to an Aire-dependent organ-specific antigen, including the upregulation of a genomic cluster in the cytogenetic band 2F. Analysis of defective negative selection in the autoimmune-prone NOD strain demonstrates a global impairment in the induction of the negative selection response gene set, but little difference in positive selection response genes. Combining expression differences with genetic linkage data, we identify differentially expressed candidate genes, including Bim, Bnip3, Smox, Pdrg1, Id1, Pdcd1, Ly6c, Pdia3, Trim30 and Trim12.
The data provide a molecular map of the negative selection response in vivo and, by analysis of deviations from this pathway in the autoimmune susceptible NOD strain, suggest that susceptibility arises from small expression differences in genes acting at multiple points in the pathway between the TCR and cell death.
Immunological self-tolerance depends upon negative selection in the thymus, whereby T cells bearing T cell receptors (TCRs) with high avidity for self peptide-major histocompatibility complex (MHC) complexes are purged from the developing repertoire before they become functionally active in the periphery . Negative selection occurs by TCR-induced apoptosis during the transition of immature CD4+8+ double positive (DP) cells into mature CD4+ or CD8+ single positive (SP) cells. Nevertheless, a measure of TCR signaling is required for thymocytes to mature from DP into SP cells, an opposite process requiring a weak avidity for self peptide-MHC in order to initiate the changes of cell survival and maturation referred to as positive selection. It is thought that these two opposite processes, of cell survival or death, initiated by binding of the same receptor to its ligand, are controlled by quantitative differences in TCR affinity for self peptide-MHC that are translated into qualitatively opposite cellular responses. However, the molecular basis by which the two processes are differentially controlled and how the cellular responses initiated are achieved are unclear .
Recent studies have found that inherited defects in negative selection underlie autoimmune disease. In the human disorder autoimmune polyendocrinopathy syndrome type 1, defects in the AIRE gene reduce transcription of organ-specific genes in the thymus so that organ-reactive T cells are not negatively selected [3–5]. The non-obese diabetic (NOD) mouse strain is an intensely studied model for human autoimmune diabetes, as well as being susceptible to other autoimmune disorders , and displays a striking cellular deficiency in MHC class II- [7, 8] and class I-restricted negative selection , compared to non-autoimmune-prone strains. Elucidating the molecular basis for defective negative selection in NOD mice may shed light on the process of differentiating negative from positive selection and the pathogenesis of human autoimmunity.
The NOD thymic deletion defect is T cell autonomous [7, 8] and represents a quantitative (approximately ten-fold) decrease in negative selection efficiency to membrane-bound or soluble proteins, regardless of low or high thymic expression controlled by organ-specific or systemic promoters . Genetic linkage studies between the C57BL10.H2k (B10k) and NOD strains identified four NOD-derived recessive loci that contribute to defective negative selection in vivo, by tracing CD4 T cell deletion triggered in the thymus of transgenic mice by expression of an Aire-dependent antigen (insHEL) that mirrors the expression of the insulin gene itself . As would be expected, the four defective deletion loci correspond to four NOD loci known to contribute to diabetes susceptibility, linked to the markers D7mit101, D15mit229, D2mit490/Idd13 and D1mit181/Idd5. Parallel analyses in vitro, in a thymic organ culture system with exogenously added antigen, found NOD loci that acted dominantly to interfere with apoptosis linked to two recessive diabetes susceptibility loci, Idd5 and Idd3 .
Gene expression profiling on microarrays provides an opportunity to visualize the molecular differentiation of negative from positive selection and defects in negative selection at a global level. Several key questions can, in principle, be addressed. First, does negative selection involve induction or repression of a unique set of genes, or simply quantitatively exaggerated changes in the same set of genes as positive selection? Does the negative selection defect in NOD mice interfere with all or only a small number of negative selection genes, or does it cause a new profile of counter-regulatory genes to be triggered, and does it equally affect positive selection gene responses? Several studies have begun to explore this approach, although they have been limited by complications, such as premature negative selection at the double negative (DN) to DP transition, pooling of developmental subsets, TCR heterogeneity, and the peripheral cytokine storm that is produced after cognate antigen injection [12–15].
Here we extend a preliminary published analysis  using an approach that provides a unique opportunity to visualize the global expression changes under physiological in vivo conditions of negative and positive selection. The model employs TCR transgenic mice to trace selection of T cells with a homogeneous TCR recognizing a well-characterized high affinity peptide-MHC agonist, hen egg lysozyme (HEL) 46-61/I-Ak. This TCR is expressed at low levels on DP cells, and promotes positive selection into TCRhi CD4+ SP cells following the physiological pathway and induction of markers such as CD69 and CCR7 . When the TCR transgenic is crossed to insHEL transgenic mice on the non-autoimmune B10k strain background, the insulin-promoter activates Aire-dependent transcription to produce low levels of HEL antigen in thymic medullary epithelial cells, triggering negative selection at the physiological stage during the transition from cortical DP cells to medullary SP cells. Unlike thymocyte apoptosis initiated through the intravenous injection of antigen or anti-TCR antibodies, deletion of HEL-reactive thymocytes by endogenous presentation of HEL activates a physiological apoptotic process that is T cell intrinsic, with efficient deletion of HEL-reactive thymocytes at various dilutions of precursor frequency and no apoptosis of neighboring non-HEL-specific thymocytes (data not shown). This system of negative selection also has the advantage of faithfully replicating the expression pattern of a key autoantigen in human disease, insulin. By comparing global gene expression in homogeneous subsets of sorted cells from these mice, either pre-selection or undergoing positive or negative selection, we present here a detailed analysis of the gene expression differences distinguishing negative from positive selection in a non-autoimmune prone strain, and analyze the underlying defect in negative selection in the autoimmune NOD strain.
Results and discussion
Global gene expression changes differentiating positive and negative thymocyte selection in the B10kstrain
To characterize the gene expression changes occurring during positive and negative selection in a non-autoimmune strain, gene expression profiles were measured in three relatively homogeneous thymocyte populations from the B10k strain . Pre-selection thymocytes ('PreS') that had not yet received positive or negative selection signals were identified as CD4+8+ DP cells that are negative for CD69 and 1G12 TCR clonotype and sorted from TCR transgenic and TCR insHEL double transgenic mice. Early single positive cells that were beginning to undergo positive selection (S+) were sorted from TCR transgenic mice as CD4+CD8lowCD69+1G12+ cells. Early single positive cells with an identical cell surface phenotype but beginning to undergo negative selection (S-) were sorted from TCR insHEL double transgenic animals. Three independent pools of mRNA from sorted cells were analyzed by labeling and hybridization to Affymetrix 430A microarrays and normalized using MAS 5.0. MAS 5.0 was used as the normalization method in preference to more precise project-based model fitting systems, such as RMA, GCRMA and PLM, due to advantages in simple reduction of the false discovery rate. Use of the MAS 5.0 normalization method produces 'present' and 'absent' calls and, when used as a filtering device, this has been demonstrated to reduce the false discovery rate with little true cost to the true positive rate . Furthermore, by taking the mismatch probe into consideration, MAS 5.0 reduces the level of false positives based on cross-hybridization . It should be noted that all normalization methods involve a trade-off of accuracy or precision at various levels and, while MAS 5.0 performs strongly for background correction and medium and high intensity signals , it is less accurate than some alternatives for probesets in the low intensity range [18, 19]. The complete dataset, with raw values and statistical analysis is given in Additional data file 1 and has been deposited in the NCBI Gene Expression Omnibus (GEO)  accessible through GEO Series accession number GSE3997.
We first performed a global analysis of the differences in gene expression between negative and positive selection and pre-selection by measuring the Euclidian distance between conditions (Figure 1). Measuring Euclidian distance involves treating each condition as a single point in n-dimensional space, where each dimension is the expression of a single probeset. The distance between two conditions can then be calculated as the straight line ('ordinary') distance between the two points (conditions), so that, for example, a low value between two conditions indicates that they have similar values for gene expression, while a high value between two conditions indicates that they have different values for gene expression . The advantage of Euclidian distance is that it is an approximation of the closeness of global gene expression profiles under different conditions, independent of classification of gene changes as significant or non-significant by including the number of genes that are unchanged, and the degree of change in differentially expressed genes (thus allowing subtle changes in large numbers of genes, which would not necessarily be detected if a statistical threshold was required, to impact global distance). Using Euclidean distance as the measure of global 'closeness', the largest difference was between pre-selection thymocytes and selecting thymocytes (both positive and negative selection). Negative selection was closer to the pre-selection condition than positive selection.
Individual gene expression changes were then analyzed in an independent method of assessment, by assigning probesets into categories based on statistical significance of gene expression pattern in the 3 replicates, allowing categorization into 12 possible patterns. Assignment to the 12 patterns was based only on the direction of significant gene expression change rather than degree of change (Figure 2). Global expression 'closeness' could then be estimated by comparing the number of probesets that fit each category.
This measure of global expression 'closeness' between conditions gave a result consistent with the Euclidean analysis, where the pattern categories with the largest number of probesets were those (patterns A and G in Figure 2) displaying an expression difference between pre-selection and selection (both positive and negative selection) but no difference between positive and negative selection. These are likely to represent genes that are developmentally regulated as part of the differentiation of immature DP cells into more mature early SP cells. Some may be induced or repressed by TCR signaling mechanisms that are unable to distinguish between TCR engagement by weak positively selecting agonists and the strong negatively selecting agonist. Pattern A comprised 531 probesets that were induced during maturation/selection, and pattern G comprised 692 probesets that were repressed. The induced genes included well established markers and mediators of maturity, Tcr, Ccr7, S1P1 and IL7R, and genes that are known to be targets of positive and negative selection TCR signals, such as the calcineurin-response genes Ian1, Egr2 and CD52 and ERK-response genes Nab2 and Zfp36l1. The repressed genes included Rag1, preTαβ, CD8α and CD8β, known to be involved in thymocyte maturation changes, and cell cycle genes Cdc7, Cdk2, Cdk2ap1, and Cdk4, which are consistent with the exit from cell cycle that accompanies maturation of early DP cells, which are cycling, into later DP and SP cells, which are non-cycling. The identification of these expected expression changes acts as an internal validation of the dataset.
As well as these previously identified markers of positive and negative selection, several important T cell regulatory genes were also found here to be upregulated in selection: those encoding nucleic acid binding proteins, such as Bcl3 , Dicer1, Fosb , Hivep1 , Irf1 , Irf4 , Irf7 , JunB , Klf2 , Nfat5 , Rora , Stat1 , and Stat6 ; signaling-associated genes Ccnd2 , Evl , Hspa5 , Ly9 , Mcl1 , Ndfip1 , Psen2 , Rassf5 , Upf2  and Zfpn1a ; and those encoding receptors, such as Crry , Gpr83 , 2-K1 , Ms4a6b , Sema4a , Slamf1 , Tlr1 , Tnfsf11 , and Tslpr .
Previously unassociated genes identified as part of this analysis are, therefore, excellent candidates for novel maturation markers, and include those encoding nucleic acid binding proteins, such as 1810007M14Rik, AA408868, Aptx, Arts1, D11Lgp2e, D1Ertd161e, Ddef1, Ddx19b (2810457M08Rik), Dedd2, Dnmt3a, Elk3, Hnrpa1, Hrb, Ifih1, Isgf3g, Mxd4, Nab1, Rab8b, Rbms2, Rnaset2, Rpl12, Rpo1-1, Skil (9130011J04Rik), Sp100, Tcf3, Tef, Trim21, Trim30, Wasl, Zbp1, Zcchc7, Zfp260, Zfp313, and Zfp445 (AW610627); signaling-associated genes Bin1, Dlgh1, Myd88, Nedd9, Pacsin1, Pscdbp, Sdc3, Shkbp1, Sytl2, Traf1, Vps28, Xpo6; and those encoding receptors, such as 1810011E08Rik, Brd8, Cd9, Folr4, Ptger2, Ptgir, Sorl1, and Tlr2. The complete set of genes identified in this category is listed in Additional data file 2.
The next largest pattern categories comprised probesets that were unique to positive selection, consistent with this condition being the most differentiated based on the Euclidean analysis. These categories (Figure 2) included genes that were induced (patterns B and C) or repressed (patterns H and I) during positive selection, but were either not altered at all during negative selection (C and I) or underwent a change of lesser magnitude but in the same direction (B and H). Genes in these categories are candidates for translating TCR engagement by weak agonists into survival and maturation rather than negative selection. However, this category will also include genes that are developmentally regulated at later stages of SP cell maturation, after CD69 induction and increased TCR surface expression, since negative selection will remove SP cells before reaching this stage. Patterns B and C comprised 278 probesets that were preferentially induced during positive selection, including the well established functional genes CD3ζ and the calcineurin-response gene Itgb7. Patterns H and I comprised 394 probesets that were preferentially repressed during positive selection, including developmental markers Cxcr4, CD8α, CD24a, CD25, and cell cycle genes Cdc2, Cdc6, Cdc20, Cdc25b, Cdka2c and Myb.
As well as these previously identified markers of positive selection, a number of important T cell regulatory genes were also found here to be upregulated in positive selection: those encoding nucleic acid binding proteins, such as Dbp ,Foxo1  and Zfp67 ; the signaling-associated gene Stam2 ; and those encoding receptors, such as Il6ra  and Itgb2 .
Previously unassociated genes identified as part of this analysis are, therefore, excellent candidates for novel markers of positive selection, and include those encoding nucleic acid binding proteins, such as Bhc80, Ddb2, Ezh1, Gata1, Pcbp4, Rbms1, Smarca2, and Trim26, Ddit3, Foxp1, Oas2, Tgif2, and Zfp467; signaling-associated genes Arrb1, Bcap31, Emid1, L1cam, Numb, Pea15, Rabip4, Rcbtb1, Rsn, Selpl, and Sh3gl1; and those encoding receptors, such as AA691260, D7Ertd458e, Frag1, Gpr97 Grina, Il6st, Paqr7, Robo3 and Sh2d3c. The complete list (Additional data file 2) dramatically expands the set of candidate mediators and markers for understanding positive selection and SP cell maturation.
A smaller transcriptome of 246 probesets comprised genes that were preferentially or selectively induced or repressed during negative selection (patterns D, E, F, J, K, L in Figure 2), consistent with negative selection being closer to pre-selection in the global analysis. Genes in these categories are candidate mediators or markers of negative selection and thymocyte apoptosis. Pattern categories E and K were selectively induced or repressed during negative selection, showing no change in expression between pre-selection cells and positive selection. Pattern E comprised 87 probesets, including the TCR-induced pro-apoptotic gene Bim (Bcl2l11), which has previously been shown to be induced selectively during negative selection in this system and is an essential mediator of negative selection, and activation markers such as Ccr6.
Genes in patterns D and J were induced or repressed more strongly in negative than positive selection. Probesets in these categories are likely to include genes that report the quantitative differences in TCR signaling thought to differentiate strong TCR engagement by negative selecting agonists from weak engagement by positively selecting agonists. Pattern D comprised 48 probesets, including the gene encoding the thymocyte apoptosis-inducing transcription factor, Nur77, markers of activated T cells or regulatory T cells, Gitrd, Ox40 and 41bb, and the ERK-response gene Fos. Category F and L comprised a small number of probesets that exhibited a change in expression during negative selection that was in the opposite direction to that induced during positive selection. Pattern L includes genes such as CD25, CD24a and Annexin A4.
Overall, the negative selection transcriptome is quite small: 144 induced probesets, and 102 repressed probesets. By contrast, positive selection of early DP thymocytes into early SP thymocytes induced 809 probesets and repressed 1,086 probesets - a transcriptional program that is eight-fold larger. A complete list of the positive and negative selection transcriptomes, all the genes falling into patterns A to L, in the B10k strain is given in Additional data file 2.
Genomic localization of gene expression changes induced by selection on the B10kbackground
Recent studies have found that sets of genes induced by similar stimuli can co-localize in genomic clusters . To determine if positive or negative selection likewise work by the activation or suppression of broad clusters of genes we queried the data to identify cytogenetic bands with a significantly increased proportion of induced or repressed genes, normalized to their gene density.
Focusing first on genes associated with positive selection, there is evidence for a small degree of gene expression change by cytogenetic band. Six cytogenetic bands were broadly suppressed upon positive selection, and no cytogenetic bands were broadly activated. Of the six altered cytogenetic bands, three meet stringent false discovery rates, 12F, 3B and 2F (Table 1).
The gene expression changes induced upon negative selection, by contrast, are highly localized. While the global difference (Euclidian distance) between positive selection and negative selection is only half that of positive selection to pre-selection (Figure 1), a greater number of cytogenetic bands show enrichment of gene expression differences. That is, while fewer genes were changed, and with a lesser magnitude, in positive selection when compared to negative selection than in positive selection compared to pre-selection, the genes that were differentially expressed were highly co-localized to certain genomic regions. One region was broadly activated in response to negative selection, 2F, while seven regions were broadly suppressed upon negative selection, of which two meet stringent false discovery rates, 11E and 6B (Table 1). Region 2F is also of interest because it is a region that contains a cluster of genes that decreased in expression during positive selection (Figure 3a), and a cluster of genes that increased in expression during negative selection (Figure 3b). The induced and repressed clusters are, however, largely distinct, with little correlation (Figure 3d). Of significance, the key pro-apoptotic gene Bim is encoded within 2F. Bim is controlled at least partially by chromatin structure, as histone deacetylase inhibition allows the spontaneous induction of Bim followed by apoptosis . Of the 2F probesets changed by negative selection in the B10k strain, the highest level of upregulation is observed in Dut, Cops2, Dusp2, Bub1, Bim, Mertk, Smox, 6330527O06Rik, LOC545465 and 2610101J03Rik. Similarly, the 2F probesets with the greatest defects in upregulation in the NODk strain are Cops2, Galk2, Bub1, Bim, 1600015H20Rik, IL1a, Smox, Bmp2, Hao1, 6330527O06Rik and 2610101J03Rik. These data generate the hypothesis that the cytogenetic band 2F contains a concentration of apoptotic initiators for negative selection that may be coregulated by chromatin structure.
Global gene expression changes induced upon positive and negative thymocyte selection in the NODkstrain
In parallel with the above analyses of negative and positive selection in B10k mice, the same cell subset markers, sorting methods, and mRNA labeling and hybridization to microarrays were applied to pre-selection (PreS), early positive selection (S+) and early negative selection (S-) thymocytes from TCR and TCR insHEL animals on the NOD.H2k strain background. This allowed developmentally matched, homogeneous populations of T cells to be traced during positive and negative selection using the same TCR and self peptide-MHC ligands, but carrying all the NOD genomic differences from B10 with the exception of the congenically matched H2k haplotype.
The global gene expression differences between pre-selection DP cells and early SP cells undergoing positive or negative selection were first used to compare these states by Euclidian distance (Figure 1). The difference between pre-selection and positive selection thymocytes was similar on the NODk background (23.4 units) to that observed on the B10k background (26.7). On the NOD background, however, there was much less difference between positive and negative selection, with the Euclidean distance between these states decreased from 13.9 in B10k to 8.9 in NOD.
Individual gene expression differences between pre-selection, positive selection and negative selection on the NOD background were categorized into the same 12 patterns, as conducted above for the equivalent cells from B10 strain animals (Figure 2).
Focusing first on patterns A and G, representing probesets that were induced or repressed equivalently during positive or negative selection, these categories contain the largest number of genes that were induced (787 probesets) or repressed (994 probesets), which are comparable to the numbers observed for these categories in the B10 strain (Figure 2). Again, category A includes genes that are developmentally increased during maturation from DP to SP cells, such as IL7R, and genes that are induced by TCR signals during positive and negative selection, such as calcineurin-response genes Ian1, Egr2 and CD52 and ERK-response genes Nab2 and Zfp36l1. In total, 240 of the probesets assigned to category A in NOD were also assigned to this category in B10. The stringent cut-offs used to assign probesets to pattern categories underestimated the similarity of gene expression during DP to SP maturation on the two strain backgrounds, because only 39 of the 531 pattern A probesets in B10 thymocytes have significantly different values from NOD for the corresponding cell types. Likewise, genes that were decreased during maturation (pattern G) included expected developmentally regulated genes such as Rag1, Cd8α, PreTα and cell cycle genes. Of the 692 B10 pattern G probesets, only 56 have significantly different values to NOD for both positive and negative selection. Thus, the NOD background had little effect on gene expression changes associated with early SP maturation from pre-selection DP cells.
By contrast, the NOD background has markedly reduced numbers of genes with expression patterns that differentiate negative from positive selection (patterns B to F and H to L), consistent with the smaller Euclidean distance between these two states in the global analysis. Thus, of the patterns with increased expression in both positive and negative selection (A, B, D), 79% show the same degree of regulation during positive and negative selection (assigned to pattern A) in B10k mice, but 94% show the same degree of regulation in NODk mice (with NODk pattern A containing many probesets assigned to patterns B or C in B10k mice). Likewise, of the patterns with decreased expression in both positive and negative selection (G, H, J), 72% show the same degree of regulation (assigned to pattern G) in B10k mice, but 95% show the same degree of regulation in NODk mice. By this assessment, positive and negative selection are less distinct in NODk mice than in B10k mice.
Focusing specifically on genes that were preferentially or selectively induced (patterns D, E, L) or repressed (J, K, F) during negative selection revealed a global dampening of the negative selection response in the NOD background (Figure 4). Patterns D, E, and L, comprising probesets that were induced during negative selection either selectively (E, L) or to higher levels than during positive selection (D), contained only 66 probesets in NOD mice, whereas these sets were more than twice as large (144 probesets) on the B10 background. Moreover, of the 144 B, E or L probesets that were specifically induced during negative selection in B10 mice, 137 were diminished in expression during negative selection in NOD mice, 112 by more than 20% and 82 by a significant amount (Figure 4a). Thus, as noted previously, Bim induction (category E in B10) is undetectable in NOD thymocytes, while Nur77 induction (category B in B10) is greatly diminished.
Similarly, genes that were selectively decreased in negative selection (patterns J, K, F) accounted for only 46 probesets in the NOD strain compared to 102 in B10. Of the 102 probesets decreased upon negative selection in B10 mice, 100 remained at higher levels during negative selection in NODk mice, 84 by more than 20% and 44 significantly so (Figure 4a). Combining both transcriptional increases and decreases, of the 246 probesets specifically changed by negative selection in the B10k mouse, 51% were significantly less changed in the NODk mouse. By contrast, of the 531 pattern A probesets that were increased equivalently during positive and negative selection in the B10k mouse, only 7% had significantly different expression during negative selection in NODk compared to B10k mice, and the majority showed similar expression (Figure 4b).
The presence of reduced upregulation and downregulation across the entire spectrum of negative selection-specific genes in NODk thymocytes indicates that upstream effects are at least partially responsible. This observation, recognizable only at a genomic level, was not predicted in previous analyses that focused solely on changes in downstream effectors, such as Bim and Nur77 [10, 11]. Such a defect may be occurring at the early signaling synapse, in line with a recognized alteration of TCR signaling components, such as enhanced Fyn kinase activity, differential activation of the Cbl pathway, impairment of membrane-translocation of Son of sevenless (mSOS) Ras GDP releasing factor, and the exclusion of mSOS and Phospholipase C (PLC)-γ1 from the TCR-Grb2-Zap70 complex, resulting in hypoactivation . Altered signaling in the basal TCR apparatus may be responsible for the reduced surface CD3 levels present on TCR transgenic thymocytes (without the presence of insHEL) in the NODk strain compared to their B10k counterparts, with a 50% reduction at the DP stage, and a 20% reduction at the SP stage (Figure 5).
The NOD background also caused a large decrease in probesets assigned to categories B, C, H, and I, comprising genes that are preferentially or selectively altered during positive selection (Figure 2). This result has two non-exclusive explanations. First, there may be a less efficient positive selection response in NOD. Alternatively, many of the genes in this category may normally be developmentally regulated to appear at later stages of SP cell maturation, before CD69 is lost but at a stage when negative selection would have removed most such cells.
Constitutive differences in thymocyte gene expression caused by the NOD background
In addition to the altered negative and positive selection response above, the NOD background also had altered pre-selection gene expression in TCRlowCD69- DP cells, which may set the stage for altered responses when the cells encounter negative selecting antigens. Six independent pools of pre-selection DP cells were analyzed on both B10 and NOD backgrounds: three from TCR animals and three from TCR insHEL animals. There were few differences between TCR and TCR insHEL pre-selection pools within a strain background, consistent with sorting for antigen-nasïve thymocytes that had yet to display TCRs for HEL and induce CD69. Comparing pre-selection cells between the strains at a global level first (Euclidian distance, Figure 1), the difference between these states (14.2) was approximately half that of the difference between pre-selection DP and early positive selection SP cells (26.7), with a total of 1,484 probesets significantly different between NOD PreS and B10 PreS (1,484 probesets). It is unknown if this degree of pre-selection divergence is specific to the NODk strain, or if it is observed across multiple strains based on comparative divergence.
In terms of genomic location, these changes are particularly concentrated in 20 cytogenetic regions (Table 1). Twelve regions show increased expression in B10k pre-selection thymocytes, eleven of which meet stringent false discovery rates: 8E, 7E, 18E, 12F, 6C, 1B, 15D, 5F, 10C, 19A and 4E. Likewise eight regions show increased activity in NODk pre-selection thymocytes, all of which meet stringent false discovery rates: XF, 3E, XD, 3H, 5C, 12C, 1A and XA. With regard to the phenomenon of defective negative selection in the NODk strain, it may be of relevance that two of these regions co-localize with genomic loci that contribute to defective negative selection , 7E and 15D.
Gene expression differences between NODk and B10k strains induced upon negative selection were also analyzed for cytogenetic clustering. Only four cytogenetic bands show enrichment, after eliminating regions changed in the basal (that is, pre-selection thymocytes) state. Two regions, 2F and 3A, were broadly suppressed in the NODk strain, and two regions were broadly activated in the NODk strain, neither of which meet stringent false discovery rates (Table 1). The region 2F is of particular interest for several reasons. Firstly, this is the only region that was broadly activated upon negative selection in the B10k strain (Figure 3b). Secondly, it is one of only two regions that show broad strain differences in regulation upon induction of negative selection, with lower activity in the NODk strain (Figure 3c). Thirdly, this region overlaps one of the six identified loci with a causative effect in defective negative selection in the NODk strain . A comparative analysis of the gene expression changes in B10k and B10k-NODk negative selection demonstrated that this locus comprises the same cluster of genes that are upregulated upon negative selection in the B10k strain and show poor induction of gene expression in the NODk strain (Figure 3e). These data indicate that the NODk strain has a genetic defect preventing the efficient induction of the negative selection 2F cluster, including Bim, preventing initiation of apoptosis.
Gene expression variants representing causal candidates for defective thymic deletion in NOD
Combining the global transcription profiles for NODk and B10k pre-, positive and negative selection with linkage data for efficiency of negative selection in the same in vivo conditions  provides a way to identify candidate genes responsible for the NOD trait of defective negative selection. While expression differences are promising candidates for quantitative traits, we recognize that this approach is unable to detect allelic variants arising from differential mRNA splicing, such as the Idd5 allele of Ctla4 , or from amino acid substitutions, such as the Idd13 polymorphism in β2 m .
Expression pattern was first used to identify six categories of high priority candidate genes (Figure 6, Additional data file 2). Group 1 consists of probesets that were preferentially increased in B10k negatively selecting thymocytes (patterns D, E and L), but were significantly less increased in NODk counterparts. There are 77 probesets in this group, of which 14 show poor induction and 63 show no induction. Defective apoptotic initiators could fall in this group. Group 2 consists of probesets that were increased in B10k thymocytes upon selection (pattern A), but were significantly less increased in NODk mice (p < 0.05). Only 8 probesets are in this group, of which seven show no increase upon maturation in the NODk strain. Defective functional prerequisites for negative selection switched on during positive selection could fall in this category. Group 3 consists of probesets that were significantly lower in NODk thymocytes compared to B10k thymocytes for each biological condition. There are 151 probesets in this group, 80 of which show no development or selection-induced differences within the NODk or B10k groups. Defective constitutively expressed prerequisites for negative selection could fall in this group. Group 4 is the reverse of group 1. It consists of probesets that were increased in the NODk negatively selecting thymocytes, but were significantly less increased in B10k thymocytes. This group is designed to catch candidate genes that were more strongly induced in NODk negatively selecting thymocytes and provided protection from negative selection. Only two probesets fall in this category. Group 5 is the reverse of group 2. It consists of probesets that were increased in NODk thymocytes upon selection (pattern A), but were significantly less increased in B10k thymocytes. This group is designed to catch NODk over-expressed maturation-induced protective genes. Group 6 is the reverse of group 3, comprising probesets that were significantly higher in NODk thymocytes compared to B10k thymocytes for each biological condition. There are 98 probesets in this group, 48 of which show no developmental or selection-induced differences within NODk or B10k thymocytes. Constitutively over-expressed protective factors could fall in this group.
A matrix comprising the probesets in these six categories and the genomic location to a 30 cM bracket surrounding peak linkage to the four regions of NODk susceptibility to defective negative selection markers (D7mit101, D15mit229, D2mit490/Idd13 and D1mit181/Idd5)  identified 44 candidate probesets. Each region has 6 to 10 candidates using this method, except for the region centered on D2mit490, which includes the 2F cluster and has 20 candidates. These are discussed in more detail below, as summarized in Table 2.
Of the candidate genes linked to the D7mit101 loci (Ch7, 60 cM), four genes are of particular interest (Figure 7, Table 3). Bnip3, 8.4 cM from D7mit101, was approximately two-fold over-expressed in NODk thymocytes in every condition. Bnip3 is a BH3-only protein that dimerizes with Bcl-X(L), making a pro-apoptotic heterodimer [57, 58]. Bnip3 has been shown to translocate to the mitochondria to induce apoptosis during CD47-induced apoptosis , nitric oxide induced apoptosis in macrophages , hypoxic apoptosis , and activation induced death of cytotoxic T cells . Overexpression of Bnip3 would, therefore, be unlikely to protect NODk thymocytes from clonal deletion. Bnip3 overexpression may instead be a downstream effect of the NOD defective thymic deletion allowing thymocytes to tolerate a higher level of expression, just as Bnip3 overexpression is associated with more aggressive tumors and poor survival in human patients . Trim30 and Trim12 are approximately 13 cM from D7mit101 and were poorly expressed in NODk thymocytes. Trim30 shows an average of a 2- to 3-fold decrease in expression in NODk thymocytes (represented by three probesets, two of which show a strong difference and a third in the 5' untranslated region (UTR) that shows little difference), while Trim12 shows an approximately 300-fold expression decrease in NODk thymocytes (Figure 7). Both are members of the tripartite motif family, with RING, B-box type 1 and 2, and coiled coil domains . Little is known about their function, but the extent of the change and putative domain function make them strong candidate genes.
A key candidate gene for the D15mit229-linked defective thymic deletion loci (Ch15, 22 cM; Figure 8, Table 4) is Ly6c. Ly6c is 21.1 cM from D15mit229, and was poorly expressed in NOD thymocytes under all conditions (2.3-fold decrease; Figure 8). This reduced expression has been previously observed to be due to a Ly6c promoter polymorphism in NOD mice, and is thus a known cis effect . Functionally, Ly6c inhibits the signal for secretion of interleukin (IL)2 and proliferation in peripheral CD4+ cells , and cross-linking of Ly6c causes clustering of Leukocyte function associated molecule 1 (LFA-1, CD11a/CD18) on the surface of CD8 T cells .
Many candidate genes were identified in the 2F cytogenetic band around D2mit490- (Ch2, 65 cM), of which four are of particular interest (Figure 9, Table 5). Smox oxidizes spermine to spermidine , which has been shown to induce DNA damage and apoptosis in gastric epithelial cells . The major control for Smox during polyamine-induced apoptosis has been shown to occur at the mRNA level ; therefore, the small increase during B10k positive selection and large increase in negative selection could prepare B10k thymocytes for apoptosis (Figure 9). The lower levels of Smox in NODk thymocytes, and the negligible increase during negative selection, could contribute to a poor apoptotic response. Bim/Bcl2l11 as a candidate gene has been previously validated , having a key role in negative selection [70, 71] and defective upregulation in the NODk strain (in three probesets; two probesets in the 3' UTR region of one splice variant showed little change). Another candidate gene in this region is Id1, 8.8 cM from D2mit490. It was increased during negative selection 2.2-fold higher in B10k thymocytes than NODk thymocytes (Figure 9), and has been shown to inhibit the function of E2A and Transcription factor 12 (HEB), increasing the response to TCR stimulation and the sensitivity of thymocytes to apoptosis . Thus, Id1 induction may be required to amplify the clonal deletion signal for apoptosis. Pdia3, 10.9 cM from D2mit490, was expressed at 3.4-fold higher levels in B10k thymocytes (Figure 9). Its function as an endoplasmic reticulum chaperone required for mitomycin C-induced death  indicates potential candidacy. Also of interest is Pdrg1, 8.8 cM from D2mit490, which has previously been shown to be upregulated in response to UV radiation and inhibited by p53 . It is of interest because of the basal low level observed in NODk thymocytes.
The fourth NOD locus contributing to defective negative selection in TCR insHEL mice, linked to D1mit181 (Ch1, 43 cM), includes the genes Ctla4 and Pdcd1 (Figure 10, Table 6). The Ctla4 gene was not differentially expressed in our analysis, but it is a functional candidate because Ctla4-deficient thymocytes are resistant to radiation-induced apoptosis . The NOD Idd5 (D1mit181-linked) haplotype produces less of an alternatively spliced Ctla4 gene product that is a constitutively active inhibitor of TCR zeta phosphorylation [55, 76]. The orthologous region in humans also contains a CTLA4 variant and has been associated with diabetes, thyroid autoimmunity, Addison's disease, and disease severity in multiple sclerosis . As mentioned during the preliminary analysis , Pdcd1 was induced during negative selection in B10 but not in NOD thymocytes (Figure 10). Pdcd1 (also called PD1) has previously identified functions in negative regulation of T cell function  and Pdcd10/0mice display reduced positive selection  and autoimmune symptoms [79, 80]. Furthermore, blockade of Pdcd1 induces diabetes in NOD mice .
The analysis of the negative selection transcriptome here distinguishes among several distinct, but not mutually exclusive, mechanisms accounting for defective negative selection in NOD thymocytes: first, several downstream effectors, such as Bim, are defective; second, the entire negative selection induction process is reduced; third, there is a broad defect in the induction of TCR signaling response genes (both positive and negative selection); and fourth, the NODk strain induces an additional, protective transcriptome during negative selection. By comparison of the NODk strain to the B10k strain at a global transcription level, there is no evidence for the ectopic of an additional, 'protective' gene set, nor for an obvious defect in the TCR signaling-dependent process of positive selection. The second possible mechanism appears correct, as there was a global reduction by approximately 40% in the transcriptional process of negative selection, indicating that a defect in upstream events impacted on multiple downstream mediators. Not exclusive from the upstream defect, several important apoptosis effector genes, including Bim, are almost completely absent from the NOD negative selection response, raising the possibility that these genes are at points where individual quantitative differences summate, with cis-acting promoter defects having an additive effect with the defect in upstream inductive events. In particular, the cluster of poorly induced negative selection genes in cytogenetic band 2F around Bim raises the possibility of a cis-acting allelic variation contributing to poor induction of this locus. Of interest, the defect is not absolute, with partial upregulation seen for the majority of the negative selection gene set, correlating with previous cellular data indicating that NOD thymocytes are capable of strong negative selection when exposed to higher levels of stimuli .
The candidate genes discussed above act at multiple points in the pathway between binding to TCR of negatively selecting peptide-MHC and triggering of thymocyte apoptosis. These data frame a hypothesis that defective negative selection involves the summation of many incremental decreases in the efficiency of this signaling pathway, caused by multiple allelic variants at four chromosomal loci. In NOD thymocytes there is lower surface TCR expression, and lower efficiency of TCR signaling due to increased iCTLA4 and decreased Pdcd1 and Id1. This general decrease in TCR signaling may then be compounded by poor inducibility or low expression of apoptosis inducers Bim, Smox, Bnip3, and Pdia3. By producing a molecular map of negative selection responses in vivo, the results from this study open up pathways to understand the mechanism of negative selection and the basis for its quantitative variation leading to autoimmune disease.
Materials and methods
The data discussed in this publication have been deposited in the NCBI Gene Expression Omnibus , as MIAME compliant data, and are accessible through GEO Series accession number GSE3997.
As previously described , cell populations were purified from healthy 6-8-week old female mice, stained in 5 μg/ml actinomycin D and 2 μg/ml α-amanitin (Sigma-Aldrich, St Louis, MO) with CD4- fluorescein isothiocyanate (FITC), CD69-phosphatidylethanolamine (PE), CD8- peridinin chlorophyll protein (PerCP) and 1G12 indirectly labeled with Allophycocyanin (APC), then sorted with a Becton Dickinson (Franklin Lakes, NJ) FACVantage. Sorted populations were 'early DP' (CD4+CD8+CD69-1G12-) and 'early SP' (CD4+CD8lowCD69+1G12+). Purified RNA underwent two rounds of in vitro RNA amplification before fragmentation and hybridization to Affymetrix GeneChip 430A arrays (Santa Clara, CA, USA).
Microarray data processing
The Affymetrix 430A chips were scanned using standard Affymetrix protocols. All arrays passed routine quality control assessment for hybridization and data quality. Expression values, referred to as probeset 'signal', were calculated using the Affymetrix GeneChip analysis software MAS 5.0, with a scaling chosen so that each array has a trimmed mean of 150. Following examination, the endogenous control probesets were removed. The MAS 5.0 signal values were then transformed to logarithms (base 2). Signal values of 0 were assigned a value of 0.1 before taking logs. Finally, the arrays were standardized to mean zero and variance one. Initial gene filtering kept only those probesets that were called 'present' by the Affymetrix signal detection algorithm in all replicates for at least one biological group. All statistical analyses were carried out using these transformed signal values. Statistical analysis was carried out in three data sets, one consisting only of the B10k biological groups, one consisting only of the NODk biological groups, and one consisting of both B10k and NODk biological groups.
Assignment of probesets into gene expression patterns
Assignment of probesets to different patterns of gene expression was separately carried out on the B10k and NODk datasets. The patterns are defined in terms of direction of change between means, rather than the extent of change, with assignment of a change having a statistical cut-off rather than a fold-change cut-off. For the separate B10k and NODk datasets, two way analyses of variance (ANOVAs) were performed on each of the selected probesets. For each probeset, if the overall F-test for a test of mean difference was significant (p < 0.005), the probeset was considered significantly changed. Having established that the means were different, t-tests for the contrasts in means were used to determine significantly different means. A significance level of less than 0.05 was used for these tests. Contrasts between 'pre-selection' thymocytes (CD4+CD8+1G12- CD69-) from TCR transgenic and TCR insHEL double transgenic mice showed very few differentially expressed probesets, as is expected for populations that have not been exposed to HEL antigen due to low expression of TCR and anatomical segregation of antigen in the corticomedullary junction. As a consequence, these two groups were merged and the model re-estimated for all the selected probesets. Since there are now three groups, there are twelve distinct patterns of up, down and no difference. Contrasts between the means using the re-estimated models were used to assign probesets to a gene expression pattern, based on a logical set of significant changes between the various conditions. A three-way ANOVA model was used to analyze the combined B10k-NODk dataset for differential expression between strains. The p values are used in this research as indicative of 'evidence' and, except in rare instances, will not be an exact measure of probability. Model assumptions for the use of the different tests were examined for a small number of significant probesets and found to hold. Following analysis, each probeset was annotated for genomic location and gene function using the FACTS (Functional Association/Annotation of cDNA Clones from Text/Sequence Sources) program .
Global gene expression differences
Euclidian distance is the most common measure of metric distance, which is an approximation of the 'distance' between gene expression from replicates in one condition to replicates in other conditions. Euclidean distance is calculated by treating the expression of a group of probesets as a point in n-dimensional space, where the distance from the axis in each dimension represents the expression (the technical mean of transformed signal value for the replicates) of a single probeset. This produces a single point in n dimensions (where n is the number of probesets in the group) that represents one set of conditions, and a single point in the same n-dimensional space that represents the same group of probesets under different conditions. The Euclidean distance between each point (representing each condition) was calculated by using the square root of the sums of squared differences between the points in each dimension, using the formula:
Euclidean distance = √(x1 - y1)2 + (x2 - y2)2+ (x3 - y3)2 + ...
where x is the point representing the probeset group in condition x, with each dimension x1, x2, x3, and so on being the expression of individual probesets within the group, and y is the point representing the probeset group in condition y, with each dimension y1, y2, y3, and so on being the expression of individual probesets within the group .
The straight line distance between the two points thus calculated represents the difference in expression of the included probesets between the two conditions. The probeset group used to calculate the Euclidean distance here included all probesets present in all replicates for at least one condition, thus creating an approximation of the genome-wide similarity between two conditions [21, 83, 84].
Genomic clustering analysis
Gene Set Enrichment Analysis (GSEA) was used to examine the genome for areas of differential expression by cytogenetic band. GSEA R script (script defaults, standard method)  was used with the 'gene label' permutation method. Genes represented by multiple probesets were reduced to a single probeset with the smallest overall p value. A lower cut-off of 20 genes per cytogenetic band was used, which resulted in a total of 113 gene sets being tested for enrichment. A false discovery rate cut-off of 0.4 was initially used, with the stricter criterion of 0.25 for regions listed. Regions with enriched gene expression during positive selection (B10k) refer to cytogenetic bands with an enriched number of genes with expression differences between B10k pre-selection thymocytes and B10k positive selection thymocytes (TCR transgenic early SP cells). Regions with enriched gene expression during negative selection (B10k) refer to cytogenetic bands with an enriched number of genes with expression differences between B10k positive selection thymocytes (TCR transgenic early SP cells) and B10k negative selection thymocytes (insHEL:TCR double transgenic early SP cells). Regions with enriched strain differences in pre-selection thymocytes refer to cytogenetic bands with an enriched number of genes with expression differences between B10k pre-selection thymocytes and NODk pre-selection thymocytes. Regions with enriched strain differences induced during negative selection refer to cytogenetic bands with an enriched number of genes with expression differences between B10k negative selection thymocytes (insHEL:TCR double transgenic early SP cells) and NODk negative selection thymocytes (insHEL:TCR double transgenic early SP cells), with any regions showing enrichment at the pre-selection thymocyte stage removed.
We analyzed 6-10-week old mice (or 6 week post-reconstitution chimeric mice) as described previously [5, 10] using the following antibodies: 1G12 anti-clonotype  (gift of E Unanue and D Peterson, Washington University, St Louis, MO, USA) culture supernatant followed by rat anti-mouse IgG1 allo-phycocyanin; anti-CD8α-PerCP; anti-CD4-FITC or PE; anti-Ly5a-FITC; anti-CD3-PE; and anti-B220-PE (all from BD PharMingen, San Jose, CA, USA).
Additional data files
The following additional data are available with the online version of this paper. Additional data file 1 provides a complete listing of statistically significant gene expression changes. In the 'Statistics and annotation' worksheet, basic annotation data are given for each differentially expressed probeset, with the Affymetrix ID number, the gene symbol, chromosome number and starting/ending location, cytogenetic band, gene ontology/molecular function and the Affymetrix target description. Means and gene expression values are given on the transformed scale. The worksheet also contains information on the significance level of analysis of variance tests. The tests were conducted on the transformed gene expression values for each probeset separately. The p value for significant differences between conditions is listed if the parent p value is less than 0.05. Only genes that showed a significant difference between all the means on this scale, using a threshold of p ≤ 0.005, are included in the worksheet. 'Within B10' tests for differences within the B10k conditions only (using only B10k condition variance data), with subtests 'Within pre' comparing the B10k pre-selection population sorted from insHEL transgenic and non-HEL transgenic hosts, 'PreS vs S-' comparing B10k pre-selection populations to B10k negative selection populations, 'PreS vs S+' comparing B10k pre-selection populations to B10k positive selection populations, and 'S+ vs S-' comparing B10k positive selection populations to B10k negative selection populations. 'Within NOD' tests for differences within the NODk conditions only (using only NODk condition variance data), with subtests performed as per the B10k tests. 'Overall sig' tests for differences between any conditions, with subtests comparing B10k versus NODk populations at the pre-selection stage ('PreS'), during negative selection ('S-') and during positive selection ('S+'). For each probeset the average expression is given as log2 scaled and normalized and arithmetic normalized data, for each condition. Individual data for arithmetic normalized expression are also given for each replicate (PreS 1, 2 and 3 come from non-HEL transgenic hosts, 4, 5 and 6 come from insHEL transgenic hosts). Individual Affymetrix 'Present/Moderate/Absent' calls are given for each replicate. In the 'Raw data' worksheet the individual data for arithmetic normalized expression are given for each probeset, regardless of statistical analysis, along with the Affymetrix target description. Additional data 2 lists the assignment of probesets to expression patterns. The 'B10' worksheet contains information on the expression profiles (patterns) for all probesets that showed a significant difference between the means on the transformed scale, using a threshold of p < 0.05 for the B10k strain. Along with the Affymetrix ID are listed the gene symbol, the pattern to which the probeset is assigned in the B10k strain, the pattern to which the probeset is assigned in the NODk strain, the average arithmetic (unlogged) Affymetrix MAS 5.0 signal values for each condition in the B10k and NODk strains of pre-selection ('PreS'), positive selection ('S+') and negative selection ('S-'), and the annotated molecular function. In the 'NOD' worksheet, all probesets that meet the significant cut-off for assignment to an expression pattern in the NODk strain are listed, in the same manner as the 'B10' worksheet. In the 'B10 vs NOD' worksheet, all probesets that meet the significant cut-off for assignment to a differential expression group between the B10k and NODk strains are listed. Along with Affymetrix ID and gene symbol are listed the group number, the fold-change between the relevant B10k and NODk conditions (dependent on differential expression group), the p values for differential expression between B10k and NODk strains for each condition, the average arithmetic normalized expression for each condition in the B10k and NODk strains, and the annotated molecular function.
Kappler JW, Roehm N, Marrack P: T cell tolerance by clonal elimination in the thymus. Cell. 1987, 49: 273-280. 10.1016/0092-8674(87)90568-X.
Starr TK, Jameson SC, Hogquist KA: Positive and negative selection of T cells. Annu Rev Immunol. 2003, 21: 139-176. 10.1146/annurev.immunol.21.120601.141107.
Anderson MS, Venanzi ES, Klein L, Chen Z, Berzins S, Turley SJ, Von Boehmer H, Bronson R, Dierich A, Benoist C, Mathis D: Projection of an immunological self shadow within the thymus by the aire protein. Science. 2002, 298: 1395-1401. 10.1126/science.1075958.
Liston A, Gray DH, Lesage S, Fletcher AL, Wilson J, Webster KE, Scott HS, Boyd RL, Peltonen L, Goodnow CC: Gene dosage-limiting role of Aire in thymic expression, clonal deletion and organ-specific autoimmunity. J Exp Med. 2004, 200: 1015-1026. 10.1084/jem.20040581.
Liston A, Lesage S, Wilson J, Peltonen L, Goodnow CC: Aire regulates negative selection of organ-specific T cells. Nat Immunol. 2003, 4: 350-354. 10.1038/ni906.
Todd JA, Wicker LS: Genetic protection from the inflammatory disease type 1 diabetes in humans and animal models. Immunity. 2001, 15: 387-395. 10.1016/S1074-7613(01)00202-3.
Kishimoto H, Sprent J: A defect in central tolerance in NOD mice. Nat Immunol. 2001, 2: 1025-1031. 10.1038/ni726.
Lesage S, Hartley SB, Akkaraju S, Wilson J, Townsend M, Goodnow CC: Failure to censor forbidden clone of CD4 T cells in autoimmune diabetes. J Exp Med. 2002, 196: 1175-1188. 10.1084/jem.20020735.
Choisy-Rossi CM, Holl TM, Pierce MA, Chapman HD, Serreze DV: Enhanced pathogenicity of diabetogenic T cells escaping a non-MHC gene-controlled near death experience. J Immunol. 2004, 173: 3791-3800.
Liston A, Lesage S, Gray DH, O'Reilly LA, Strasser A, Fahrer AM, Boyd RL, Wilson J, Baxter AG, Gallo EM, et al: Generalised resistance to thymic deletion in the NOD mouse: a polygenic trait characterized by defective induction of Bim. Immunity. 2004, 21: 817-830.
Zucchelli S, Holler P, Yamagata T, Roy M, Benoist C, Mathis D: Defective central tolerance induction in NOD mice: genomics and genetics. Immunity. 2005, 22: 385-396. 10.1016/j.immuni.2005.01.015.
Eaves IA, Wicker LS, Ghandour G, Lyons PA, Peterson LB, Todd JA, Glynne RJ: Combining mouse congenic strains and microarray gene expression analyses to study a complex trait: the NOD model of type 1 diabetes. Genome Res. 2002, 12: 232-243. 10.1101/gr.214102. Article published online before print in January 2002.
DeRyckere D, Mann DL, DeGregori J: Characterization of transcriptional regulation during negative selection in vivo. J Immunol. 2003, 171: 802-811.
Schmitz I, Clayton LK, Reinherz EL: Gene expression analysis of thymocyte selection in vivo. Int Immunol. 2003, 15: 1237-1248. 10.1093/intimm/dxg125.
Martin S, Bevan MJ: Antigen-specific and nonspecific deletion of immature cortical thymocytes caused by antigen injection. Eur J Immunol. 1997, 27: 2726-2736.
McClintick JN, Edenberg HJ: Effects of filtering by Present call on analysis of microarray experiments. BMC Bioinformatics. 2006, 7: 49-10.1186/1471-2105-7-49.
Choe SE, Boutros M, Michelson AM, Church GM, Halfon MS: Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset. Genome Biol. 2005, 6: R16-10.1186/gb-2005-6-2-r16.
Seo J, Hoffman EP: Probe set algorithms: is there a rational best bet?. BMC Bioinformatics. 2006, 7: 395-10.1186/1471-2105-7-395.
Qin LX, Beyer RP, Hudson FN, Linford NJ, Morris DE, Kerr KF: Evaluation of methods for oligonucleotide array data via quantitative real-time PCR. BMC Bioinformatics. 2006, 7: 23-10.1186/1471-2105-7-23.
Gene expression omnibus. [http://www.ncbi.nlm.nih.gov/geo/]
Quackenbush J: Computational analysis of microarray data. Nat Rev Genet. 2001, 2: 418-427. 10.1038/35076576.
Mitchell TC, Hildeman D, Kedl RM, Teague TK, Schaefer BC, White J, Zhu Y, Kappler J, Marrack P: Immunological adjuvants promote activated T cell survival via induction of Bcl-3. Nat Immunol. 2001, 2: 397-402.
Muljo SA, Ansel KM, Kanellopoulou C, Livingston DM, Rao A, Rajewsky K: Aberrant T cell differentiation in the absence of Dicer. J Exp Med. 2005, 202: 261-269. 10.1084/jem.20050678.
Chen F, Chen D, Rothenberg EV: Specific regulation of fos family transcription factors in thymocytes at two developmental checkpoints. Int Immunol. 1999, 11: 677-688. 10.1093/intimm/11.5.677.
Matsuyama T, Kimura T, Kitagawa M, Pfeffer K, Kawakami T, Watanabe N, Kundig TM, Amakawa R, Kishihara K, Wakeham A, et al: Targeted disruption of IRF-1 or IRF-2 results in abnormal type-I IFN gene induction and aberrant lymphocyte development. Cell. 1993, 75: 83-97.
Tabrizifard S, Olaru A, Plotkin J, Fallahi-Sichani M, Livak F, Petrie HT: Analysis of transcription factor expression during discrete stages of postnatal thymocyte differentiation. J Immunol. 2004, 173: 1094-1102.
Hartenstein B, Teurich S, Hess J, Schenkel J, Schorpp-Kistner M, Angel P: Th2 cell-specific cytokine expression and allergen-induced airway inflammation depend on JunB. EMBO J. 2002, 21: 6321-6329. 10.1093/emboj/cdf648.
Huang YH, Li D, Winoto A, Robey EA: Distinct transcriptional programs in thymocytes responding to T cell receptor, Notch, and positive selection signals. Proc Natl Acad Sci USA. 2004, 101: 4936-4941. 10.1073/pnas.0401133101.
Trama J, Go WY, Ho SN: The osmoprotective function of the NFAT5 transcription factor in T cell development and activation. J Immunol. 2002, 169: 5477-5488.
Carrillo-Vico A, Garcia-Perganeda A, Naji L, Calvo JR, Romero MP, Guerrero JM: Expression of membrane and nuclear melatonin receptor mRNA and protein in the mouse immune system. Cell Mol Life Sci. 2003, 60: 2272-2278. 10.1007/s00018-003-3207-4.
Yu Q, Park JH, Doan LL, Erman B, Feigenbaum L, Singer A: Cytokine signal transduction is suppressed in preselection double-positive thymocytes and restored by positive selection. J Exp Med. 2006, 203: 165-175. 10.1084/jem.20051836.
Winrow CJ, Pankratz DG, Vibat CR, Bowen TJ, Callahan MA, Warren AJ, Hilbush BS, Wynshaw-Boris A, Hasel KW, Weaver Z, et al: Aberrant recombination involving the granzyme locus occurs in Atm-/- T-cell lymphomas. Hum Mol Genet. 2005, 14: 2671-2684. 10.1093/hmg/ddi301.
Niederberger N, Buehler LK, Ampudia J, Gascoigne NR: Thymocyte stimulation by anti-TCR-beta, but not by anti-TCR-alpha, leads to induction of developmental transcription program. J Leukocyte Biol. 2005, 77: 830-841. 10.1189/jlb.1004608.
Simarro M, Lanyi A, Howie D, Poy F, Bruggeman J, Choi M, Sumegi J, Eck MJ, Terhorst C: SAP increases FynT kinase activity and is required for phosphorylation of SLAM and Ly9. Int Immunol. 2004, 16: 727-736. 10.1093/intimm/dxh074.
Opferman JT, Letai A, Beard C, Sorcinelli MD, Ong CC, Korsmeyer SJ: Development and maintenance of B and T lymphocytes requires antiapoptotic MCL-1. Nature. 2003, 426: 671-676. 10.1038/nature02067.
Kappes DJ, He X, He X: CD4-CD8 lineage commitment: an inside view. Nat Immunol. 2005, 6: 761-766. 10.1038/ni1230.
Kinashi T, Katagiri K: Regulation of immune cell adhesion and migration by regulator of adhesion and cell polarization enriched in lymphoid tissues. Immunology. 2005, 116: 164-171. 10.1111/j.1365-2567.2005.02214.x.
Wang J, Vock VM, Li S, Olivas OR, Wilkinson MF: A quality control pathway that down-regulates aberrant T-cell receptor (TCR) transcripts by a mechanism requiring UPF2 and translation. J Biol Chem. 2002, 277: 18489-18493. 10.1074/jbc.M111781200.
Winandy S, Wu L, Wang JH, Georgopoulos K: Pre-T cell receptor (TCR) and TCR-controlled checkpoints in T cell differentiation are set by Ikaros. J Exp Med. 1999, 190: 1039-1048. 10.1084/jem.190.8.1039.
Arsenovic-Ranin N, Vucevic D, Okada N, Dimitrijevic M, Colic M: A monoclonal antibody to the rat Crry/p65 antigen, a complement regulatory membrane protein, stimulates adhesion and proliferation of thymocytes. Immunology. 2000, 100: 334-344. 10.1046/j.1365-2567.2000.00043.x.
Fontenot JD, Rasmussen JP, Gavin MA, Rudensky AY: A function for interleukin 2 in Foxp3-expressing regulatory T cells. Nat Immunol. 2005, 6: 1142-1151. 10.1038/ni1263.
Lee CK, Gimeno R, Levy DE: Differential regulation of constitutive major histocompatibility complex class I expression in T and B lymphocytes. J Exp Med. 1999, 190: 1451-1464. 10.1084/jem.190.10.1451.
Mick VE, Starr TK, McCaughtry TM, McNeil LK, Hogquist KA: The regulated expression of a diverse set of genes during thymocyte positive selection in vivo. J Immunol. 2004, 173: 5434-5444.
Uckun FM, Tuel-Ahlgren L, Obuz V, Smith R, Dibirdik I, Hanson M, Langlie MC, Ledbetter JA: Interleukin 7 receptor engagement stimulates tyrosine phosphorylation, inositol phospholipid turnover, proliferation, and selective differentiation to the CD4 lineage by human fetal thymocytes. Proc Natl Acad Sci USA. 1991, 88: 6323-6327. 10.1073/pnas.88.14.6323.
Josien R, Wong BR, Li HL, Steinman RM, Choi Y: TRANCE, a TNF family member, is differentially expressed on T cell subsets and induces cytokine production in dendritic cells. J Immunol. 1999, 162: 2562-2568.
Al-Shami A, Spolski R, Kelly J, Fry T, Schwartzberg PL, Pandey A, Mackall CL, Leonard WJ: A role for thymic stromal lymphopoietin in CD4(+) T cell development. J Exp Med. 2004, 200: 159-168. 10.1084/jem.20031975.
Leenders H, Whiffield S, Benoist C, Mathis D: Role of the forkhead transcription family member, FKHR, in thymocyte differentiation. Eur J Immunol. 2000, 30: 2980-2990. 10.1002/1521-4141(200010)30:10<2980::AID-IMMU2980>3.0.CO;2-9.
Sun G, Liu X, Mercado P, Jenkinson SR, Kypriotou M, Feigenbaum L, Galera P, Bosselut R: The zinc finger protein cKrox directs CD4 lineage differentiation during intrathymic T cell positive selection. Nat Immunol. 2005, 6: 373-381. 10.1038/ni1183.
Yamada M, Ishii N, Asao H, Murata K, Kanazawa C, Sasaki H, Sugamura K: Signal-transducing adaptor molecules STAM1 and STAM2 are required for T-cell development and survival. Mol Cell Biol. 2002, 22: 8648-8658. 10.1128/MCB.22.24.8648-8658.2002.
Betz UA, Muller W: Regulated expression of gp130 and IL-6 receptor alpha chain in T cell maturation and activation. Int Immunol. 1998, 10: 1175-1184. 10.1093/intimm/10.8.1175.
Puthier D, Joly F, Irla M, Saade M, Victorero G, Loriod B, Nguyen C: A general survey of thymocyte differentiation by transcriptional analysis of knockout mouse models. J Immunol. 2004, 173: 6109-6118.
Hurst LD, Pal C, Lercher MJ: The evolutionary dynamics of eukaryotic gene order. Nat Rev Genet. 2004, 5: 299-310. 10.1038/nrg1319.
Zhao Y, Tan J, Zhuang L, Jiang X, Liu ET, Yu Q: Inhibitors of histone deacetylases target the Rb-E2F1 pathway for apoptosis induction through activation of proapoptotic protein Bim. Proc Natl Acad Sci USA. 2005, 102: 16090-16095. 10.1073/pnas.0505585102.
Salojin K, Zhang J, Cameron M, Gill B, Arreaza G, Ochi A, Delovitch TL: Impaired plasma membrane targeting of Grb2-murine son of sevenless (mSOS) complex and differential activation of the Fyn-T cell receptor (TCR)-zeta-Cbl pathway mediate T cell hyporesponsiveness in autoimmune nonobese diabetic mice. J Exp Med. 1997, 186: 887-897. 10.1084/jem.186.6.887.
Ueda H, Howson JM, Esposito L, Heward J, Snook H, Chamberlain G, Rainbow DB, Hunter KM, Smith AN, Di Genova G, et al: Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease. Nature. 2003, 423: 506-511. 10.1038/nature01621.
Hamilton-Williams EE, Serreze DV, Charlton B, Johnson EA, Marron MP, Mullbacher A, Slattery RM: Transgenic rescue implicates beta2-microglobulin as a diabetes susceptibility gene in nonobese diabetic (NOD) mice. Proc Natl Acad Sci USA. 2001, 98: 11533-11538. 10.1073/pnas.191383798.
Ray R, Chen G, Vande Velde C, Cizeau J, Park JH, Reed JC, Gietz RD, Greenberg AH: BNIP3 heterodimerizes with Bcl-2/Bcl-X(L) and induces cell death independent of a Bcl-2 homology 3 (BH3) domain at both mitochondrial and nonmitochondrial sites. J Biol Chem. 2000, 275: 1439-1448. 10.1074/jbc.275.2.1439.
Imazu T, Shimizu S, Tagami S, Matsushima M, Nakamura Y, Miki T, Okuyama A, Tsujimoto Y: Bcl-2/E1B 19 kDa-interacting protein 3-like protein (Bnip3L) interacts with bcl-2/Bcl-xL and induces apoptosis by altering mitochondrial membrane permeability. Oncogene. 1999, 18: 4523-4529. 10.1038/sj.onc.1202722.
Lamy L, Ticchioni M, Rouquette-Jazdanian AK, Samson M, Deckert M, Greenberg AH, Bernard A: CD47 and the 19 kDa interacting protein-3 (BNIP3) in T cell apoptosis. J Biol Chem. 2003, 278: 23915-23921. 10.1074/jbc.M301869200.
Yook YH, Kang KH, Maeng O, Kim TR, Lee JO, Kang KI, Kim YS, Paik SG, Lee H: Nitric oxide induces BNIP3 expression that causes cell death in macrophages. Biochem Biophys Res Commun. 2004, 321: 298-305. 10.1016/j.bbrc.2004.06.144.
Guo K, Searfoss G, Krolikowski D, Pagnoni M, Franks C, Clark K, Yu KT, Jaye M, Ivashchenko Y: Hypoxia induces the expression of the pro-apoptotic gene BNIP3. Cell Death Differ. 2001, 8: 367-376. 10.1038/sj.cdd.4400810.
Wan J, Martinvalet D, Ji X, Lois C, Kaech SM, Von Andrian UH, Lieberman J, Ahmed R, Manjunath N: The Bcl-2 family pro-apoptotic molecule, BNIP3 regulates activation-induced cell death of effector cytotoxic T lymphocytes. Immunology. 2003, 110: 10-17. 10.1046/j.1365-2567.2003.01710.x.
Giatromanolaki A, Koukourakis MI, Sowter HM, Sivridis E, Gibson S, Gatter KC, Harris AL: BNIP3 expression is linked with hypoxia-regulated protein expression and with poor prognosis in non-small cell lung cancer. Clin Cancer Res. 2004, 10: 5566-5571. 10.1158/1078-0432.CCR-04-0076.
Reymond A, Meroni G, Fantozzi A, Merla G, Cairo S, Luzi L, Riganelli D, Zanaria E, Messali S, Cainarca S, et al: The tripartite motif family identifies cell compartments. EMBO J. 2001, 20: 2140-2151. 10.1093/emboj/20.9.2140.
Yamanouchi S, Kuwahara K, Sakata A, Ezaki T, Matsuoka S, Miyazaki J, Hirose S, Tamura T, Nariuchi H, Sakaguchi N: A T cell activation antigen, Ly6C, induced on CD4+ Th1 cells mediates an inhibitory signal for secretion of IL-2 and proliferation in peripheral immune responses. Eur J Immunol. 1998, 28: 696-707. 10.1002/(SICI)1521-4141(199802)28:02<696::AID-IMMU696>3.0.CO;2-N.
Jaakkola I, Merinen M, Jalkanen S, Hanninen A: Ly6C induces clustering of LFA-1 (CD11a/CD18) and is involved in subtype-specific adhesion of CD8 T cells. J Immunol. 2003, 170: 1283-1290.
Cervelli M, Polticelli F, Federico R, Mariottini P: Heterologous expression and characterization of mouse spermine oxidase. J Biol Chem. 2003, 278: 5271-5276. 10.1074/jbc.M207888200.
Xu H, Chaturvedi R, Cheng Y, Bussiere FI, Asim M, Yao MD, Potosky D, Meltzer SJ, Rhee JG, Kim SS, et al: Spermine oxidation induced by Helicobacter pylori results in apoptosis and DNA damage: implications for gastric carcinogenesis. Cancer Res. 2004, 64: 8521-8525. 10.1158/0008-5472.CAN-04-3511.
Wang Y, Hacker A, Murray-Stewart T, Fleischer JG, Woster PM, Casero RA: Induction of human spermine oxidase SMO(PAOh1) is regulated at the levels of new mRNA synthesis, mRNA stabilization, and newly synthesized protein. Biochem J. 2005, 386: 543-547. 10.1042/BJ20041084.
Bouillet P, Metcalf D, Huang DC, Tarlinton DM, Kay TW, Kontgen F, Adams JM, Strasser A: Proapoptotic Bcl-2 relative Bim required for certain apoptotic responses, leukocyte homeostasis, and to preclude autoimmunity. Science. 1999, 286: 1735-1738. 10.1126/science.286.5445.1735.
Bouillet P, Purton JF, Godfrey DI, Zhang LC, Coultas L, Puthalakath H, Pellegrini M, Cory S, Adams JM, Strasser A: BH3-only Bcl-2 family member Bim is required for apoptosis of autoreactive thymocytes. Nature. 2002, 415: 922-926. 10.1038/415922a.
Qi Z, Sun XH: Hyperresponse to T-cell receptor signaling and apoptosis of Id1 transgenic thymocytes. Mol Cell Biol. 2004, 24: 7313-7323. 10.1128/MCB.24.17.7313-7323.2004.
Celli CM, Jaiswal AK: Role of GRP58 in mitomycin C-induced DNA cross-linking. Cancer Res. 2003, 63: 6016-6025.
Luo X, Huang Y, Sheikh MS: Cloning and characterization of a novel gene PDRG that is differentially regulated by p53 and ultraviolet radiation. Oncogene. 2003, 22: 7247-7257. 10.1038/sj.onc.1207010.
Bergman ML, Cilio CM, Penha-Goncalves C, Lamhamedi-Cherradi SE, Lofgren A, Colucci F, Lejon K, Garchon HJ, Holmberg D: CTLA-4-/- mice display T cell-apoptosis resistance resembling that ascribed to autoimmune-prone non-obese diabetic (NOD) mice. J Autoimmun. 2001, 16: 105-113. 10.1006/jaut.2000.0474.
Vijayakrishnan L, Slavik JM, Illes Z, Greenwald RJ, Rainbow D, Greve B, Peterson LB, Hafler DA, Freeman GJ, Sharpe AH, et al: An autoimmune disease-associated CTLA-4 splice variant lacking the B7 binding domain signals negatively in T cells. Immunity. 2004, 20: 563-575. 10.1016/S1074-7613(04)00110-4.
Freeman GJ, Long AJ, Iwai Y, Bourque K, Chernova T, Nishimura H, Fitz LJ, Malenkovich N, Okazaki T, Byrne MC, et al: Engagement of the PD-1 immunoinhibitory receptor by a novel B7 family member leads to negative regulation of lymphocyte activation. J Exp Med. 2000, 192: 1027-1034. 10.1084/jem.192.7.1027.
Nishimura H, Honjo T, Minato N: Facilitation of beta selection and modification of positive selection in the thymus of PD-1-deficient mice. J Exp Med. 2000, 191: 891-898. 10.1084/jem.191.5.891.
Nishimura H, Nose M, Hiai H, Minato N, Honjo T: Development of lupus-like autoimmune diseases by disruption of the PD-1 gene encoding an ITIM motif-carrying immunoreceptor. Immunity. 1999, 11: 141-151. 10.1016/S1074-7613(00)80089-8.
Nishimura H, Okazaki T, Tanaka Y, Nakatani K, Hara M, Matsumori A, Sasayama S, Mizoguchi A, Hiai H, Minato N, Honjo T: Autoimmune dilated cardiomyopathy in PD-1 receptor-deficient mice. Science. 2001, 291: 319-322. 10.1126/science.291.5502.319.
Ansari MJ, Salama AD, Chitnis T, Smith RN, Yagita H, Akiba H, Yamazaki T, Azuma M, Iwai H, Khoury SJ, et al: The programmed death-1 (PD-1) pathway regulates autoimmune diabetes in nonobese diabetic (NOD) mice. J Exp Med. 2003, 198: 63-69. 10.1084/jem.20022125.
Nagashima T, Silva DG, Petrovsky N, Socha LA, Suzuki H, Saito R, Kasukawa T, Kurochkin IV, Konagaya A, Schonbach C: Inferring higher functional information for RIKEN mouse full-length cDNA clones with FACTS. Genome Res. 2003, 13: 1520-1533. 10.1101/gr.1019903.
Call DR, Borucki MK, Besser TE: Mixed-genome microarrays reveal multiple serotype and lineage-specific differences among strains of Listeria monocytogenes. J Clin Microbiol. 2003, 41: 632-639. 10.1128/JCM.41.2.632-639.2003.
Sawa T, Ohno-Machado L: A neural network-based similarity index for clustering DNA microarray data. Comput Biol Med. 2003, 33: 1-15. 10.1016/S0010-4825(02)00032-X.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005, 102: 15545-15550. 10.1073/pnas.0506580102.
Van Parijs L, Peterson DA, Abbas AK: The Fas/Fas ligand pathway and Bcl-2 regulate T cell responses to model self and foreign antigens. Immunity. 1998, 8: 265-274. 10.1016/S1074-7613(00)80478-1.
Borggrefe T, Wabl M, Akhmedov AT, Jessberger R: A B-cell-specific DNA recombination complex. J Biol Chem. 1998, 273: 17025-17035. 10.1074/jbc.273.27.17025.
Shinohara M, Terada Y, Iwamatsu A, Shinohara A, Mochizuki N, Higuchi M, Gotoh Y, Ihara S, Nagata S, Itoh H, et al: SWAP-70 is a guanine-nucleotide-exchange factor that mediates signaling of membrane ruffling. Nature. 2002, 416: 759-763. 10.1038/416759a.
Janoueix-Lerosey I, Jollivet F, Camonis J, Marche PN, Goud B: Two-hybrid system screen with the small GTP-binding protein Rab6. Identification of a novel mouse GDP dissociation inhibitor isoform and two other potential partners of Rab6. J Biol Chem. 1995, 270: 14801-14808. 10.1074/jbc.270.24.14801.
Lorenzi MV, Horii Y, Yamanaka R, Sakaguchi K, Miki T: FRAG1, a gene that potently activates fibroblast growth factor receptor by C-terminal fusion through chromosomal rearrangement. Proc Natl Acad Sci USA. 1996, 93: 8956-8961. 10.1073/pnas.93.17.8956.
Volkert MR, Elliott NA, Housman DE: Functional genomics reveals a family of eukaryotic oxidation protection genes. Proc Natl Acad Sci USA. 2000, 97: 14530-14535. 10.1073/pnas.260495897.
Trappe R, Ahmed M, Glaser B, Vogel C, Tascou S, Burfeind P, Engel W: Identification and characterization of a novel murine multigene family containing a PHD-finger-like motif. Biochem Biophys Res Commun. 2002, 293: 816-826. 10.1016/S0006-291X(02)00277-2.
Fujino T, Kondo J, Ishikawa M, Morikawa K, Yamamoto TT: Acetyl-CoA synthetase 2, a mitochondrial matrix enzyme involved in the oxidation of acetate. J Biol Chem. 2001, 276: 11420-11426. 10.1074/jbc.M008782200.
Salvatore P, Hanash CR, Kido Y, Imai Y, Accili D: Identification of sirm, a novel insulin-regulated SH3 binding protein that associates with Grb-2 and FYN. J Biol Chem. 1998, 273: 6989-6997. 10.1074/jbc.273.12.6989.
Caplan S, Hartnell LM, Aguilar RC, Naslavsky N, Bonifacino JS: Human Vam6p promotes lysosome clustering and fusion in vivo. J Cell Biol. 2001, 154: 109-122. 10.1083/jcb.200102142.
Mbikay M, Seidah NG, Chretien M: Neuroendocrine secretory protein 7B2: structure, expression and functions. Biochem J. 2001, 357: 329-342. 10.1042/0264-6021:3570329.
Ozawa M, Muramatsu T: Reticulocalbin, a novel endoplasmic reticulum resident Ca(2+)-binding protein with multiple EF-hand motifs and a carboxyl-terminal HDEL sequence. J Biol Chem. 1993, 268: 699-705.
Lagaudriere-Gesbert C, Newmyer SL, Gregers TF, Bakke O, Ploegh HL: Uncoating ATPase Hsc70 is recruited by invariant chain and controls the size of endocytic compartments. Proc Natl Acad Sci USA. 2002, 99: 1515-1520. 10.1073/pnas.042688099.
Diehl JA, Yang W, Rimerman RA, Xiao H, Emili A: Hsc70 regulates accumulation of cyclin D1 and cyclin D1-dependent protein kinase. Mol Cell Biol. 2003, 23: 1764-1774. 10.1128/MCB.23.5.1764-1774.2003.
Miyazawa H, Izumi M, Tada S, Takada R, Masutani M, Ui M, Hanaoka F: Molecular cloning of the cDNAs for the four subunits of mouse DNA polymerase alpha-primase complex and their gene expression during cell proliferation and the cell cycle. J Biol Chem. 1993, 268: 8111-8122.
Zerbe LK, Kuchta RD: The p58 subunit of human DNA primase is important for primer initiation, elongation, and counting. Biochemistry. 2002, 41: 4891-4900. 10.1021/bi016030b.
We thank S Lesage for advice and D Silva for probeset annotation. This work was supported by grants from the NHMRC and the Juvenile Diabetes Research Foundation.
Electronic supplementary material
Additional data file 1: In the 'Statistics and annotation' worksheet, basic annotation data are given for each differentially expressed probeset, with the Affymetrix ID number, the gene symbol, chromosome number and starting/ending location, cytogenetic band, gene ontology/molecular function and the Affymetrix target description. Means and gene expression values are given on the transformed scale. The worksheet also contains information on the significance level of analysis of variance tests. The tests were conducted on the transformed gene expression values for each probeset separately. The p value for significant differences between conditions is listed if the parent p value is less than 0.05. Only genes that showed a significant difference between all the means on this scale, using a threshold of p ≤ 0.005, are included in the worksheet. 'Within B10' tests for differences within the B10k conditions only (using only B10k condition variance data), with subtests 'Within pre' comparing the B10k pre-selection population sorted from insHEL transgenic and non-HEL transgenic hosts, 'PreS vs S-' comparing B10k pre-selection populations to B10k negative selection populations, 'PreS vs S+' comparing B10k pre-selection populations to B10k positive selection populations, and 'S+ vs S-' comparing B10k positive selection populations to B10k negative selection populations. 'Within NOD' tests for differences within the NODk conditions only (using only NODk condition variance data), with subtests performed as per the B10k tests. 'Overall sig' tests for differences between any conditions, with subtests comparing B10k versus NODk populations at the pre-selection stage ('PreS'), during negative selection ('S-') and during positive selection ('S+'). For each probeset the average expression is given as log2 scaled and normalized and arithmetic normalized data, for each condition. Individual data for arithmetic normalized expression are also given for each replicate (PreS 1, 2 and 3 come from non-HEL transgenic hosts, 4, 5 and 6 come from insHEL transgenic hosts). Individual Affymetrix 'Present/Moderate/Absent' calls are given for each replicate. In the 'Raw data' worksheet the individual data for arithmetic normalized expression are given for each probeset, regardless of statistical analysis, along with the Affymetrix target description (XLS 16 MB)
Additional data file 2: The 'B10' worksheet contains information on the expression profiles (patterns) for all probesets that showed a significant difference between the means on the transformed scale, using a threshold of p < 0.05 for the B10k strain. Along with the Affymetrix ID are listed the gene symbol, the pattern to which the probeset is assigned in the B10k strain, the pattern to which the probeset is assigned in the NODk strain, the average arithmetic (unlogged) Affymetrix MAS 5.0 signal values for each condition in the B10k and NODk strains of pre-selection ('PreS'), positive selection ('S+') and negative selection ('S-'), and the annotated molecular function. In the 'NOD' worksheet, all probesets that meet the significant cut-off for assignment to an expression pattern in the NODk strain are listed, in the same manner as the 'B10' worksheet. In the 'B10 vs NOD' worksheet, all probesets that meet the significant cut-off for assignment to a differential expression group between the B10k and NODk strains are listed. Along with Affymetrix ID and gene symbol are listed the group number, the fold-change between the relevant B10k and NODk conditions (dependent on differential expression group), the p values for differential expression between B10k and NODk strains for each condition, the average arithmetic normalized expression for each condition in the B10k and NODk strains, and the annotated molecular function (XLS 938 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.