Insights into female sperm storage from the spermathecal fluid proteome of the honeybee Apis mellifera
Genome Biology volume 10, Article number: R67 (2009)
Female animals are often able to store sperm inside their body - in some species even for several decades. The molecular basis of how females keep non-own cells alive is largely unknown, but since sperm cells are reported to be transcriptionally silenced and, therefore, limited in their ability to maintain their own function, it is likely that females actively participate in sperm maintenance. Because female contributions are likely to be of central importance for sperm survival, molecular insights into the process offer opportunities to observe mechanisms through which females manipulate sperm.
We used the honeybee, Apis mellifera, in which queens are highly polyandrous and able to maintain sperm viable for several years. We identified over a hundred proteins representing the major constituents of the spermathecal fluid, which females contribute to sperm in storage. We found that the gel profile of proteins from spermathecal fluid is very similar to the secretions of the spermathecal gland and concluded that the spermathecal glands are the main contributors to the spermathecal fluid proteome. A detailed analysis of the spermathecal fluid proteins indicates that they fall into a range of different functional groups, most notably enzymes of energy metabolism and antioxidant defense. A metabolic network analysis comparing the proteins detected in seminal fluid and spermathecal fluid showed that a more integrated network is present in the spermathecal fluid, which could facilitate long-term storage of sperm.
We present a large-scale identification of proteins in the spermathecal fluid of honeybee queens and provide insights into the molecular regulation of female sperm storage.
Sperm storage by females is widespread throughout the animal kingdom [1, 2] but amazingly little is known about how females are able to keep sperm cells viable over prolonged periods of time. In many species, females provide specialized morphological structures for sperm storage often known as spermathecae. Females 'interact' with and 'sustain' sperm that are stored in these structures through glandular secretions, produced, for example, by the spermathecal glands. These secretions contain proteins, metabolites and other chemicals in the honeybee Apis mellifera and spermathecal fluid has recently been shown to maintain sperm viability [6, 7]. Several proteins have been proposed to be responsible for this effect, such as the glycolytic enzyme triosephosphate isomerase and a number of antioxidant defense enzymes. In addition, high K+ concentrations and the high pH of the spermathecal fluid have been proposed to lower the metabolic rate of sperm in storage [5, 9, 10]. However, despite the spermatheca containing 5 to 10 mg of protein/ml, no systematic analysis of these female derived proteins has so far been conducted. As a consequence, our knowledge about the biochemical and physiological mechanisms that maintain sperm viability or the physiological costs associated with sperm storage is extremely limited. Furthermore, females have been hypothesized to bias paternity outcomes by manipulating sperm in storage. Consequently, sexual selection may influence the female contributions towards stored sperm as well.
The study of male contributions towards sperm, such as seminal fluids or male accessory gland secretions, has received much more attention [14–16]. Males transfer a complex mixture of components to the female along with sperm [13, 17–21], which have multiple effects on sperm viability or female physiology [6, 7] but some of these components also seem to be agents of sexual conflict [22–25]. It seems reasonable to assume that females have also evolved a complementary arsenal of components to support and manipulate sperm. This makes detailed studies of female sperm storage physiology and its interactions with sperm and/or seminal fluid timely. A crucial step to understand female influence on stored sperm is to identify the components provided by the female, and proteomic technologies offer the opportunity to investigate the female's arsenal.
Social hymenopteran insects (the bees, ants and wasps) are interesting model systems to study sperm storage by females because several species have taken sperm storage to spectacular extremes [11, 26, 27]. This can be seen in terms of both the total number of sperm stored as well as the efficiency by which sperm are kept alive over prolonged periods of time . A phenomenon common to many social hymenopteran insects is that queens only copulate during a brief period early in life [16, 29, 30]. In the absence of re-mating later in life, queens acquire and store a lifetime supply of sperm that often fixes the upper limit of a colony's size, longevity and fitness. Apart from the total number of initially stored sperm, queen lifetime fecundity is also influenced by her efficiency to keep sperm viable. Some social insect queens can not only live for several decades [26, 31], but they also maintain colonies of several million workers [11, 30, 32]. Selection is therefore expected to have maximized storage efficiency of sperm number  and sperm survival and minimized sperm number used per egg fertilization. Sperm storage induces costs for the female that are known to trade off with other female life history traits in leaf cutter ants  and bumblebees . Finally, in polyandrous species, ejaculates of several males can coexist within the spermatheca for years, but it remains to be investigated whether sperm competition or cryptic female choice occurs whilst sperm is in storage .
We have used the honeybee, A. mellifera, and present a proteomic identification of the female's contribution towards sperm by identifying proteins that females provide to sperm in storage. Honeybee queens are efficient sperm storers that initially store around 6 million sperm for up to 7 years, giving them an estimated potential to sire up to 1.7 million offspring (see  for a review on the honeybee mating system). Consequently, spermathecal fluid components are expected to maximize the survival of large numbers of sperm. Furthermore, honeybee queens are highly polyandrous and store sperm from several males. Consequently, females could use sperm storage to manipulate sperm and, thus, manipulate paternity success. An additional advantage of honeybees as a model system is that the availability of the honeybee genome sequence  allows the use of tandem mass spectrometry (MS/MS) to identify proteins [19, 20, 35]. We here identify the spermathecal fluid proteome of honeybee and compare it to recently published proteomic profiles of sperm and seminal fluid [19, 20] in order to understand the specific female contribution to sperm in storage.
The proteins of spermathecal fluid collected from dissected spermathecae were separated by one-dimensional SDS-PAGE (Figure 1). We compared this profile to extracted spermathecal wall proteins, hemolymph and sperm. In each case the protein profiles were distinct, showing that separation of these protein subsets could be achieved by our dissection and extraction protocols ( and data not shown). Protein profiles of spermathecal fluid were visually inspected on a total of 11 one-dimensional gels using 12 independent biological replicates for mated and 4 independent biological replicates for virgin queens. We found that specific protein profiles for spermathecal fluid can be consistently reproduced (Figure 1), in both technical and biological replicates and resemble those found in earlier studies [5, 7]. Modifications of our standardized extraction protocol resulted in no obvious abundance changes of protein profiles on the gels, indicating that our collection method is a reliable way to sample spermathecal fluid. We found a large overlap in the spermathecal fluid protein band profiles of mated and virgin queens (Figure 2). Furthermore, the protein profile of the spermathecal gland secretions is very similar to that of the spermathecal fluid, both for mated and virgin queens (Figure 2). The protein profiles of spermathecal fluid were very different from that of seminal fluid isolated from male ejaculates (Figure 2).
To identify the most abundant proteins present in the spermathecal fluid, we ran a total of four mass spectrometry analyses from four independent biological samples. Two sets of analyses were performed, one based on in-gel digested bands of one-dimensional SDS-PAGE (Figure 1) and a second based on liquid chromatography (LC)-MS/MS analysis of total protein tryptic digests. The latter were nested experiments each consisting of six LC-MS/MS experiments performed in series, with the peptides identified in each run excluded from the subsequent analysis to improve the depth of analysis (see Materials and methods).
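The nested design described above, in which peptides identified in one run are excluded from the next, can be illustrated with a minimal sketch. This is not the instrument software's logic: the function name `nested_runs`, the peptide labels, the intensities and the `per_run` limit are all hypothetical, and picking the most intense candidates is only a stand-in for data-dependent precursor selection.

```python
def nested_runs(peptides, n_runs=6, per_run=3):
    """Simulate serial runs with a growing exclusion list.

    peptides: dict mapping a peptide label to a signal intensity
              (hypothetical values, for illustration only).
    Returns a list holding the peptides picked in each run.
    """
    excluded = set()
    picked_per_run = []
    for _ in range(n_runs):
        # Only peptides not identified in an earlier run remain candidates.
        candidates = [p for p in peptides if p not in excluded]
        # Take the most intense candidates (stand-in for precursor selection).
        picked = sorted(candidates, key=lambda p: peptides[p], reverse=True)[:per_run]
        picked_per_run.append(picked)
        # Everything identified in this run is excluded from later runs.
        excluded.update(picked)
    return picked_per_run


toy = {"pepA": 900, "pepB": 700, "pepC": 650,
       "pepD": 120, "pepE": 90, "pepF": 40, "pepG": 15}
runs = nested_runs(toy, n_runs=3, per_run=3)
```

In this toy case the low-intensity peptides are only reached in the second and third runs, which is the sense in which exclusion improves the depth of analysis.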
A summary of all significant protein identifications is given in Table 1 (protein match data are presented in Additional data file 1). Our final analysis resulted in the identification of 122 different proteins across the four spermathecal fluid samples. This set of proteins included molecular chaperones, an array of enzymes involved in energy and amino acid metabolism, antioxidant enzymes, proteins involved in signaling pathways, structural proteins, and a range of proteins with unknown functions (Table 2).
We compared our list of 122 spermathecal proteins with the reported abundant proteins from bee sperm samples ; we found that only 10 (8%) proteins were detected in both the spermathecal fluid and this list of sperm proteins (Figure 3; Additional data file 1). We also detected five of these ten sperm proteins in the spermathecal fluid of virgin queens, so it is unlikely that these are contaminating sperm proteins but instead represent the expression of the same gene that queens secrete into the spermathecal fluid. Only 5 (4%) proteins were found in sperm samples in our previous publication from male ejaculates and also in the spermathecal fluid list from mated queens presented here (Figure 3). Comparison of the spermathecal list with the top 12 most abundant hemolymph proteins we have previously detected by mass spectrometry  also revealed no overlap. We have also compared the protein profiles of spermathecal fluid identified here and our previous analysis of seminal fluid  and again found substantial differences. Only 19 (16%) out of the set of 122 spermathecal proteins were also detected in this previously reported seminal fluid proteome. Sixteen of this set of 19 proteins were also present in the spermathecal fluid of virgin queens and, thus, cannot be considered as contaminants from male seminal fluid (Table 1; Additional data file 1). This provides evidence that while qualitative assessment of seminal fluid contamination in our spermathecal fluid samples was minimal at the depth of the analysis performed, some identical proteins are present, which appear to be expressed and secreted by both males into their ejaculate and by females into the spermatheca. Our dataset of 122 proteins also allowed a comparison of the spermathecal protein population of virgin and mated queens. We detected peptides for 61 proteins present in both virgin and mated queens (Figure 3), but each group also had unique sets of proteins not found in the other. 
We found that 38 (30%) spermathecal fluid proteins were only detected in mated queens and 23 (19%) proteins were only detected in virgin queens. Obviously, protein profiles differ between young, virgin and old inseminated queens, but our study was not able to distinguish whether these proteomic changes are caused by queen age or mating status. Future work will be needed to resolve this issue; however, aged virgin females are physiologically and technically extremely difficult to obtain.
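The split of the protein list into shared, mated-only and virgin-only groups is a plain set comparison. The sketch below assumes each sample is represented as a collection of protein accessions. The accession strings are made up, and `partition` is a hypothetical helper rather than part of the original analysis.

```python
def partition(mated, virgin):
    """Split two collections of protein accessions into three groups."""
    mated, virgin = set(mated), set(virgin)
    return {
        "shared": mated & virgin,        # detected in both queen types
        "mated_only": mated - virgin,    # detected only in mated queens
        "virgin_only": virgin - mated,   # detected only in virgin queens
    }


# Hypothetical accessions, for illustration only.
groups = partition(
    mated=["acc_01", "acc_02", "acc_03", "acc_04"],
    virgin=["acc_03", "acc_04", "acc_05"],
)

# The group sizes reported in the text add up to the full protein list.
reported = {"shared": 61, "mated_only": 38, "virgin_only": 23}
total = sum(reported.values())
```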
Spectral counts in our LC-MS/MS data from spermathecal fluid revealed that counts for particular proteins were sometimes substantially different between mated and virgin queens (Additional data file 2). This indicates that the protein concentrations might substantially differ between spermathecal fluid of mated and virgin queens. Future work is obviously needed to quantify the proteins with different spectral counts. To do this, biological replicates of spectral counts based on LC-MS/MS will be necessary, but were beyond the scope of the current study.
To further explore the metabolic network established in the spermathecal fluid, we created metabolic networks of spermathecal fluid and seminal fluid using data from the Kyoto Encyclopedia of Genes and Genomes (KEGG) [36, 37] associated with our identified proteins. This was then visualized with the Cytoscape software package . The resulting networks are presented in Figure 4 (see also an annotated version provided as Additional data file 3), where colored nodes (rounded squares) represent enzymes in different functional categories, metabolites are shown as small grey circles, while the reaction is shown as connecting lines between the enzymes and metabolite nodes. The two networks differ in their degree of connectivity and the number of hubs that join multiple reactions. In the seminal fluid network there are discrete metabolic reactions leading to six clusters of reactions plus the redox reaction of disulfide isomerase. This is consistent with sperm needing only to survive for a short period in seminal fluid and the substrates necessary for these reactions being pre-charged in seminal fluid prior to ejaculation. In contrast, the spermathecal fluid is a well-connected single metabolic entity. It contains 5 of the 14 enzyme nodes present in the seminal fluid, but also an extra 23 enzyme nodes that combine the 6 clusters in the seminal fluid into a single metabolic network. Obviously, the different metabolic steps are interlinked with many products representing the substrates for other reactions. This correlates with the requirement of spermathecal fluid to maintain homeostatic functions for years, perhaps with only a small set of entry metabolites. The terminal metabolite nodes of the network are potential substrates to be transported in or out of the spermatheca, across the spermathecal wall.
The spermathecal network shows the key features of biochemistry needed for sperm protection and maintenance. It shows a near complete glycolytic pathway that is absent from the seminal fluid and a large series of components for a vacuolar-like proton-pumping ATPase. It also contains a variety of antioxidant defenses, most shared with seminal fluid proteins, although it is often different gene products that catalyze the same reactions. These three networks interact through common ATP/ADP and NAD(P)/NAD(P)H pools (Additional data file 3).
Protons (H+) are presented as metabolites here and are heavily connected nodes (Additional data file 3); we kept these in the network given that metabolic maintenance of pH may be an important function in spermathecal fluid . However, removal of this 'currency metabolite'  does not significantly break the highly interconnected structure of the spermathecal fluid network, but it does further fragment the seminal fluid network (data not shown).
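The effect of removing a currency metabolite can be checked by counting connected components before and after the node is dropped. The following is a minimal sketch on two toy enzyme-metabolite networks. The node names and edges are invented and do not reproduce the networks in Figure 4, and `count_components` is a hypothetical helper.

```python
def count_components(edges, drop=frozenset()):
    """Count connected components of an undirected graph.

    edges: iterable of (enzyme, metabolite) pairs.
    drop:  nodes to remove before counting (e.g. a currency metabolite).
    """
    # Keep every node that is not dropped, even if it ends up isolated.
    graph = {n: set() for edge in edges for n in edge if n not in drop}
    for a, b in edges:
        if a in graph and b in graph:
            graph[a].add(b)
            graph[b].add(a)
    seen, count = set(), 0
    for start in graph:
        if start in seen:
            continue
        count += 1
        stack = [start]  # depth-first walk over one component
        while stack:
            node = stack.pop()
            if node not in seen:
                seen.add(node)
                stack.extend(graph[node] - seen)
    return count


# Toy network linked through several shared metabolites.
dense = [("enz_1", "ATP"), ("enz_2", "ATP"), ("enz_2", "NADH"),
         ("enz_3", "NADH"), ("enz_1", "H+"), ("enz_3", "H+")]
# Toy network held together only by H+.
sparse = [("enz_4", "H+"), ("enz_5", "H+"),
          ("enz_4", "met_X"), ("enz_5", "met_Y")]

dense_after = count_components(dense, drop={"H+"})
sparse_after = count_components(sparse, drop={"H+"})
```

In the toy data, the network that shares several metabolites stays in one piece once H+ is removed, whereas the one held together only by H+ splits in two.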
The first large-scale identification of proteins that are present in the spermathecal fluid of honeybee queens is an essential step in uncovering the molecular regulation of long-term sperm storage. A comparison of identified protein lists between our spermathecal fluid samples and those from sperm and hemolymph revealed surprisingly little overlap. Our analysis of spermathecal fluid of virgin queens, which could not have been contaminated with sperm proteins, allowed us to further decrease the number of possible sperm contaminants to only five proteins that we subsequently removed from our final list to avoid any form of contamination from stored sperm. The detection of these remaining sperm proteins in spermathecal fluid does not necessarily result from contamination, as proteins might be expressed in both locations in vivo. Information about the proteinaceous contributions of females towards stored sperm is still very limited. An expressed sequence tag analysis in Drosophila detected 42 transcripts that are enriched for expression in the spermatheca but we noted that only 3 proteins within the honeybee spermathecal proteome list had significant sequence similarity to proteins predicted from these Drosophila transcripts. A set of 19 genes highly expressed in spermathecae was identified during analysis of the Hr39 gene in Drosophila, which is reported to regulate Drosophila female reproductive tract development and function. While there are orthologs for most of these proteins in Apis, only one of the Drosophila genes highly expressed in spermathecae (Hsc70-4) has orthologs in our protein set. These orthologs are among the heat shock protein molecular chaperones (Table 2). Recently released microarray analysis of virgin and mated spermatheca from Drosophila [43, 44] reveals a large number of spermatheca enriched transcripts. Sequence comparison with the Apis spermathecal proteins in Table 2 reveals that approximately 47% of the corresponding genes in Drosophila have significant spermatheca-enriched expression patterns, while a further 30% have significantly spermatheca-depleted expression patterns (Additional data file 4).
The spermathecal fluid proteins of the honeybee differ substantially from those we have reported in seminal fluid, supporting the idea that selection on seminal and spermathecal fluid was substantially different. Seminal fluid was selected to increase insemination and paternity success whereas spermathecal fluid evolved to maximize sperm survival. Nevertheless, we were surprised by the finding of a small 20% overlap between these two protein sets (Figure 2 and Table 1) given that seminal fluid and spermathecal fluid are expected to also share common roles, such as keeping sperm alive, reducing oxidative stress, nourishing sperm or protecting sperm from microbial attacks. The network analysis shows that while different proteins are involved, many biochemical classes and enzymatic functions are the same in both fluids. Indeed, previous research in ants and honeybees shows that both spermathecal fluid and seminal fluid keep sperm viable, but we here show that the specific proteins to achieve this differ substantially between the male and female. Sperm is obviously able to survive in both of these 'habitats' but it might have to undergo developmental changes at the beginning of its storage to achieve this. Our finding that spermathecal fluid of virgins, which are anticipating freshly ejaculated sperm to arrive in the spermatheca, differs, in part, from that in mated queens (Figure 3), where sperm has been stored for several months, supports this idea. Consequently, the sperm storage process might be more complicated than assumed so far, and may involve a period of adjustment when the female partially mimics the seminal fluid environment but then modifies the conditions. This may minimize the energetic costs of sperm storage over time or select for specific sperm traits and thereby manipulate the paternity success of her mates.
Some of the components of the spermathecal fluid are likely linked to the need for protection of the sperm from damaging infections or damaging chemical substances that might be detrimental to long term storage. For example, several chitinases were found that might be used in defense for degrading fungal cell walls [RefSeq Gi 66514614, 110760993, 66511507]. We also found an elaborate antioxidant defense system of nine different enzymes, including defenses against superoxide, hydrogen peroxide and lipid peroxides, that likely help prevent oxidative damage to sperm during their substantial hiatus. This is consistent with the evidence of a high activity of several antioxidant defense enzymes in spermathecal fluid . Also, we found a number of chelating proteins, several with roles in Fe2+ binding, which again may represent an antioxidant defense by preventing metal-catalyzed reactive oxygen species production and/or a scavenging of metals to prevent their use in the growth and proliferation of bacterial or fungal infections.
The most prominent aspect of the spermathecal metabolic network is glycolytic metabolism, which is a pathway for fructose degradation to organic acids and the production of both NADH and ATP (Figure 4). NADH will be needed to fuel the antioxidant enzymes noted above. ATP from this extracellular glycolysis could be used to fuel the vacuolar-like ATPase (Table 1). In many animal cell types, such an ATPase normally hydrolyzes cellular ATP and is used to pump protons out of cells, leading to raising of cellular pH and activation of K+ influx channels that replace the expelled H+ with K+ [45, 46]. The long established basic pH and high K+ concentration in the spermatheca that has been hypothesized to slow sperm metabolic rate [47, 48] could be catalyzed by such an ATPase pump activity. However, to our knowledge, such pumps have not been reported to operate in the direction required here, raising extracellular pH, so the link between vacuolar-like ATPases and the spermathecal pH and K+ concentrations requires more research.
An intriguing possibility is that this glycolytic pathway is also feeding carbon substrates to the sperm to maintain their own internal metabolism. Fructose as a carbon source seems to be of specific importance for honeybees and the dominance of glycolytic pathway proteins in male reproductive organs has been reported earlier. Klenk et al. previously identified the glycolytic enzyme triosephosphate isomerase as a mating enhanced component of the honeybee spermathecal fluid. Together, this evidence points to an extracellular glycolytic pathway operating in the spermathecal fluid. This could suggest a change in primary carbon substrate for sperm, because in seminal fluid they are fueled by their own internal energy stores. This switch in substrates may be critical in establishing a new, slower metabolic rate required for long-term homeostasis in the spermatheca.
Our large-scale identification of proteins within the spermathecal fluid of honeybee queens offers an intriguing insight into the details of female sperm storage. Our data indicate that females provide stored sperm with a complex mixture of proteins that form a metabolically connected network. They also suggest that some essential physiological requirements of sperm have effectively been 'outsourced' and are now provided by the female. In this respect, sperm storage could be regarded as a specialized form of endosymbiosis between males and females, post-copulation but pre-fertilization.
Materials and methods
Spermathecal fluid was collected by dissecting virgin and mated queens using a Leica stereo microscope at 40× to 62× magnification. All dissections were performed with fine watchmaker forceps (INOX 5, Biology) and in Hayes solution (9.0 g/l NaCl, 0.2 g/l CaCl2, 0.2 g/l KCl, 0.1 g/l NaHCO3, pH 8.7). Spermathecal fluid was sampled from a total of 206 mated and 64 virgin queens. Mated queens were egg laying mother queens at least 9 months of age and were provided by several local beekeepers. Virgin queens were obtained by grafting and used at an average age of 6 days, being the age when queens typically perform their nuptial flights. To sample spermathecal fluid, queens were briefly anesthetized in CO2 for 20 to 30 seconds after which their spermathecae were immediately dissected and transferred to a drop of Hayes solution. The dense tracheal network surrounding the spermatheca was carefully removed. The spermatheca was then washed in a second drop of Hayes to minimize contamination by hemolymph. The spermatheca was then placed on a microscopic slide. After the removal of remaining Hayes an injection needle was used to pierce a small hole into the spermathecal wall. The spermathecal fluid was then collected out of the lumen using a fine glass capillary. For each biological sample we pooled samples from 20 to 30 queens. For the samples from mated queens spermathecal fluid was separated from the surrounding stored sperm by centrifugation for 25 minutes at 850 × g at 4°C. The supernatant (spermathecal fluid) was collected and centrifuged again at 18,620 × g for 10 minutes at 4°C to remove remaining sperm. Samples from virgin queens were briefly centrifuged at 10,000 × g but not processed any further and all spermathecal fluid samples were frozen at -80°C prior to further analyses. To collect secretions of the spermathecal glands, we collected up to 20 glands for each biological sample and kept them in 50 μl of Hayes on ice. The glands were then carefully opened at their distal end using watchmaker forceps to allow the gland content to dissolve into the surrounding solution. Separation of the gland tissue from the dissolved gland secretions was done by centrifugation for 20 minutes at 850 × g and at 4°C.
Protein profiling using gel electrophoresis
Profiling of spermathecal fluid proteins was performed by SDS-PAGE using either Bio-Rad Criterion precast gels (Hercules, CA, USA; 10 to 20% (w/v) acrylamide, HCl, 1 mm, 18 comb) or larger 12% (w/v) acrylamide homemade slab gels. Gels were run at 30 mA, fixed in fixing solution (40% methanol, 10% acetic acid) for an hour and stained overnight with colloidal Coomassie blue (G 250). Gels were kept in 0.5% (v/v) phosphoric acid at 4°C prior to protein identifications using peptide mass spectrometry.
Identification of proteins from gels using tandem mass spectrometry
Colloidal Coomassie blue stained protein spots were cut from gels and destained twice in 10 mM Na2HCO3 with 50% (v/v) acetonitrile. Samples were dried at 50°C before being rehydrated with 15 μl of digestion solution (10 mM NH4CO3 with 12.5 μg/ml trypsin (Invitrogen, Carlsbad, CA, USA) and 0.01% (v/v) trifluoroacetic acid) and incubated overnight at 37°C. Peptides produced from trypsinization were twice extracted from gel plugs using 15 μl acetonitrile. The supernatant was then collected and plugs washed twice with 15 μl of 50% (v/v) acetonitrile and 5% (v/v) formic acid and combined with initial supernatant. The pooled extracts were dried by vacuum centrifugation and stored at 4°C before being analyzed by mass spectrometry.
Gel spot protein identifications
Samples from excised gel pieces were analyzed on an Agilent XCT Ultra IonTrap mass spectrometer with an electrospray ionization (ESI) source equipped with a low flow nebuliser in positive mode and controlled by Chemstation (rev. B.01.03 ; Agilent Technologies, Santa Clara, CA, USA) and MSD Trap Control software version 6.1 (Bruker Daltonik GmbH, Bremen, Germany). Peptides were eluted from a self-packed Microsorb (Varian Inc., Palo Alto, CA, USA) C18 (5 μm, 100 Å) reverse phase column (0.5 × 50 mm) using an Agilent Technologies 1100 series capillary liquid chromatography system at 10 μl/minute using a 9 minute acetonitrile gradient (5 to 60% (v/v)) in 0.1% (v/v) formic acid at a regulated temperature of 50°C. The method used for initial ion detection utilized a mass range of 200 to 1,400 m/z with scan mode set to 'standard' (8,100 m/z per second) and ion charge control conditions set at 250,000 and 3 averages taken per scan. Smart mode parameter settings were employed using a target of 800 m/z, a compound stability factor of 90%, a trap drive level of 80% and optimize set to 'normal'. Ions were selected for MS/MS after reaching an intensity of 80,000 cps and two precursor ions were selected from the initial mass spectrometry scan. MS/MS conditions employed SmartFrag for ion fragmentation, a scan range of 70 to 2,200 m/z using an average of 3 scans, the exclusion of singly charged ions option and ion charge control conditions set to 200,000 in Ultra scan mode (26,000 m/z per second). Resulting MS/MS spectra were exported from the DataAnalysis for LC/MSD Trap version 3.3 (build 149) software package (Bruker Daltonik GmbH) using default parameters for AutoMS(n) and compound 'export'. The resulting .mgf files were then searched as outlined below.
Whole lysate protein identifications
Spermathecal fluid proteins of mated as well as virgin queens were also analyzed with a non-gel approach, using complex mixture LC-MS/MS analysis. Spermathecal samples were digested overnight at 37°C with trypsin and insoluble components were removed by centrifugation at 20,000 × g for 10 minutes. Samples were analyzed on an Agilent 6510 quadrupole time-of-flight (Q-TOF) mass spectrometer with an HPLC Chip Cube source. The chip consisted of a 40 nl enrichment column (Zorbax 300SB-C18 5 u) and a 150 mm separation column (Zorbax 300SB-C18 5 u) driven by Agilent Technologies 1100 series nano/capillary liquid chromatography system. Both systems were controlled by MassHunter Workstation Data Acquisition for Q-TOF (version B.01.02, build 65.4, Patches 1,2,3,4; Agilent Technologies). Peptides were loaded onto the trapping column at 4 μl min-1 in 5% (v/v) acetonitrile and 0.1% (v/v) formic acid with the chip switched to enrichment and using the capillary pump. The chip was then switched to separation and peptides eluted during a 1 h gradient (5% acetonitrile to 40% acetonitrile) directly into the mass spectrometer. The mass spectrometer was run in positive ion mode and scans run over a range of 275 to 1,500 m/z and at 4 spectra s-1. Precursor ions were selected for auto MS/MS at an absolute threshold of 500 and a relative threshold of 0.01, with a maximum of 3 precursors per cycle, and active exclusion set at 2 spectra and released after 1 minute. Precursor charge-state selection and preference was set to 2+ and then 3+ and precursors selected by charge then abundance. Resulting MS/MS spectra were opened in MassHunter Workstation Qualitative Analysis (version B.01.02, build 18.104.22.168, Patches 3; Agilent Technologies) and MS/MS compounds detected by 'Find Auto MS/MS' using default settings. The resulting compounds were then exported as mzdata files that were then searched as outlined below.
Mass spectra output files were analyzed against the predicted A. mellifera peptide set (PreRelease2, 11,069 sequences; 5,989,390 residues) from BeeBase using the Mascot search engine version 2.2.03 (Matrix Science, Boston, MA, USA). Gel spot searches were conducted using error tolerances of ± 1.2 Da for MS and ± 0.6 Da for MS/MS, 'Max missed cleavages' set to 1, the Oxidation (M) variable modification, the instrument set to ESI-TRAP and peptide charge set at '2+ and 3+'. Results were filtered using 'Standard scoring', 'Max. number of hits' set to 20, 'Significance threshold' at P < 0.05. Complex lysate searches were conducted using error tolerances of ± 100 ppm for MS and ± 0.5 Da for MS/MS, 'Max missed cleavages' set to 1, the Oxidation (M) variable modification, the instrument set to ESI-Q-TOF and peptide charge set at '2+ and 3+'. Results were filtered using 'MUDPIT scoring', 'Max. number of hits' set to 20, 'Significance threshold' at P < 0.05. Lists of the spermathecal fluid protein sets identified for the various samples and scores for matches are provided as Additional data file 1. To build the protein list, we applied conservative approaches to minimize false positives. Protein matches were only claimed if at least two distinct peptides were detected per protein and MOWSE (molecular weight search) scores were higher than 50 (P < 0.05 significance level is a score >37). False discovery rate analysis of the trypsin digested spermathecal fluid samples from virgin and mated queens against a decoy randomized A. mellifera protein set (PreRelease2, 11,069 sequences; 5,989,390 residues) revealed a <2.5% false discovery rate for the virgin queen sample and a <2.5% false discovery rate for the sample from mated queens.
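The two acceptance criteria for a protein match (at least two distinct peptides and a score above 50) amount to a simple filter. The sketch below is illustrative only: the record layout, the protein labels, the peptide strings and the function name `accept_matches` are hypothetical and do not reflect the Mascot export format.

```python
def accept_matches(hits, min_peptides=2, min_score=50):
    """Return the ids of protein matches that pass both criteria."""
    accepted = []
    for hit in hits:
        # Count distinct peptide sequences; repeated spectra of one peptide count once.
        distinct = len(set(hit["peptides"]))
        # The score must be strictly higher than the cutoff.
        if distinct >= min_peptides and hit["score"] > min_score:
            accepted.append(hit["id"])
    return accepted


# Hypothetical search results, for illustration only.
hits = [
    {"id": "prot_1", "score": 112, "peptides": ["AAK", "LLR", "AAK"]},  # passes
    {"id": "prot_2", "score": 75, "peptides": ["VVK"]},                 # one peptide only
    {"id": "prot_3", "score": 41, "peptides": ["GGR", "TTK"]},          # score too low
]
kept = accept_matches(hits)
```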
Each protein sequence identified from the Apis protein set was submitted to a BLAST search to identify homologous proteins from insects and other organisms. This process was used to confirm or modify the functional annotation of the proteins from the PreRelease2 dataset, and then each protein was placed into a functional category according to its annotation and manual literature searches where necessary.
Network analysis and visualization
From the KEGG database [36, 37] of biochemical pathways, proteins identified in the present study (Table 1) and in the honeybee seminal fluid proteome of Baer et al. were associated with unique ID 'dame' entries specific to A. mellifera enzymes. Following this step, enzyme commission (EC) numbers, enzyme names and reactions associated with these KEGG IDs, where these exist, were extracted with a Perl script from the 'enzyme' file downloaded from the KEGG ftp site. Proteins for which no EC number could be assigned typically have unknown function or are responsible for non-enzymatic processes. A total of 41 of the honey bee proteins (using the PreRelease2 accession numbers) in our spermathecal set shown in Table 1 were assigned EC numbers in this manner, making a non-redundant set of 33 enzyme nodes and 70 metabolites. Similarly, the seminal fluid proteins of Baer et al. yielded a non-redundant set of 16 enzyme nodes and 47 metabolites.
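The extraction step can be sketched as follows. The original Perl script is not reproduced in this paper, so this Python version is only an illustration of the idea. It assumes the KEGG 'enzyme' flat-file layout in which each record starts with an ENTRY line containing the EC number, field names occupy the first 12 columns, continuation lines leave those columns blank, and records end with `///`. The function name `parse_enzyme_records` is hypothetical.

```python
import re
from typing import Dict, Iterable, List, Tuple

EC_PATTERN = re.compile(r"EC (\d+\.\d+\.\d+\.\d+)")


def parse_enzyme_records(
    lines: Iterable[str],
    wanted_fields: Tuple[str, ...] = ("NAME", "REACTION"),
) -> Dict[str, Dict[str, List[str]]]:
    """Map each EC number to the NAME and REACTION lines of its record."""
    records: Dict[str, Dict[str, List[str]]] = {}
    ec = None
    current_field = None
    current: Dict[str, List[str]] = {}
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("///"):          # end of one enzyme record
            if ec is not None:
                records[ec] = current
            ec, current_field, current = None, None, {}
            continue
        key, value = line[:12].strip(), line[12:].strip()
        if key:                              # a new field starts on this line
            current_field = key
        if current_field == "ENTRY":
            match = EC_PATTERN.search(line)
            if match:
                ec = match.group(1)
        elif current_field in wanted_fields and value:
            # Each physical line is stored separately; text wrapped across
            # lines would need to be joined in a fuller implementation.
            current.setdefault(current_field, []).append(value.rstrip(";"))
    return records
```

Restricting the parsed records to the EC numbers assigned to the identified proteins would then give the enzyme nodes, and the metabolites named in their reactions would give the metabolite nodes.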
After the recovery of these data, the set of unique EC numbers and biochemical reactions was parsed to generate a simple interaction format (SIF) file representing a metabolic network. The SIF file and other data, such as GB codes associated with EC numbers, enzyme names, and node types (enzyme or metabolite), were loaded into the Cytoscape software (version 2.6.0) for network visualization and analysis. Network images were exported from Cytoscape as .svg files, imported into Adobe Illustrator and modified visually for presentation purposes.
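A minimal Python sketch of the SIF generation step is given below. It assumes reactions are written as plain text of the form `A + B = C + D`, and it links each enzyme (by EC number) to every metabolite in its reactions, giving a bipartite enzyme-metabolite network. The function names and the edge label `em` are hypothetical. Fields are tab-delimited on the understanding that Cytoscape then treats spaces as part of node names, which matters because many metabolite names contain spaces.

```python
import re
from typing import Dict, List


def reaction_metabolites(reaction: str) -> List[str]:
    """Split 'A + B = C + D' into unique metabolite names, in order."""
    names: List[str] = []
    for side in reaction.split(" = "):
        for term in side.split(" + "):
            # Drop a leading stoichiometric coefficient such as '2 '.
            name = re.sub(r"^\d+\s+", "", term.strip())
            if name and name not in names:
                names.append(name)
    return names


def to_sif(
    reactions_by_ec: Dict[str, List[str]],
    interaction: str = "em",   # hypothetical enzyme-metabolite edge label
) -> List[str]:
    """Return one tab-delimited SIF line per enzyme-metabolite pair."""
    lines: List[str] = []
    for ec in sorted(reactions_by_ec):
        seen: List[str] = []
        for reaction in reactions_by_ec[ec]:
            for metabolite in reaction_metabolites(reaction):
                if metabolite not in seen:
                    seen.append(metabolite)
                    lines.append(f"{ec}\t{interaction}\t{metabolite}")
    return lines
```

Writing the returned lines to a text file, one per line, gives a network in which metabolites shared between enzymes become the connecting nodes.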
Additional data files
The following additional data are available with the online version of this paper: a table showing identification of proteins in honeybee spermathecal fluid by MS/MS analysis of two one-dimensional gels and two gel-free analyses of tryptic peptides (Additional data file 1); a table listing peptide counts from mated and virgin spermathecal fluid using tandem MS analysis of gel-free analyses of tryptic peptides (Additional data file 2); a figure illustrating the metabolic network of seminal and spermathecal fluid (Additional data file 3); a table listing abundances of Drosophila transcripts with sequence similarity to the proteins found in Apis spermatheca (Additional data file 4).
Abbreviations
KEGG: Kyoto Encyclopedia of Genes and Genomes
MOWSE: molecular weight search
MS/MS: tandem mass spectrometry
NCBI: National Center for Biotechnology Information
Q-TOF: quadrupole time-of-flight mass spectrometer.
Birkhead TR, Moller AP: Sperm Competition and Sexual Selection. 1998, New York: Academic Press
Eberhard WG: Sexual Selection and Animal Genitalia. 1985, Cambridge, Massachusetts: Harvard University Press
Eberhard WG: Female Control: Sexual Selection by Cryptic Female Choice. 1996, Princeton, New Jersey: Princeton University Press
Ruttner F, Koeniger G: The filling of the spermatheca of the honey bee queen: active migration or passive transport of the spermatozoa?. Z vergl Physiol. 1971, 72: 411-422. 10.1007/BF00300712.
Klenk M, Koeniger G, Koeniger N, Fasold H: Proteins in spermathecal gland secretion and spermathecal fluid and the properties of a 29 kDa protein in queens of Apis mellifera. Apidologie. 2004, 35: 371-381. 10.1051/apido:2004029.
den Boer SP, Boomsma JJ, Baer B: Seminal fluid enhances sperm viability in the leafcutter ant Atta colombica. Behav Ecol Sociobiol. 2008, 62: 1843-1849. 10.1007/s00265-008-0613-5.
den Boer SP, Boomsma JJ, Baer B: Honey bee males and queens use glandular secretions to enhance sperm viability before and after storage. J Insect Physiol. 2009, 55: 538-543. 10.1016/j.jinsphys.2009.01.012.
Collins AM, Williams V, Evans JD: Sperm storage and antioxidative enzyme expression in the honey bee, Apis mellifera. Insect Mol Biol. 2004, 13: 141-146. 10.1111/j.0962-1075.2004.00469.x.
Verma LR: An ionic basis for a possible mechanism of sperm survival in the spermatheca of the queen honey bee Apis mellifera. Comp Biochem Physiol. 1973, 44: 1325-1331. 10.1016/0300-9629(73)90272-7.
Gessner B: Transfer der Spermatozoen in die Spermatheca der Koenigin bei Apis mellifica carnica. 1973, Frankfurt am Main: Johann Wolfgang Goethe Universität
Baer B, Armitage SAO, Boomsma JJ: Sperm storage induces an immunity cost in ants. Nature. 2006, 441: 872-875. 10.1038/nature04698.
Simmons LW: Sperm Competition and Its Evolutionary Consequences in the Insects. 2001, Oxford: Princeton University Press
Tozetto SDO, Bitondi MMG, Dallacqua RP, Simoes ZLP: Protein profiles of testes, seminal vesicles and accessory glands of honey bee pupae and their relation to the ecdysteroid titer. Apidologie. 2007, 38: 1-11. 10.1051/apido:2006045.
Ram KR, Wolfner MF: Seminal influences: Drosophila Acps and the molecular interplay between males and females during reproduction. Integr Comp Biol. 2007, 47: 427-445. 10.1093/icb/icm046.
Poiani A: Complexity of seminal fluid: a review. Behav Ecol Sociobiol. 2006, 60: 289-310. 10.1007/s00265-006-0178-0.
Baer B: Bumblebees as model organisms to study male sexual selection in social insects. Behav Ecol Sociobiol. 2003, 54: 521-533. 10.1007/s00265-003-0673-5.
Findlay GD, Yi X, Maccoss MJ, Swanson WJ: Proteomics reveals novel Drosophila seminal fluid proteins transferred at mating. PLoS Biol. 2008, 6: e178-10.1371/journal.pbio.0060178.
Pilch B, Mann M: Large-scale and high-confidence proteomic analysis of human seminal plasma. Genome Biol. 2006, 7: R40-10.1186/gb-2006-7-5-r40.
Baer B, Heazlewood JL, Taylor NL, Eubel H, Millar AH: The seminal fluid proteome of the honeybee Apis mellifera. Proteomics. 2009, 9: 2085-2097. 10.1002/pmic.200800708.
Collins AM, Caperna TJ, Williams V, Garrett WM, Evans JD: Proteomic analyses of male contributions to honeybee sperm storage and mating. Insect Mol Biol. 2006, 15: 541-549. 10.1111/j.1365-2583.2006.00674.x.
Colonello NA, Hartfelder K: She's my girl - male accessory gland products and their function in the reproductive biology of social bees. Apidologie. 2005, 36: 231-244. 10.1051/apido:2005012.
Chen PS, Stumm Zollinger E, Aigaki T, Balmer J, Bienz M, Bohlen P: A male accessory gland peptide that regulates reproductive behavior of female Drosophila melanogaster. Cell. 1988, 54: 291-298. 10.1016/0092-8674(88)90192-4.
Baer B, Morgan ED, Schmid-Hempel P: A non-specific fatty acid within the bumblebee mating plug prevents females from remating. Proc Natl Acad Sci USA. 2001, 98: 3926-3928. 10.1073/pnas.061027998.
Baer B, Maile R, Schmid-Hempel P, Morgan ED, Jones GR: Chemistry of a mating plug in bumblebees. J Chem Ecol. 2000, 26: 1869-1875. 10.1023/A:1005596707591.
Sauter A, Brown MJF, Baer B, Schmid-Hempel P: Males of social insects can prevent queens from multiple mating. Proc R Soc B. 2001, 268: 1449-1454. 10.1098/rspb.2001.1680.
Pamilo P: Life span of queens in the ant Formica exsecta. Insect Soc. 1991, 38: 111-120. 10.1007/BF01240961.
Keller L, Genoud M: Extraordinary lifespans in ants: a test of evolutionary theories of ageing. Nature. 1997, 389: 958-960. 10.1038/40130.
Baer B, Dijkstra MB, Mueller UG, Nash DR, Boomsma JJ: Sperm length evolution in the fungus growing ants. Behav Ecol. 2009, 20: 38-45. 10.1093/beheco/arn112.
Baer B: Sexual selection in Apis bees. Apidologie. 2005, 36: 187-200. 10.1051/apido:2005013.
Boomsma JJ, Baer B, Heinze J: The evolution of male traits in social insects. Annu Rev Entomol. 2005, 50: 395-420. 10.1146/annurev.ento.50.071803.130416.
Weber NA: Gardening Ants: the Attines. 1972, Philadelphia: The American Philosophical Society
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13: 2498-2504. 10.1101/gr.1239303.
Baer B, Schmid-Hempel P: Sperm influences female hibernation success, survival and fitness in the bumblebee Bombus terrestris. Proc R Soc B. 2005, 272: 319-323. 10.1098/rspb.2004.2958.
Weinstock GM, Robinson GE, Gibbs RA, Worley KC, Evans JD, Maleszka R, Robertson HM, Weaver DB, Beye M, Bork P, Elsik CG, Hartfelder K, Hunt GJ, Zdobnov EM, Amdam GV, Bitondi MMG, Collins AM, Cristino AS, Lattorff HMG, Lobo CH, Moritz RFA, Nunes FMF, Page RE, Simoes ZLP, Wheeler D, Carninci P, Fukuda S, Hayashizaki Y, Kai C, Kawai J, et al: Insights into social insects from the genome of the honeybee Apis mellifera. Nature. 2006, 443: 931-949. 10.1038/nature05260.
Chan QWT, Howes CG, Foster LJ: Quantitative comparison of caste differences in honeybee hemolymph. Mol Cell Proteomics. 2006, 5: 2252-2262. 10.1074/mcp.M600197-MCP200.
Kyoto Encyclopedia of Genes and Genomes. [http://www.genome.jp/kegg/]
Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, Yamanishi Y: KEGG for linking genomes to life and the environment. Nucleic Acids Res. 2008, 36: D480-D484. 10.1093/nar/gkm882.
Lensky Y, Schindler H: Motility and reversible inactivation of honeybee spermatozoa in vivo and in vitro. Ann Abeille. 1967, 10: 5-16. 10.1051/apido:19670101.
Huss M, Holme P: Currency and commodity metabolites: their identification and relation to the modularity of metabolic networks. IET Syst Biol. 2007, 1: 280-285. 10.1049/iet-syb:20060077.
Prokupek A, Hoffmann F, Eyun SI, Moriyama E, Zhou M, Harshman L: An evolutionary expressed sequence tag analysis of Drosophila spermathecal genes. Evolution. 2008, 62: 2936-2947. 10.1111/j.1558-5646.2008.00493.x.
Allen AK, Spradling AC: The Sf1-related nuclear hormone receptor Hr39 regulates Drosophila female reproductive tract development and function. Development. 2008, 135: 311-321. 10.1242/dev.015156.
FlyAtlas: the Drosophila Gene Expression Atlas. [http://www.flyatlas.org]
Chintapalli VR, Wang J, Dow JAT: Using FlyAtlas to identify better Drosophila melanogaster models of human disease. Nat Genet. 2007, 39: 715-720. 10.1038/ng2049.
Stevens TH, Forgac M: Structure, function and regulation of the vacuolar (H+)-ATPase. Annu Rev Cell Dev Biol. 1997, 13: 779-808. 10.1146/annurev.cellbio.13.1.779.
Nelson N, Harvey WR: Vacuolar and plasma membrane proton-adenosinetriphosphatases. Physiol Rev. 1999, 79: 361-385.
Verma LR, Shuel RW: Respiratory metabolism of the semen of the honeybee, Apis mellifera. J Insect Physiol. 1973, 19: 97-103. 10.1016/0022-1910(73)90225-4.
Gessner B, Ruttner F: Transfer of spermatozoa into the spermatheca of the honey bee queen. Apidologie. 1977, 8: 1-18. 10.1051/apido:19770101.
Kunieda T, Fujiyuki T, Kucharski R, Foret S, Ament SA, Toth AL, Ohashi K, Takeuchi H, Kamikouchi A, Kage E, Morioka M, Beye M, Kubo T, Robinson GE, Maleszka R: Carbohydrate metabolism genes and pathways in insects: Insights from the honey bee genome. Insect Mol Biol. 2006, 15: 563-576. 10.1111/j.1365-2583.2006.00677.x.
BeeBase: Hymenoptera Genome Database. [http://www.beebase.org]
Kegg FTP Site. [ftp://ftp.genome.jp/pub/kegg/ligand/enzyme/enzyme]
TargetP 1.1 Server. [http://www.cbs.dtu.dk/services/TargetP/]
Psort Site. [http://psort.ims.u-tokyo.ac.jp/]
We were supported by the Australian Research Council (ARC) Discovery Program (Queen Elizabeth II Fellowship to BB, Australian Post-doctoral Fellowships to HE and NLT, Australian Professorial Fellowship to AHM) and the ARC Centre of Excellence in Plant Energy Biology (CE0561495). We thank the honeybee keepers of Western Australia (Better Bees of Western Australia) and especially Ron Clark for providing the necessary honeybee material for this study.
BB carried out the experimental work, analyzed the data and wrote the paper; HE participated in the SDS-PAGE work and MS/MS; NLT performed all MS/MS runs; NOT performed the network analysis; and AHM analyzed the data and co-wrote the paper. All authors read and approved the final manuscript.
Electronic supplementary material
Additional data file 1: MS/MS spectra derived from trypsin-digested peptides of spermathecal fluid proteins were matched using Mascot (Matrix Science) against Honey Bee PreRelease 2.0. In each case: 'MOWSE' is the Mascot reported molecular weight search score (>37 is P < 0.05); 'Peptides' is the number of peptides matched to the protein above the threshold as outlined in Materials and methods; and 'Gelspot' refers to the protein band number as shown in Figure 1. Shading indicates multiple matching data to the same Apis predicted protein sequence from gel bands. Seminal fluid and sperm data on the same proteins are reproduced from Baer et al. Bee genome GB number and the corresponding RefSeq protein GI from NCBI are given along with the assembly 4.0 gene ID for the bee genome. (XLS 64 KB)
Additional data file 2: MS/MS spectra derived from trypsin-digested peptides of spermathecal fluid proteins were matched using Mascot (Matrix Science) against Honey Bee PreRelease 2.0. In each case 'Spectra' is the number of spectra matching to each protein as outlined in Materials and methods. Bee genome GB number and the corresponding RefSeq protein GI from NCBI are given. (XLS 44 KB)
Additional data file 3: Visualization of spermathecal and seminal fluid metabolic networks based on the proteins identified in this study and in Baer et al. Colored nodes (rounded squares) represent enzymes in different functional categories, metabolites are shown as small grey circles, and reactions are shown as connecting lines between the enzyme and metabolite nodes. EC numbers are listed near enzyme nodes and metabolite names for all features are noted. The seven enzymes in common between the two datasets are highlighted by increased size, red outlines and consistent spatial arrangement in both networks. (EPS 4 MB)
About this article
Cite this article
Baer, B., Eubel, H., Taylor, N.L. et al. Insights into female sperm storage from the spermathecal fluid proteome of the honeybee Apis mellifera. Genome Biol 10, R67 (2009). https://doi.org/10.1186/gb-2009-10-6-r67