A novel informatics concept for high-throughput shotgun lipidomics based on the molecular fragmentation query language
- Ronny Herzog†1, 2,
- Dominik Schwudke†1, 3,
- Kai Schuhmann1, 2,
- Julio L Sampaio1,
- Stefan R Bornstein2,
- Michael Schroeder4 and
- Andrej Shevchenko1Email author
© Herzog et al.; licensee BioMed Central Ltd. 2011
Received: 26 July 2010
Accepted: 19 January 2011
Published: 19 January 2011
Shotgun lipidome profiling relies on direct mass spectrometric analysis of total lipid extracts from cells, tissues or organisms and is a powerful tool to elucidate the molecular composition of lipidomes. We present a novel informatics concept of the molecular fragmentation query language implemented within the LipidXplorer open source software kit that supports accurate quantification of individual species of any ionizable lipid class in shotgun spectra acquired on any mass spectrometry platform.
Lipidomics, an emerging scientific discipline, aims at the quantitative molecular characterization of the full lipid complement of cells, tissues or whole organisms (reviewed in [1–4]). Eukaryotic lipidomes comprise over a hundred lipid classes, each of which is represented by a large number of individual yet structurally related molecules. According to different estimates, a eukaryotic lipidome might contain from 9,000 to 100,000 individual molecular lipid species in total [2, 5]. Due to the enormous compositional complexity and diversity of physicochemical properties of individual lipid molecules, lipidomic analyses rely heavily on mass spectrometry. A shotgun lipidomics methodology implies that total lipid extracts from cells or tissues are directly infused into a tandem mass spectrometer and the identification of individual species relies on their accurately determined masses and/or MS/MS spectra acquired from corresponding precursor ions [6–8].
The apparent technical simplicity of shotgun lipidomics is appealing; indeed, molecular species from many lipid classes are determined in parallel in a single analysis with no chromatographic separation required. Species quantification is simplified because in direct infusion experiments the composition of electrosprayed analytes does not change over time. Adjusting the solvent composition (organic phase content, basic or acidic pH, buffer concentration) and ionization conditions (polarity mode, declustering energy, interface temperature, etc.) enhances the detection sensitivity by several orders of magnitude [8, 9]. In shotgun tandem mass spectrometry (MS/MS) analysis, all detectable precursors (or, alternatively, all plausible precursors from a pre-defined inclusion list) could be fragmented . Given enough time, the shotgun analysis would ultimately produce a comprehensive dataset of MS and MS/MS spectra comprising all fragment ions obtained from all ionizable lipid precursors.
While methods of acquiring shotgun mass spectra have been established, a major bottleneck exists in the accurate interpretation of spectra, despite the fact that several programs (LipidQA , LIMSA , FAAT , LipID , LipidSearch , LipidProfiler (now marketed as LipidView) , LipidInspector ) - have been developed for this. Although these programs utilize different algorithms for identifying lipids, they share a few common drawbacks. First, relying on a database of reference MS/MS spectra is usually counterproductive because many lipid precursor ions are isobaric and in shotgun experiments their collision-induced dissociation yields mixed populations of fragment ions. Second, lipid fragmentation pathways strongly depend both on the type of tandem mass spectrometer used (reviewed in ) and the experiment settings; therefore, compiling a single generic reference spectra library is often impossible and always impractical. Third, software is typically optimized towards supporting a certain instrumentation platform, while mass spectrometers deliver different mass resolution and mass accuracy and therefore different spectra interpretation algorithms are required. Fourth, the programs offer little support to lipidomics screens, which require batch processing of thousands of MS and MS/MS spectra, including multiple replicated analyses of the same samples.
Therefore, there is an urgent need to develop algorithms and software supporting consistent cross-platform interpretation of shotgun lipidomics datasets . We reasoned that such software could rely upon three simple rationales. First, MS and MS/MS spectra should not be interpreted individually; instead, the entire pool of acquired spectra should be organized into a single database-like structure that is probed according to user-defined reproducibility, mass resolution and mass accuracy criteria. Second, MS/MS spectra should be examined de novo in a user-defined way so that adding new interpretation routines (like, probing for another lipid class) should not require modifying the dataset or altering the program engine. Third, it should be possible to apply multiple parallel interpretation routines and, whenever required, bundle them with boolean operations to enhance the analysis specificity.
Here we report on LipidXplorer, a full featured software kit designed in consideration of these assumptions. It relies upon a flat file database (MasterScan) that organizes the spectra dataset acquired in the entire lipidomics experiment. To identify and quantify lipids, the MasterScan is then probed via queries written in the molecular fragmentation query language (MFQL), which supports any lipid identification routine in an intuitive, transparent and user-friendly manner independently of the instrumentation platform.
Results and discussion
Shotgun lipidomic experiments: terms and definitions
During shotgun analyses, spectra are acquired in the following way: within a certain period of time (for example, 30 s) a mass spectrometer repeatedly acquires individual spectra in much shorter intervals (for example, 1 s) that are termed as scans. Subsequent averaging of all related scans into a single representative spectrum increases mass accuracy and improves ion statistics.
Acquisition typically proceeds in a data-dependent mode: first, a survey (MS) spectrum is acquired to determine m/z and abundances of precursor ions. Then, MS/MS spectra are acquired from several automatically selected precursors and then the acquisition cycle (MS spectrum followed by a few MS/MS spectra) is repeated. Each acquisition comprises a large number of MS survey spectra and MS/MS spectra from selected precursors, while each spectrum is saved as several individual scans (Figure 1).
A typical lipidomics study might encompass 10 to 100 individual samples, from each of which 10 to 100 MS and 100 to 1,000 MS/MS spectra are acquired. Peaks in MS and MS/MS spectra share three common attributes: mass accuracy (expressed in Da or parts-per-million (ppm)), mass resolution (full peak width at half maximum (FWHM)) and peak occupancy. The two former attributes are determined by mass spectrometer type and equally apply to all peaks detected within the experiment. Contrarily, peak occupancy depends on both instrument performance and individual features of analyzed samples. Even multiple repetitive acquisitions do not fully compensate for under-sampling of low abundant precursors, especially if detected with poor signal-to-noise ratio. Since data-dependent acquisition of MS/MS spectra is biased towards fragmenting more abundant precursors, low abundant precursors might not necessarily be fragmented in all acquisitions. Therefore, the peak occupancy attribute, here defined as a frequency with which a particular peak is encountered in individual acquisitions within the full series of experiments, helps to balance coverage and reproducibility of lipid peak detection.
Concept and rationale
To support large scale shotgun lipidomics analyses, the software design should address three major conceptual problems: first, the software should utilize spectra acquired on any tandem mass spectrometer; second, it should identify and quantify species from any lipid class that were detected during mass spectrometric analysis; third, it should handle large datasets composed of highly redundant MS and MS/MS spectra, with several technical and biological replicates acquired from each analyzed sample, as well from multiple blanks and controls.
Common features of shotgun lipidomics software
MS + MS/MS
Database of lipid masses
Database of spectra
Offset correction of masses
All programs support direct lipid identification by MS and some also by MS/MS. Most of the software (excepting LipidXplorer) relies upon pre-compiled databases of expected precursor masses or libraries of MS/MS spectra that are either acquired in direct experiments or computed in silico. These databases are, in principle, expandable, yet users might not be able to add in new (or putative) lipid classes at will. The identification algorithms are tuned to expected patterns of fragment ions and mass resolution typical for a certain instrument and cross-platform interpretation of spectra is therefore difficult.
The conceptual difference between LipidXplorer and other lipidomics software (Table 1) is that it is fully database-independent. Effectively, each spectra dataset is interpreted de novo, while the interpretation rules formalized as MFQL queries may be altered at any time at the user's discretion. Also, LipidXplorer identifications proceed within a pre-processed dataset (MasterScan), which offers the means to adjust processing settings according to the peak attributes. Within the same framework LipidXplorer can accurately interpret spectra acquired on both high- and low-resolution tandem mass spectrometers from different vendors.
LipidXplorer was designed to support a pipeline of lipidomics experiments rather than to assist in identifying lipids in the collection of spectra from a single acquisition. It enables batch processing of all acquisitions made within the series of biological experiments. Users can group individual acquisitions (technical or biological replicates, controls, blanks, and so on) and then compare groups without altering the MasterScan file. Several features were specifically designed to improve the confidence and accuracy of lipid identification and quantification. LipidXplorer improves the mass accuracy by adjusting the masses using offsets to reference peaks. Built-in isotopic correction improves the quantification accuracy by adjusting the abundances of peaks within partially overlapping isotopic clusters.
LipidXplorer outputs the identified lipid species and abundances of user-defined reporter ions in each analyzed sample. We intentionally refrained from programming a module that would recalculate ion abundances into lipid concentrations because quantification routines applied in lipidomics are diverse and strongly project-dependent: they might rely upon several normalization factors (for example, total phosphate content, total protein content, relative normalization to another lipid class, to mention only a few) and employ a palette of internal standards. In high-throughput screens, intensities of precursor ions are directly output into the multivariate analysis software, bypassing the calculation of species abundances (reviewed in [5, 19]). At the same time, calculating the concentrations of individual lipids is a simple operation  that seldom fails once the accurate basis data (identified lipid species and intensities of reporter peaks) are provided.
The LipidXplorer software is organized in several functional modules (Figure 2) that are controlled by a simple intuitive graphical user interface (GUI; Additional file 1). LipidXplorer starts importing raw mass spectra by averaging individual scans into representative MS and MS/MS spectra. These spectra are further aligned by m/z of precursor and fragment ions, respectively, and then MS/MS spectra are associated with the corresponding precursor masses. Spectra-importing routines are instrument-dependent and consider common peak attributes: mass resolution and its change over the full range of m/z; minimum peak intensity thresholds specified separately for MS and MS/MS spectra; width of precursor isolation window in MS/MS experiments and the polarity mode. LipidXplorer also corrects observed masses by linear approximation of the mass shift calculated from a few reference masses (if any are detectable in the spectrum). It also pre-filters spectra by user-defined peak intensity and occupation thresholds that are also specified separately for MS and MS/MS modes.
Scan averaging algorithm
where I(m i ) is the intensity of the peak having mass m i , I max is the intensity of the most abundant peak within the bin B and m avg is the intensity weighted average mass.
The average mass is then stored as a single representative mass for this bin and the procedure is repeated for the next mass bin. We assume that the variation of peak masses is normally distributed within the bin and therefore the procedure should be repeated several times (Additional file 3). Computational tests (data not shown) suggested that three successive iterations should suffice for complete separation of bins such that masses are collected correctly into their dedicated bins and that no two adjacent bins are closer than the value of . One known limitation of this algorithm is that abundant chemical noise might impact binning accuracy. Therefore, we always set the threshold for signal-to-noise ratios of peaks at the value of 3.0, which is a commonly accepted estimate for calculating the limit of detection (LOD) of analytical methods.
MasterScan: a database of shotgun mass spectra
The MasterScan is a flat file database that stores all mass spectra acquired from all analyzed samples, including technical and biological replicates, blanks and controls. While building the MasterScan, individual acquisitions are processed and stored independently, although users could subsequently combine them into arbitrary groups.
The accurate alignment of MS and MS/MS spectra is a key step in interpreting shotgun lipidomics datasets, yet it is a computationally challenging task. Even successive mass spectrometric analyses of the same sample are not fully reproducible and masses of identical precursors and fragments might vary within certain ranges. Abundances of background peaks are affected by spraying conditions and therefore could hardly serve as robust references. At the same time, not all genuine lipid peaks can be aligned - some peaks might only appear in a few samples, while being fully undetectable in others. Also, the available algorithms for aligning mass spectra are not time-linear and are hardly applicable for shotgun datasets that include both MS and MS/MS spectra [23, 24].
The LipidXplorer spectra alignment algorithm (Additional file 4) is similar to the scan averaging algorithm; however, peak masses are averaged without weighting and intensities of all peaks are stored in a list. Each bin is represented by the average mass of individual peaks within the bin. This mass is associated with corresponding intensities in individual spectra, in which the aligned peaks were observed. Note that in tandem mass spectrometric experiments precursor ions are typically isolated within a mass window exceeding 1 Da. Depending on the mass resolution in MS spectra and the actual width of the precursor isolation window, multiple precursor masses might be associated with the same MS/MS spectrum.
The Molecular Fragmentation Query Language (MFQL)
MFQL is the first query language developed for the identification of molecules in complex shotgun spectra datasets. It formalizes the available or assumed knowledge of lipid fragmentation pathways into queries that are used for probing a MasterScan database. Below we introduce its design and present an example of composing a MFQL query for identifying species of phosphatidylcholines lipid class in a typical shotgun dataset.
Background and design rationale
A typical MFQL query consists of four sections:
DEFINE: defines sum compositions, sc-constraints, masses or groups of masses and associates them with user-defined names.
IDENTIFY: determines where and how the DEFINE content is applied. It usually encompasses searches for precursor and/or fragment ions in MS and MS/MS spectra.
SUCHTHAT: defines optional constraints that are formulated as mathematical expressions and inequalities, numerical values, peak attributes (Additional file 5), sum compositions and functions. Several individual constraints can be bundled by logical operations and applied together.
REPORT: establishes the output format.
A single MFQL query identifies all detectable species of a given lipid class in the dataset, if they share common fragmentation pathways. The MFQL concept takes full advantage of the apparent completeness of shotgun lipidomics datasets that might contain all fragment ions produced from all plausible precursors. In this way MFQL supports parallel application of any shotgun lipidomic approach, such as top-down screening [25, 26], multiple precursor and neutral loss scanning , multiple reaction monitoring [27, 28], among others. The Backus-Naur-Form (BNF) of MFQL is available in Additional file 6.
How to compose a MFQL query?
First, let us assign a name to the query:
QUERYNAME = Phosphatidylcholine;
Next, we define the variables used for identifying the species. Our query should identify the singly charged PC head group fragment and therefore:
headPC = 'C5 H15 O4 N1 P1' WITH CHG = +1;
In a shotgun experiment not all fragmented peaks will originate from PCs. For higher search specificity we next define precursors (prPC) that are expected to produce headPC fragment in MS/MS spectra. We impose the sc-constraint on precursor masses: in addition to sum composition requirements, it requests that precursors are singly charged and their degree of unsaturation (expressed as a double bond equivalent)  is within a certain range (here from 1.5 to 7.5):
prPC = 'C[30..48]H[30..200]NOP' WITH CHG = +1, DBR = (1.5, 7.5);
Next, the IDENTIFY section specifies that 'prPC' precursors should be identified in MS spectra (termed MS1 in the query) and 'headPC' fragments in MS/MS spectra (termed MS2), both acquired in positive mode. The logical operation AND requests that 'headPC' should only be searched in MS/MS spectra of 'prPC'.
prPC IN MS1+ AND
headPC IN MS2+
We further limit the search space by applying optional project-specific compositional constraints formulated in the next SUCHTHAT section. For example, it is generally assumed that mammals do not produce fatty acids having an odd number of carbon atoms. Therefore, we could optionally limit the search space by only considering lipids with even-numbered fatty acid moieties.
Here the operator isEven requests that candidate PC precursors should contain an even number of carbon atoms. Since the head group of PC and the glycerol backbone contain 5 and 3 carbon atoms, respectively, this implies that a lipid could not comprise fatty acid moieties with odd and even numbers of carbon atoms at the same time.
By executing the DEFINE, IDENTIFY and SUCHTHAT sections LipidXplorer will recognize spectra pertinent to PC species. The last section REPORT defines how these findings will be reported. This includes annotation of the recognized lipid species, reporting the abundances of characteristic ions for subsequent quantification and reporting additional information pertinent to the analysis, such as masses, mass differences (errors), and so on. LipidXplorer outputs the findings as a *.csv file in which identified species are in rows, while the column content is user-defined. In this example we define five columns, including NAME (to report the species name) and four peak attributes, such as: MASS, species mass; CHEMSC, chemical sum composition; ERROR, difference to the calculated mass; INTENS, intensities of the specified ions reported for each individual acquisition.
MASS = prPC.mass;
NAME = "PC [%d:%d]" % "((prPC.chemsc - headPC.chemsc)[C] - 3, prPC.chemsc[db] - 1.5)";
CHEMSC = prPC.chemsc;
ERROR = "%dppm" % "(prPC.errppm)";
INTENS = prPC.intensity;
FRAGINTENS = headPC.intensity;;
It is also possible to define mathematical terms or use certain functions, such as text formatting, on these attributes. The text format implies two strings separated by '%', where the first string contains placeholders and the second string their content. This formatting is used in the NAME string such that the actual annotation convention remains at the user's discretion. In this example two placeholders '%d' of the lipids class name "PC [%d:%d] " are filled with the number of carbon atoms and double bonds in the fatty acid moieties. The number of carbon atoms is calculated by subtracting the sum composition of 'headPC' from the precursor 'prPC' and subtracting 3 for carbons in the glycerol backbone (Figures 5 and 6).
We note that here our assignment of PC species only relied upon their precursor masses and the identification of the specific head group fragment in their MS/MS spectra. Therefore, we could only annotate the species by the total number of carbon atoms and double bonds in both fatty acid moieties (like PC 36:1), but we could not determine what these individual moieties really were.
Validation of the LipidXplorer algorithms
LipidXplorer has been subjected to extensive validation in two ways. First, we tested scan averaging, spectra alignment and isotopic correction routines in a series of experiments with specifically designed datasets. Second, we benchmarked overall LipidXplorer identification performance against available lipidomics software using the Escherichia coli total lipid extract as a sample and the curated list of identified species as a reference.
Validation of scan averaging
Comparison of scan averaging algorithms in Xcalibur and LipidXplorer
Number of peaks
158.40 ± 23.57
237.62 ± 37.36
736.22 ± 128.71
Mass difference, ppm
0.06 ± 0.09
0.08 ± 0.09
0.30 ± 0.09
Intensity difference, %
0.61 ± 0.87
0.72 ± 0.86
3.00 ± 1.24
Spearman rank correlation
0.99 ± 0.02
0.98 ± 0.02
0.94 ± 0.03
Mismatched masses, %
2.37 ± 1.57
7.06 ± 2.36
Validation of isotopic correction
The isotopic correction algorithm adjusts the intensities of peaks within partially overlapping isotopic clusters of neighboring lipid species [7, 12, 20]. The algorithm computes the expected profiles of isotopic clusters from the sum compositions of identified lipids and corrects corresponding peak intensities in both MS and MS/MS modes.
Measured molar ratios agreed with the expected ratios and ratios calculated from computationally simulated spectra (data not shown). We underscore that isotopic correction is absolutely required to determine the content of relatively low abundant species. Even at the moderate dynamic range of 1:9, the abundance of PA 18:0/18:1 would have been drastically overestimated in both MS and MS/MS measurements (Additional file 8).
Validation of the spectra alignment algorithm
The algorithm should recognize related peaks within the submitted spectra and attribute them to mass bins in a resolution-dependent manner, while individual peak abundances should be preserved. An ideal validation test should encompass a large collection of real-life spectra, while in each spectrum the correct (rather than measured) masses of peaks observed even at the lowest signal-to-noise ratio should be exactly known. Since this is unfeasible, we validated the algorithm in two separate tests. In the first test, peak abundances were effectively disregarded, yet the correct masses were exactly known and the dataset composition was controlled. The second test relied on a compendium of real-life spectra of total lipid extracts having typical distribution and variability of abundances of genuine lipid peaks, along with a large number of background peaks and chemical noise. However, the exact composition of lipid species in each sample was not known.
We first designed an experiment in which several spectra were computationally generated from a template spectrum and aligned in a MasterScan. The abundances of peaks were then correlated with the abundances of peaks in the original template spectrum. We designed the template spectrum such that the distance between the two adjacent peaks with the masses m 1 and m 2 was , where R = 500. Within a mass range of 500 to 945, which covers most lipid precursors, the template contained 319 peaks that were spaced, on average, by a distance of 1.4 Da. From this template we generated 256 spectra in which masses of peaks were randomly selected from Gaussian distributions having the centroid m and σ = , where R = 100,000 and m is the corresponding mass from the template spectrum. Note that, under selected resolution and spacing, peaks in the simulated spectra did not overlap.
Conventionally, LipidXplorer successively repeats spectra binning three times. However, for this test only, we configured LipidXplorer such that peaks were binned one, two and three times. After importing the spectra, we anticipated that all 319 peaks of the template spectrum should be present in the MasterScan and that occupation of individual peaks through all 256 spectra should mirror Gaussian distribution, if peaks were only binned once. Therefore, we expected to find 319 peaks with an average occupation of 0.68, since this is the number of peaks falling into the rage of [m- σ, m+σ] of the distribution, which equals a bin size of .
Computational validation of the peak alignment algorithm
Number of binning cycles
Average peak occupation
Average mass difference, ppm
0.65 ± 0.05
1.3 ± 0.8
0.87 ± 0.08
1.6 ± 0.7
0.97 ± 0.04
0.4 ± 0.4
However, this test assumed that in the aligned spectra no unrelated peaks fall into the same mass bin, which is unrealistic in real-life shotgun spectra. Therefore, we next tested if the alignment accuracy was affected by the complexity of the analyzed lipid mixtures and by chemical noise. To this end, we compared lipid species identified by LipidXplorer in individual spectra and in the same spectra aligned within the MasterScan.
Benchmarking the lipid identification performance
We benchmarked the LipidXplorer performance in two ways. First, we provided an estimate of the rate of false positive identifications by shotgun analysis of a total lipid extract. Second, we compared LipidXplorer identification performance with other programs that support shotgun lipidomics experiments by interpreting peak lists produced from MS and MS/MS spectra.
We note that the composition of any complex real-life lipid extract might not be exactly known and it is therefore difficult to judge if any particular identification is a false positive. To circumvent this problem, we first produced a dataset of MS and MS/MS spectra by analyzing a commercially available total lipid extract of E. coli on a LTQ Orbitrap XL mass spectrometer using data-dependent acquisition in negative ion mode. It is known that, upon collision-induced dissociation, molecular anions of glycerophospholipids produce abundant acyl anions of their fatty acid moieties that enable unequivocal identification of individual molecular species . The glycerophospholipidome of wild type E. coli comprises bulk quantities of phosphatidylethanolamines (PE class) and phosphatidylglycerols (PG class) and minor amounts of PA [32–34] that are identifiable with any available software. Also E. coli does not produce lipids with polyunsaturated fatty acid (PUFA) moieties [33, 35]. Therefore, we reasoned that species of other glycerophospholipid classes (such as phosphatidylinositols (PI class) and phosphatidylserines (PS class)) or any species containing PUFA, if identified by the software, will likely represent false positives. Cardiolipins, another major component of the E. coli lipidome, could be detected as both singly and doubly charged molecular anions, which might lead to inconsistent interpretations of both MS and MS/MS spectra by different software. We therefore deliberately omitted the identification of cardiolipins from our benchmarking protocol.
Benchmarking LipidXplorer identification performance using the E. coli lipidome
In summary, our software benchmarking procedure relied upon the following rationale: we estimated the rate of false negative identifications by comparing the software output to the reference list and we estimated the rate of false positive identifications by forcing the software to identify species from lipid classes that are not produced by E. coli. For the latter test, we only considered the lipid classes whose precursors readily produce molecular anions and whose masses might overlap with precursors of genuine E. coli lipids (PE, PG, PA) in low resolution mass spectra. Although LipidXplorer could restrict the search space by sc-constraints and, hence, reduce the expected rate of false positives (data not shown), for better consistency with other tested programs it was set to report hits with fatty acid moieties having up to 22 carbon atoms and up to 6 double bonds.
A separate dataset was acquired in eight technical replicates from the same E. coli extract under the low mass resolution of 800 for both MS and MS/MS modes, which is common for triple quadrupole or ion trap instruments. This dataset was independently processed by LipidXplorer, LipidQA and LipidSearch programs (Table 4). LipidQA and LipidSearch could only process each technical replicate independently. Therefore, their output was aligned by the reported lipid species and species identified in less than four (out of the total of eight) replicates were discarded. The same criterion was applied using an occupation threshold of 50% while testing LipidXplorer.
LipidXplorer produced a total of 53 identifications, which included 36 (100%) species from the reference list plus another 17 species (see Additional file 11 for corresponding MFQL queries). According to the above convention, one species was declared a false positive. Both LipidQA and LipidSearch reported fewer species from the reference lists and more false positives (Table 4). A full list of species identified by all software tools is presented in Additional file 12.
Based on these findings, we concluded that LipidXplorer outperformed the currently available software in interpreting shotgun lipidomics datasets.
Benchmarking LipidXplorer speed
Importing a dataset of 32 samples each consisting of 55 MS and 110 MS/MS scans in *.mzXML format took 59 s on an Intel Core 2 Duo CPU (T9300; 2.50 GHz) computer under Windows Vista. The total size of the *.mzXML files was 45 MB, whereas the size of the produced MasterScan file was only 3.35 MB. LipidXplorer identification of species of six lipid classes (PC, PC-O (1-alkyl-2-acylglycerophosphocholines), PE, PE-O (1-alkyl-2-acylglycerophosphoethanolamines), SM (sphingomyelins) and TAG (triacylglycerols)) required 59 s.
To test how the processing speed of LipidXplorer is affected by the spectra dataset size, we imported mzXML files totaling 168 MB that comprised 248 MS acquisitions each of approximately 2,400 peaks. Building the MasterScan file took 13 minutes on the same desktop PC and required 0.7 GB of RAM. Subsequent screening of the 29.1 MB MasterScan file with 16 MFQL queries required only 6.5 s. We note that a MasterScan is only built once from all spectra acquired in the project. Further interpretation of the dataset, including repetitive screening for other lipid classes or using alternative signature ions, does not require changing the MasterScan. Although LipidXplorer does not explicitly restrict the size of mzXML files, in our experience a dataset of 500 acquisitions each comprising 2,500 peaks might be a practical limit for desktop computers having up to 4 GB of RAM.
Enabling functionalities of LipidXplorer
Using MasterScan and MFQL within LipidXplorer software has two important analytical implications. First, LipidXplorer accurately processes MS and MS/MS spectra acquired on different tandem mass spectrometers whose mass resolution varies from the unit (triple quadrupoles, ion traps) to 100,000 (Orbitrap). Second, the software identifies any individual lipid species or entire lipid classes that were ionized and fragmented during the shotgun experiment.
LipidXplorer supports mass resolution-dependent interpretation of shotgun mass spectra
Mass resolution and mass accuracy of detected peaks are determined by the type of employed tandem mass spectrometer. LipidXplorer imports spectra in generic mzXML format and converters from proprietary formats to mzXML are available for major instrument platforms. Here we provide evidence that LipidXplorer consistently and accurately interprets spectra acquired at different mass resolution and accuracy.
Diacyl (PE) and alkylacyl (PE-O) lipids were distinguished by assigning the correct sum compositions to peaks observed at a mass resolution of 30,000. The number of false assignments to PE-O dropped from 33 at a MS resolution of 7,500 to 10 at a MS resolution of 30,000, which, as expected, distinguished peaks with a mass offset of approximately 30 mDa. Increasing mass resolution in MS spectra up to 100,000 further decreased the number of false positives, yet did not eliminate them completely. When the mass resolution was also increased in MS/MS mode up to 15,000 and enabled to match fragment masses with an accuracy of better than 5 mDa, the number of false positive assignments dropped to zero (Figure 9). Hence, we demonstrated that LipidXplorer takes full advantage of the high mass resolution and mass accuracy of a hybrid tandem mass spectrometer. It has also become apparent that averaging and alignment of related peaks in multiple experiments did not compensate for the limited identification specificity of low resolution machines (Additional file 13).
LipidXplorer supports consistent cross-platform identification of lipids
By its design and operational principles, LipidXplorer is not tethered to any particular mass spectrometry platform. The program imports shotgun spectra as instrument-independent peak lists or mzXML files. When building a MasterScan, LipidXplorer only considers a few generic features of raw MS and MS/MS spectra, such as mass resolution and mass accuracy, while MFQL adapts lipid identification routines to machine-dependent molecular fragmentation pathways. This implies that even if raw spectra are acquired on different machines and using different analytical modes (MS or MS/MS), their LipidXplorer interpretation should result in quantitatively consistent profiles provided the intensities of selected precursor and/or fragment peaks adequately represent the abundances of lipid species. To substantiate this, we validated LipidXplorer cross-platform performance in two steps. First, we demonstrated that lipid quantification by LipidXplorer corroborates an established independent analytical method that relies on a different instrument, operation mode and software; this ensured that LipidXplorer interpretations were correct. Second, we employed LipidXplorer for interpreting shotgun datasets of MS and MS/MS spectra acquired on different instruments and demonstrated that it produced quantitatively concordant molecular species profiles.
To this end, we analyzed a total lipid extract of E. coli on the LTQ Orbitrap Velos by MS and data-dependent MS/MS. Then, the same extract was analyzed on a quadrupole time-of-flight mass spectrometer QSTAR Pulsar i by MS and MS/MS and also by the MPIS method, which is a unique feature of QSTAR machines [16, 31]. The dataset of MPIS spectra was processed using LipidProfiler software. For better consistency, the mass resolution of the Orbitrap was set at 7,500 such that it was close to the mass resolution of the QSTAR. MS and MS/MS spectra were imported into MasterScan databases as mzXML files and the same MFQL queries (Additional file 11) were applied to identify and quantify 24 major species (15 from PE and 9 from PG lipid classes) that were detected in all analyses with good signal-to-noise ratios, which was important for consistent comparison of independent experiments. MS quantification relied on the intensities of intact molecular anions of corresponding species, while for MS/MS quantification the MFQL queries reported the intensities of acyl anion fragments of corresponding fatty acid moieties of each fragmented lipid precursor [10, 16].
Cross-platform correlation of relative abundances of E. coli lipidsa
Orbitrap versus QSTARc
Orbitrap versus MPIS QSTARd
QSTAR versus MPIS QSTARe
We therefore concluded that LipidXplorer processed spectra acquired using different mass spectrometers and by different (MS and MS/MS) methods in a consistent and quantitative manner.
LipidXplorer exploits the diversity of lipid fragmentation pathways
Lipid identification relies upon specific 'signature' ions detectable in MS and/or MS/MS mode that, not necessarily unequivocally, distinguish the molecular species from molecules of other lipid classes or of the same class. The conceptual advance of MFQL is that many of these ions and/or their combinations can be simultaneously recognized in each MS/MS spectrum and bundled with several independent sc-constraints. Here we demonstrate that these assignments are accurate and coherent and could be employed in parallel to recognize individual species of multiple lipid classes in total lipid extracts.
Multifaceted identification of bovine brain lipid species by LipidXplorer
Number of identified species
Number of signature ions
The interepretation of a shotgun dataset by LipidXplorer takes advantage of independent use of several signature ions for each lipid class. If detected at the high mass resolution, precursor ions of intact lipids are signature ions themselves. Some lipid classes, such as TAG, DAG and CL, have unique compositions of N, O and P atoms and can be unequivocally identified solely by their intact masses with no recourse to MS/MS .
We note that signature ions could be recognized by MFQL queries even if fragments originating from accidentally co-fragmented precursors are also present. Users also have full flexibility to choose the signature ions and sc-constraints for species identification and alter MFQL queries accordingly, while the species profiles produced by alternative interepretations remain quantitatively consistent (Figure 11c).
Probing the MasterScan with correspondent MFQL queries effectively emulated several lipid class-specific and lipid species-specific precursor ion and neutral loss scans [10, 16, 36, 37] (Figure 11). Signature ions might be associated with any structural feature of a lipid molecule and the power of the MFQL concept is that any of these can be recognized and used for the identification and quantification of individual species. Therefore, we argue that a combination of MFQL-assisted interpretation and the organization of shotgun lipidomics datasets in a MasterScan database enables cross-platform, accurate and comprehensive lipidomics analysis of complex biological samples.
This study addresses the architecture, algorithms, validation and advanced features of LipidXplorer software, which supports the broadest scope of current shotgun lipidomics experiments, from targeted quantification of selected lipid species or classes to high-throughout lipidomics screens. LipidXplorer and its early prototype, LipidX, have been extensively tested in real-life applications and have already contributed interesting biological results [25, 38–42]. Two key features distinguish LipidXplorer from other lipidomics software. First, the entire dataset comprising hundreds of MS and MS/MS spectra, including multiple technical and biological replicates, is organized into a single flat file database - the MasterScan. Second, for the first time, lipids are identified using user-defined queries formulated in the molecular fragmentation query language (MFQL). We demonstrate that MasterScan and MFQL make a powerful alliance enabling exhaustive interpretation of large shotgun datasets.
Shotgun lipidomics experiments could run on any tandem mass spectrometer with minimal sample preparation. We argue that, with flexible cross-platform software like LipidXplorer, a broad cell biology community can adopt lipidomics approaches for their specific needs, presumably at the same magnitude as proteomics methods are currently used. We note that LipidXplorer is just one possible implementation of a generic informatics concept that relies on MFQL-type interpretation of spectra. One much anticipated development is to extend the coverage of lipidomes of important model and medically relevant organisms by developing and validating queries covering all major lipid classes. An accessible public library of organism-specific queries should become an important resource for a broad lipidomics community. Better algorithms supporting all aspects of data processing could enhance the software potential in lipidomics screens. Importantly, LipidXplorer is an open-source software and its modular organization offers opportunity for further developments within a network of collaborating laboratories.
By eliminating major technical obstacles in identifying and quantifying any detectable lipid, LipidXplorer development revealed a few conceptual problems common to the entire lipidomics field. First, statistical estimates of species identification confidence should now be introduced also in lipidomics and each lipid composition report should be supported with a false discovery rate or similar statistical measure. It has become apparent (also from Table 4 and Figure 9) that false positive identifications commonly occur even when analyzing a relatively simple dataset. The next challenge would be to develop a statistical model that estimates identification confidence in a dataset- and instrument-specific way.
Another informatics challenge is unifying shotgun and liquid chromatography (LC)-MS or LC-MS/MS driven lipidomics on a common software platform. At the moment these approaches seem to be developing almost in parallel, although there have been efforts to enhance the performance of shotgun analysis by pre-fractionation of lipids by LC-MS . We argue that, because of its flexible architecture and spectra interpretation routines, LipidXplorer has the potential to develop into an integrated platform supporting a palette of lipidomics applications in a consistent, statistically rigorous manner.
Materials and methods
Annotation of lipid species
Lipid classes are: PE, phosphatidylethanolamines; LPE; lyso-phosphatidylethanolamines; PE-O, 1-alkyl-2-acylglycerophosphoethanolamines; PS, phosphatidylserines; PC, phosphatidylcholines; PC-O, 1-alkyl-2-acylglycerophosphocholines; LPC, lysophosphatidylcholines; SM, sphingomyelins; PA, phosphatidic acids; PG, phosphatidylglycerols; PI, phosphatidylinositols; DAG, diacylglycerols; TAG, triacylglycerols; CL, cardiolipins; LCL, triacyl-lysocardiolipins; Cer, ceramides; Chol, cholesterol; CholEst, cholesterol esters.
Individual molecular species are annotated as follows: <lipid class > <no. of carbon atoms in the first fatty acid or fatty alcohol moiety >:<no. of double bonds in the first fatty acid or fatty alcohol moiety >/<no. of carbon atoms in the second fatty acid moiety >:<no. of double bonds in the second fatty acid moiety >. For example, PC 18:0/18:1 stands for a phosphatidylcholine comprising the moieties stearic (18:0) and oleic (18:1) fatty acids. If the exact composition of fatty acid or fatty alcohol moieties is not known, the species are annotated as: <lipid class > <no. of carbon atoms in both moieties >:<no. of double bonds in both moieties >. In this way, PC 36:1 stands for a PC species having 36 carbon atoms and one double bond in both fatty acid moieties.
Mass spectrometry experiments
Mass spectrometry experiments were performed on a LTQ Orbitrap XL hybrid mass spectrometer (Thermo Fisher Scientific, Bremen, Germany) and, where specified, on a modified QSTAR Pulsar i quadrupole time-of-flight mass spectrometer (MDS Sciex, Concord, Ontario, Canada), both equipped with a robotic nanoflow ion source TriVersa (Advion BioSciences, Ithaca, NY, USA). If not specified otherwise, data-dependent acquisition was performed as described in . A data-dependent acquisition cycle consisted of one MS spectrum followed by MS/MS spectra acquired from ten most abundant precursor ions, whose masses were subsequently excluded from further MS/MS experiments. MS/MS spectra were acquired on a LTQ Orbitrap using pulsed Q collision-induced dissociation (PQD) under the normalized collision energy of 21%. Fragment ions were detected at the linear ion trap (IT) or Orbitrap analyzers, as indicated separately for each experiment. The linear ion trap was operated at the low (unit) mass resolution R, while mass resolution of the Orbitrap was set for each experiment separately using the target resolution parameter specified as FWHM of the peak at m/z 400. Where specified, LTQ Orbitrap MS/MS spectra were acquired by the method of higher energy collision-induced dissociation (HCD). Precursor ions were isolated by the linear ion trap at the unit resolution, fragmented in the HCD cell under the normalized collision energy of 45% and fragment ions detected by the Obitrap analyzer at a mass resolution of 7,500. MPIS scans were acquired on a quadrupole time-of-flight mass spectrometer QSTAR Pulsar i (AB Sciex, Toronto, Ontario, Canada) and interpreted by LipidProfiler software as described in . Data-dependent MS/MS experiments on a QSTAR Pulsar i were performed as described in .
Implementation of LipidXplorer software
LipidXplorer was programmed in Python 2.6. It imports spectra in *.mzXML  or peak lists in the *.dta/*.csv format. Free converters to *.mzXML are available at . LipidXplorer automatically converts *.raw or *.wiff files into *.mzXML using, respectively, ReAdW or mzWiff programs.
LipidXplorer organizes mass spectra in a database-like format termed MasterScan (*.sc). The MasterScan is saved using Python's PICKLE function  for Python object serialization.
The MFQL interpreter is written using PLY (Python Lex-Yacc) , a lexer/parser generator based on Lex and Yacc. A collection of MFQL scripts is included in the distributed version of LipidXplorer and supports quantitative profiling of 19 major lipid classes. The routine for calculating sum compositions is an exhaustive search algorithm written in C and imported into Python.
The algorithm for calculating isotopic distributions was developed by Dr Magnus Palmblad (University of Reading, UK) and converted to Python by Dr Brian H Clowers using the NUMPY module .
LipidXplorer is available under general public license (GPL) at . Full documentation on LipidXplorer, including the installation guidelines, a lipid identification tutorial and a library of MFQL scripts are provided at . A sample dataset of shotgun mass spectra is also available for testing local installations of the software.
LipidXplorer benchmarking: the dataset
E. coli total lipid extract was purchased from Avanti Polar Lipids (Alabaster, AL, USA) and analyzed on the LTQ Orbitrap XL instrument in negative ion mode. A solution of the total lipid concentration of 2.5 μg/ml in 7.5 mM ammonium acetate in choloroform/methanol/2-propanol (1/2/4, v/v/v) was infused into the mass spectrometer by TriVersa robotic ion source using a chip with the diameter of spraying nozzles of 4.1 μm. To produce the spectra dataset, the extract was analyzed in several independent experiments: experiment I, eight acquisitions under the unit mass resolution (R) settings using ion trap (IT) to acquire both MS and MS/MS spectra; experiment II, six acquisitions with R = 7,500 for MS spectra (Orbitrap) and unit resolution for MS/MS spectra (IT); experiment III, four acquisitions with R = 30,000 for MS spectra (Orbitrap) and unit resolution for MS/MS spectra (IT); experiment IV, four acquisitions with R = 100,000 for MS spectra (Orbitrap) and unit resolution for MS/MS spectra (IT); experiment V, seven acquisitions with R = 100,000 for MS spectra (Orbitrap) and R = 15,000 for MS/MS spectra (Orbitrap).
In the experiments I to IV, each acquisition produced approximately 33 MS and 330 MS/MS spectra; in the experiment V, 10 MS and 100 MS/MS spectra were acquired. To reduce undersampling, in the experiment V, acquisition of MS/MS spectra was navigated by the inclusion list compiled from 40 masses of plausible PE, PG and PA precursors A list of molecular lipid species was produced by manual interpretation of spectra acquired in the experiment V with requested mass tolerance of better than 3 ppm for precursors and 5 ppm for specific fragment ions. Only lipid species identified in at least four out of seven replicated analyses were included.
Spectra acquired in each of the experiments I to IV were further processed by LipidXplorer to produce corresponding MasterScan files. We used the dataset from the experiment I for comparative benchmarking of LipidXplorer against LipidQA and LipidSearch programs. Since LipidQA and LipidSearch do not align the spectra from replicated analyses, each acquisition was processed independently and then a non-redundant list of all identified lipid species was compiled.
LipidXplorer benchmarking: the procedure
Eight acquisitions containing complete sets of MS and MS/MS spectra were independently submitted as *.raw files. The output was aligned by reported lipid species. Individual lipid species were considered as positively identified if they were recognized in four or more replicated analyses. In all tests the programs were prompted to identify species of PE, PI, PS, PG and PA classes. Mass tolerance was set at 0.3 Da in MS and MS/MS modes; fatty acid moieties were assumed to comprise 12 to 22 carbon atoms and 0 to 6 double bonds.
Settings specific for each tested program were as follows.
LipidXplorer: 'MS threshold' was set to 100 and 'MS/MS threshold' to 5 counts per peak area; 'Resolution gradient' was set to 1; other common spectra import settings were as in Additional file 13 (setting: 'FAS_LTQ').
LipidQA (spectra were imported as *.raw files): 'MS error' and the 'MS/MS error' were both set to 0.3 Da; 'Finnigan Filter', on; 'Quantification', off; 'Mode selection', Neg. Mode; 'If MS2 spectra were centroided', checked. Only species with a score above 0.5 were accepted. The current version of LipidQA is available at .
Lipid Search version 2.0 beta: 'SearchType' was set to 'MS2,MS3'; 'ExpType' to 'Infusion'; 'Precursor tol' to '0.3 Da'; 'Product peak tol' to 0.3 Da; 'Intensity threshold' to 0.01; 'Threshold type' to Relative; 'M-score Threshold' to 10.0. The current version of LipidSearch is available at .
LipidProfiler v.1.0.97: the software was used for creating a reference list of lipids in the E. coli extract and utilized a separate dataset acquired on a QSTAR Pulsar i mass spectrometer by the MPIS method. Intensity threshold was set to 0.2%; all lipid species reported as 'confirmed results' in at least four independent acquisitions.
Validation of isotopic correction algorithm
We analyzed in two independent replicates a mixture of PA standards consisting of PA18:0/18:2, PA18:1/18:1, PA18:0/18:1 and PA18:0/18:0 (all from Avanti Polar Lipids) with the molar ratio of 1:9:1:1 on a LTQ Orbitrap Velos. Spectra were acquired under data-dependent acquisition control in negative mode using the linear ion trap analyzer under a target resolution of 800 for both MS and MS/MS. Precursors were fragmented using collision-induced dissociation. To process the dataset, mass tolerance was set to 300 ppm for MS and 500 ppm for MS/MS, spectra; occupation threshold was set to 0.5.
Validation of the peak alignment algorithm
We used a dataset of 128 MS spectra of human blood plasma extracts acquired on a LTQ Orbitrap XL mass spectrometer. Spectra were imported into a MasterScan file assuming a mass resolution of 127,500 (FWHM, at m/z 400), a mass accuracy of 4 ppm, and an occupation threshold of 0.5. Post-acquisition adjustment of peak masses was achieved using two reference masses of lipid standards spiked into the samples prior to extraction . Lipids of 11 major classes (PC, PC-O, PE, PE-O,, LPC, LPE, SM, DAG, TAG, Chol and CholEst were identified by their accurate masses with no recourse to MS/MS.
Validation of cross-platform quantification by LipidXplorer
Total lipid extract of E. coli was analyzed by multiple precursor ion scanning  and by data-dependent acquisition  on a QSTAR Pulsar i mass spectrometer. The same extract was analyzed by data-dependent HCD at the LTQ Orbitrap Velos mass spectrometer. Each analysis was performed in four replicates. Datasets of shotgun MS and MS/MS spectra were imported into MasterScan files built separately for each mass spectrometer and lipid species identified by MFQL queries (see Additional file 14 for the import settings and Additional file 11 for the queries). Lipid species were quantified in MS mode by using the intensities of their molecular ions. For MS/MS quantification, MFQL queries recognized and reported the sum of abundances of acyl anion fragments for each individual precursor. Relative quantities of individual lipids were calculated by normalizing to the total abundance of all species of the same lipid class. Parameters of linear correlation of lipid species profiles obtained by different methods (correlation coefficient R2 and slope) were computed by Microsoft Excel (see Additional file 14).
Analysis of bovine heart total lipid extract
Total lipid extract of bovine heart (Avanti Polar Lipids) was analyzed in six technical replicates on a LTQ-Orbitrap XL mass spectrometer using a target resolution of 100,000 for MS spectra (Orbitrap) and unit resolution for MS/MS (IT) in negative ion mode. Six replicates were acquired, each consisting of 31 MS and 310 MS/MS spectra.
Publicly accessible depository of spectra
Mass spectra used for benchmarking and validating of LipidXplorer are available in original formats (*.raw for LTQ Orbitrap and *.wiff for QSTAR Pulsar i) at the LipidXplorer wiki page .
full width at half maximum
higher energy collision-induced dissociation
molecular fragmentation query language
multiple precursorion scanning
tandem mass spectrometry
Pearson correlation factor
polyunsaturated fatty acid
We are grateful to our colleagues in MPI CBG, Technical University of Dresden and University of Heidelberg for valuable discussions and beta-testing of LipidXplorer software; to Mrs Kathy Eisenhofer and Dr Christer S Ejsing (University of Southern Denmark, Odense) for critical reading of the manuscript. We are indebted to Dr Ejsing for the Figure 5 concept and Dr Palmblad for helpful discussions on the algorithm for calculating the isotopic distribution. Work in the AS laboratory was supported by TRR 83 grant from Deutsche Forschungsgemeinschaft (DFG) and Virtual Liver (Code/0315757) grant from Bundesministerium f. Bildung u. Forschung (BMBF). DS is supported by Wellcome Trust/DBT India Alliance and is a recipient of NCBS-Merck&Co International Investigator Award. Work in the SRB laboratory was supported by the Paul Langerhans Institute Dresden.
- Wenk MR: The emerging field of lipidomics. Nat Rev Drug Discov. 2005, 4: 594-610. 10.1038/nrd1776.PubMedView ArticleGoogle Scholar
- van Meer G: Cellular lipidomics. EMBO J. 2005, 24: 3159-3165. 10.1038/sj.emboj.7600798.PubMedPubMed CentralView ArticleGoogle Scholar
- Dennis EA: Lipidomics joins the omics evolution. Proc Natl Acad Sci USA. 2009, 106: 2089-2090. 10.1073/pnas.0812636106.PubMedPubMed CentralView ArticleGoogle Scholar
- Oresic M, Hanninen VA, Vidal-Puig A: Lipidomics: a new window to biomedical frontiers. Trends Biotechnol. 2008, 26: 647-652. 10.1016/j.tibtech.2008.09.001.PubMedView ArticleGoogle Scholar
- Yetukuri L, Ekroos K, Vidal-Puig A, Oresic M: Informatics and computational strategies for the study of lipids. Mol Biosyst. 2008, 4: 121-127. 10.1039/b715468b.PubMedView ArticleGoogle Scholar
- Han X, Gross RW: Global analyses of cellular lipidomes directly from crude extracts of biological samples by ESI mass spectrometry: a bridge to lipidomics. J Lipid Res. 2003, 44: 1071-1079. 10.1194/jlr.R300004-JLR200.PubMedView ArticleGoogle Scholar
- Han X, Gross RW: Shotgun lipidomics: electrospray ionization mass spectrometric analysis and quantitation of cellular lipidomes directly from crude extracts of biological samples. Mass Spectrom Rev. 2005, 24: 367-412. 10.1002/mas.20023.PubMedView ArticleGoogle Scholar
- Ejsing CS, Sampaio JL, Surendranath V, Duchoslav E, Ekroos K, Klemm RW, Simons K, Shevchenko A: Global analysis of the yeast lipidome by quantitative shotgun mass spectrometry. Proc Natl Acad Sci USA. 2009, 106: 2136-2141. 10.1073/pnas.0811700106.PubMedPubMed CentralView ArticleGoogle Scholar
- Han X, Yang K, Yang J, Fikes KN, Cheng H, Gross RW: Factors influencing the electrospray intrasource separation and selective ionization of glycerophospholipids. J Am Soc Mass Spectrom. 2006, 17: 264-274. 10.1016/j.jasms.2005.11.003.PubMedView ArticleGoogle Scholar
- Schwudke D, Oegema J, Burton L, Entchev E, Hannich JT, Ejsing CS, Kurzchalia T, Shevchenko A: Lipid profiling by multiple precursor and neutral loss scanning driven by the data-dependent acquisition. Anal Chem. 2006, 78: 585-595. 10.1021/ac051605m.PubMedView ArticleGoogle Scholar
- Song H, Hsu FF, Ladenson J, Turk J: Algorithm for processing raw mass spectrometric data to identify and quantitate complex lipid molecular species in mixtures by data-dependent scanning and fragment ion database searching. J Am Soc Mass Spectrom. 2007, 18: 1848-1858. 10.1016/j.jasms.2007.07.023.PubMedPubMed CentralView ArticleGoogle Scholar
- Haimi P, Uphoff A, Hermansson M, Somerharju P: Software tools for analysis of mass spectrometric lipidome data. Anal Chem. 2006, 78: 8324-8331. 10.1021/ac061390w.PubMedView ArticleGoogle Scholar
- Leavell MD, Leary JA: Fatty acid analysis tool (FAAT): An FT-ICR MS lipid analysis algorithm. Anal Chem. 2006, 78: 5497-5503. 10.1021/ac0604179.PubMedView ArticleGoogle Scholar
- Hubner G, Crone C, Lindner B: lipID - a software tool for automated assignment of lipids in mass spectra. J Mass Spectrom. 2009, 44: 1676-1683.PubMedGoogle Scholar
- Houjou T, Yamatani K, Imagawa M, Shimizu T, Taguchi R: A shotgun tandem mass spectrometric analysis of phospholipids with normal-phase and/or reverse-phase liquid chromatography/electrospray ionization mass spectrometry. Rapid Commun Mass Spectrom. 2005, 19: 654-666. 10.1002/rcm.1836.PubMedView ArticleGoogle Scholar
- Ejsing CS, Duchoslav E, Sampaio J, Simons K, Bonner R, Thiele C, Ekroos K, Shevchenko A: Automated identification and quantification of glycerophospholipid molecular species by multiple precursor ion scanning. Anal Chem. 2006, 78: 6202-6214. 10.1021/ac060545x.PubMedView ArticleGoogle Scholar
- Glish GL, Burinsky DJ: Hybrid mass spectrometers for tandem mass spectrometry. J Am Soc Mass Spectrom. 2008, 19: 161-172. 10.1016/j.jasms.2007.11.013.PubMedView ArticleGoogle Scholar
- Shevchenko A, Simons K: Lipidomics: coming to grips with lipid diversity. Nat Rev Mol Cell Biol. 2010, 11: 593-598. 10.1038/nrm2934.PubMedView ArticleGoogle Scholar
- Forrester JS, Milne SB, Ivanova PT, Brown HA: Computational lipidomics: a multiplexed analysis of dynamic changes in membrane lipid composition during signal transduction. Mol Pharmacol. 2004, 65: 813-821. 10.1124/mol.65.4.813.PubMedView ArticleGoogle Scholar
- Liebisch G, Lieser B, Rathenberg J, Drobnik W, Schmitz G: High-throughput quantification of phosphatidylcholine and sphingomyelin by electrospray ionization tandem mass spectrometry coupled with isotope correction algorithm. Biochim Biophys Acta. 2004, 1686: 108-117.PubMedView ArticleGoogle Scholar
- Liu J, Bell AW, Bergeron JJ, Yanofsky CM, Carrillo B, Beaudrie CE, Kearney RE: Methods for peptide identification by spectral comparison. Proteome Sci. 2007, 5: 3-10.1186/1477-5956-5-3.PubMedPubMed CentralView ArticleGoogle Scholar
- Frank AM, Bandeira N, Shen Z, Tanner S, Briggs SP, Smith RD, Pevzner PA: Clustering millions of tandem mass spectra. J Proteome Res. 2008, 7: 113-122. 10.1021/pr070361e.PubMedPubMed CentralView ArticleGoogle Scholar
- Kaltenbach HM, Wilke A, Bocker S: SAMPI: protein identification with mass spectra alignments. BMC Bioinformatics. 2007, 8: 102-10.1186/1471-2105-8-102.PubMedPubMed CentralView ArticleGoogle Scholar
- Jeffries N: Algorithms for alignment of mass spectrometry proteomic data. Bioinformatics. 2005, 21: 3066-3073. 10.1093/bioinformatics/bti482.PubMedView ArticleGoogle Scholar
- Graessler J, Schwudke D, Schwarz PE, Herzog R, Shevchenko A, Bornstein SR: Top-down lipidomics reveals ether lipid deficiency in blood plasma of hypertensive patients. PLoS One. 2009, 4: e6261-10.1371/journal.pone.0006261.PubMedPubMed CentralView ArticleGoogle Scholar
- Schwudke D, Hannich JT, Surendranath V, Grimard V, Moehring T, Burton L, Kurzchalia T, Shevchenko A: Top-down lipidomic screens by multivariate analysis of high-resolution survey mass spectra. Anal Chem. 2007, 79: 4083-4093. 10.1021/ac062455y.PubMedView ArticleGoogle Scholar
- Schwudke D, Liebisch G, Herzog R, Schmitz G, Shevchenko A: Shotgun lipidomics by tandem mass spectrometry under data-dependent acquisition control. Methods Enzymol. 2007, 433: 175-191. full_text.PubMedView ArticleGoogle Scholar
- Liebisch G, Binder M, Schifferer R, Langmann T, Schulz B, Schmitz G: High throughput quantification of cholesterol and cholesteryl ester by electrospray ionization tandem mass spectrometry (ESI-MS/MS). Biochim Biophys Acta. 2006, 1761: 121-128.PubMedView ArticleGoogle Scholar
- Sparkman OD: Mass Spectrometry Desk Reference. 2006, Global View Publishing, 2Google Scholar
- Makarov A, Denisov E, Lange O, Horning S: Dynamic range of mass accuracy in LTQ Orbitrap hybrid mass spectrometer. J Am Soc Mass Spectrom. 2006, 17: 977-982. 10.1016/j.jasms.2006.03.006.PubMedView ArticleGoogle Scholar
- Ekroos K, Chernushevich IV, Simons K, Shevchenko A: Quantitative profiling of phospholipids by multiple precursor ion scanning on a hybrid quadrupole time-of-flight mass spectrometer. Anal Chem. 2002, 74: 941-949. 10.1021/ac015655c.PubMedView ArticleGoogle Scholar
- Oursel D, Loutelier-Bourhis C, Orange N, Chevalier S, Norris V, Lange CM: Lipid composition of membranes of Escherichia coli by liquid chromatography/tandem mass spectrometry using negative electrospray ionization. Rapid Commun Mass Spectrom. 2007, 21: 1721-1728. 10.1002/rcm.3013.PubMedView ArticleGoogle Scholar
- Geiger O, Gonzalez-Silva N, Lopez-Lara IM, Sohlenkamp C: Amino acid-containing membrane lipids in bacteria. Prog Lipid Res. 2010, 49: 46-60. 10.1016/j.plipres.2009.08.002.PubMedView ArticleGoogle Scholar
- Kikuchi S, Shibuya I, Matsumoto K: Viability of an Escherichia coli pgsA null mutant lacking detectable phosphatidylglycerol and cardiolipin. J Bacteriol. 2000, 182: 371-376. 10.1128/JB.182.2.371-376.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Oursel D, Loutelier-Bourhis C, Orange N, Chevalier S, Norris V, Lange CM: Identification and relative quantification of fatty acids in Escherichia coli membranes by gas chromatography/mass spectrometry. Rapid Commun Mass Spectrom. 2007, 21: 3229-3233. 10.1002/rcm.3177.PubMedView ArticleGoogle Scholar
- Ekroos K, Ejsing CS, Bahr U, Karas M, Simons K, Shevchenko A: Charting molecular composition of phosphatidylcholines by fatty acid scanning and ion trap MS3 fragmentation. J Lipid Res. 2003, 44: 2181-2192. 10.1194/jlr.D300020-JLR200.PubMedView ArticleGoogle Scholar
- Han X, Gross RW: Quantitative analysis and molecular species fingerprinting of triacylglyceride molecular species directly from lipid extracts of biological samples by electrospray ionization tandem mass spectrometry. Anal Biochem. 2001, 295: 88-100. 10.1006/abio.2001.5178.PubMedView ArticleGoogle Scholar
- Reich A, Schwudke D, Meurer M, Lehmann B, Shevchenko A: Lipidome of narrow-band ultraviolet B irradiated keratinocytes shows apoptotic hallmarks. Exp Dermatol. 2010, 19: e103-110. 10.1111/j.1600-0625.2009.01000.x.PubMedView ArticleGoogle Scholar
- Raa H, Grimmer S, Schwudke D, Bergan J, Walchli S, Skotland T, Shevchenko A, Sandvig K: Glycosphingolipid requirements for endosome-to-Golgi transport of Shiga toxin. Traffic. 2009, 10: 868-882. 10.1111/j.1600-0854.2009.00919.x.PubMedView ArticleGoogle Scholar
- Saito K, Dubreuil V, Arai Y, Wilsch-Bräuninger M, Schwudke D, Saher G, Miyata T, Breier G, Thiele C, Shevchenko A, Nave KA, Huttner WB: Ablation of cholesterol biosynthesis in neural stem cells increases their VEGF expression and angiogenesis but causes neuron apoptosis. Proc Natl Acad Sci USA. 2009, 106: 8350-8355. 10.1073/pnas.0903541106.PubMedPubMed CentralView ArticleGoogle Scholar
- Carvalho M, Schwudke D, Sampaio JL, Palm W, Riezman I, Dey G, Gupta GD, Mayor S, Riezman H, Shevchenko A, Kurzchalia TV, Eaton S: Survival strategies of a sterol auxotroph. Development. 2010, 137: 3675-3685. 10.1242/dev.044560.PubMedPubMed CentralView ArticleGoogle Scholar
- Penkov S, Mende F, Zagoriy V, Erkut C, Martin R, Pässler U, Schuhmann K, Schwudke D, Gruner M, Mäntler J, Reichert-Müller T, Shevchenko A, Knölker HJ, Kurzchalia TV: Maradolipids: diacyltrehalose glycolipids specific to dauer larva in Caenorhabditis elegans. Angew Chem Int Ed Engl. 2010, 49: 9430-9435. 10.1002/anie.201004466.PubMedView ArticleGoogle Scholar
- Sommer U, Herscovitz H, Welty FK, Costello CE: LC-MS-based method for the qualitative and quantitative analysis of complex lipid mixtures. J Lipid Res. 2006, 47: 804-814. 10.1194/jlr.M500506-JLR200.PubMedView ArticleGoogle Scholar
- Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, Pratt B, Nilsson E, Angeletti RH, Apweiler R, Cheung K, Costello CE, Hermjakob H, Huang S, Julian RK, Kapp E, McComb ME, Oliver SG, Omenn G, Paton NW, Simpson R, Smith R, Taylor CF, Zhu W, Aebersold R: A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol. 2004, 22: 1459-1466. 10.1038/nbt1031.PubMedView ArticleGoogle Scholar
- Trans-Proteomic Pipeline. [http://tools.proteomecenter.org/wiki/index.php?title=Software:TPP]
- Python documentation. [http://docs.python.org/library/pickle.html]
- Python Lex-Yacc. [http://www.dabeaz.com/ply/]
- NumPy. [http://numpy.scipy.org]
- SourceForge. [https://sourceforge.net/projects/lipidxplorer/files/]
- LipidXplorer Wiki site. [https://wiki.mpi-cbg.de/wiki/lipidx/index.php/Main_Page]
- NIH/NCRR Mass Spectrometry Resource at Washington University in St Louis. [http://msr.dom.wustl.edu/Personnel/Staff_Scientist_Song_Haowei.htm]
- Lipid Search. [http://lipidsearch.jp/lipidsearch/lipidsearch.do]
- Lipid Maps. [http://www.lipidmaps.org/]
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.