Skip to main content


Volume 12 Supplement 1

Beyond the Genome 2011

  • Poster presentation
  • Open Access

InSilico DB: an online platform to collaboratively structure and export publicly available datasets from the Gene Expression Omnibus database

  • 1,
  • 1,
  • 1,
  • 2,
  • 2,
  • 2,
  • 2,
  • 2,
  • 2,
  • 1 and
  • 1
Genome Biology201112 (Suppl 1) :P33

  • Published:


  • Gene Expression Omnibus
  • Online Platform
  • Gene Expression Omnibus Database
  • Expression Platform
  • Platform Data

There are more than 20,000 genomic studies comprising 500,000 samples freely available in the Gene Expression Omnibus (GEO) database [1]. However, accessing these data requires complex computational steps, including structuring and formatting the clinical vocabulary used to annotate the samples. These complex steps hinder the accessibility of genomic datasets through visualization and analysis software platforms, such as GenePattern and R/Bioconductor, therefore hampering the pace of research.

InSilico DB [2] is an online platform that provides a complete collaborative solution for structuring and formatting clinical annotations from GEO, making GenePattern and R datasets one click away for researchers.

InSilico DB has made available powerful and intuitive online curation tools to structure the metadata of GEO datasets. The database is automatically updated daily, through GEO import pipelines. Datasets can have multiple annotations given by different users, and one user can have multiple versions of an annotation to suit different experimental questions.

The InSilico DB platform supports datasets from Affymetrix human gene expression platforms, which account for 2,900 studies comprising 110,000 samples, making InSilico DB the largest public database of manually curated human gene expression samples. In addition to the web interface, InSilico DB offers programmatic access through an R/Bioconductor package [3].

Future releases of InSilico DB will include Illumina RNA-Seq platform data and Affymetrix mouse gene expression data.

Authors’ Affiliations

IRIDIA, Université libre de Bruxelles, Brussels, 1050, Belgium
COMO, Vrije Universiteit Brussel, Brussels, 1050, Belgium


  1. Galperin MY, Cochrane GR: The 2011 Nucleic Acids Research Database Issue and the online Molecular Biology Database Collection. Nucleic Acids Res. 2011, 39: D1-D6. 10.1093/nar/gkq1243.PubMedPubMed CentralView ArticleGoogle Scholar
  2. InSilico DB. []
  3. R/Bioconductor package for InSilico DB. []


© Coletta et al; licensee BioMed Central Ltd. 2011

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.