Skip to main content

Table 1 A key to the databases mentioned in this article

From: Genome cartography through domain annotation

Database URL Description
COGs Clusters of orthologous groups of proteins, generated from the
   comparison of protein sequences encoded in 34 complete genomes,
   representing 26 major phylogenetic lineages.
DAS A distributed sequence annotation system software client and
   database server for the annotation of protein sequences.
Ensembl Software for the automatic annotation of eukaryotic genomes.
   Annotation and searching with gene, SNP, and cross-genome
   comparative information.
EuGenes Automatic annotations of sequence databases with gene and genomic
   information, including chromosome, genetic and molecular maps.
Gene Ontology A dynamic, controlled vocabulary applicable to the annotation of
   eukaryotic genomes. Includes knowledge of the role of genes and
   proteins within cells.
GeneCards A database of human genes that maps genes, proteins and
   diseases. Provides information on gene function.
InterPro Proteome analysis database based on Pfam, SMART, Prosite, PRINTS
   and ProDom protein and domain family databases and the SWISS-
   PROT and TrEMBL sequence databases. Also contains software for
   the annotation of protein sequences using these databases.
LocusLink Interface to a database of sequence and descriptive information
   correlated with genetic loci.
Mammalian Mammalian homology and comparative maps. Tools and databases
Homology   from the Jackson Laboratory for the comparison of mammalian
Pfam Protein families database containing multiple sequence alignments
   and hidden Markov models.
PRINTS Database of protein fingerprints based on protein motifs.
ProDom Protein domain database based on an automatic compilation of
   homologous domains.
Prosite Prosite profiles are protein domain profiles constructed from multiple
   sequence alignments of proteins from families of related sequences.
SMART Protein domain families database containing multiple sequence
   alignments and hidden Markov models, based on a smaller set of
   domains than Pfam but designed to find domains that are more
   difficult to detect.
SMD The Stanford microarray database of raw and normalized data from
   microarray experiments, including interfaces for data retrieval and
SWISS-PROT Protein sequence databases. SWISS-PROT represents a 'gold
and Trembl   standard' of annotation.
TIGRFAMs Database of protein families based on hidden Markov models of
   multiple protein sequence alignments.