Schema Overview

Related Topics: Contents  |  Experimental Modelling  |  Attributes


click on the table names in this image to see additional documentation

The schema is the collection of tables and the links between them. It is split into four sections:

Experiment
a collection of related hybridisations and the resulting data
Array Design
the contents and the layout of a microarray
Bio-Materials
the actual biological entities that are used
Protocols
standardised methods of operation in the laboratory

The attributes that are defined for each table depend on the attribute definition for a particular database. A default set of attributes is supplied which are suitable for capturing all of the information required for compliance with the MIAME standard. The collection of attributes can be customised for particular domains.

 

The Experiment Section

 

Experiment

The Experiment table records information common to a set of Hybridisations such as the goal of the experiment, the factors examined, a brief description and details of any publication which describe the experiment.

Each Experiment has a link to a Submitter which identifies the person who contributed this Experiment to the database.

 

Submitter

A Submitter is the person or organisation who supplied one or more Experiments.

 

Measurement

Each Measurement represents the collection of results from analysing the scanned image of the microarray after Hybridisation. Information stored in this table includes the name of the person who performed the hybridisation, when it was done and which HybridisationProtocol was used.

A Measurement is linked to the Image from which the results were obtained. More than one Measurement can be linked to a particular Image if, for example, the Image was scanned multiple times using a different ScanningProtocol.

Measurements can be linked to any number of Property instances in which the actual experimental data is stored. Each Property corresponds to one column in the file that came from the scanner, or to data derived in subsequent data analysis such as normalisation.

 

Image

The Image records details of the image file that is produced when an Array has been analysed by a scanner. Relevant details include which ScanningProtocol was used, who the scan was done by, when it was done and the resolution and colour depth of the resulting image.

The image data itself is not stored in the database. Images should be archived elsewhere and a link to where the archived image should be recorded in the Image instance.

 

Hybridisation

Hybridisation is the process in which one or more LabelledExtracts are placed on an Array which is then treated according to a given HybridisationProtocol. Afterwards, the Array is scanned, resulting in an Image which is subsequently analysed to acquire the expression profile data.

 

Array

An Array repesents a single microarray 'chip' which is used to perform a Hybridisation. The Array table records data including the batch number of the chip and the date on which it was manufactured.

Each Array must be linked to an ArrayType which contains the actual description of the array design used to manufacture the array.

 

The Array Design Section

 

ArrayType

ArrayType records information that is common to a set of Array objects, i.e. the description of a particular design of microarray. Each Array that uses this design should be linked to this ArrayType instance.

Each ArrayType can be linked to by any number of Feature instances, each of which represents a single well or spot on the microarray described by the ArrayType.

The suggested form for ArrayType names is "Lab Name - Species Name - Unique Identifier", for example "Bloggs - Homo Sapiens - xyz200/v2".

 

Feature

A Feature represents a unique location on an Array. Each Feature has an ArrayType link recording which ArrayType the Feature belongs to. A Feature can only belong to one ArrayType.

Each Feature normally has a link to a Reporter, however this link can be omitted if it is not known what has been spotted or synthesised in this location.

Features record the location of the spot (both as an absolute position on the array and as a row/column location) and whether this Feature is a control element (and if so, what type of control).

 

Reporter

A Reporter (sometimes refered to as a 'probe') is a nucleotide sequence that has been placed in one or more Features on an Array.

Reporter can be linked to any number of Genes. These links identify what the Reporter is supposed to be detecting. Multiple Reporters can be linked to the same Gene in the case where several short sequences are used to identify the presence of a single larger sequence. Alternatively, a single Reporter can be linked to more than one Gene when it is known that several sequences are likely to bind to the same material.

Reporters can be shared between ArrayTypes. If the same material is used to create several different ArrayTypes then the same collection of Reporters can be used when creating the Features.

Each Reporter instance stores information including the 'type' of the Reporter (e.g. cDNA clone or plasmid), the sequence data, references to external genomic databases and how this Reporter was manufactured.

 

Gene

The Gene is a general purpose term refering to any nucleotide sequence who's presence can be detected by one or more Reporters.

A Gene instance stores information including the 'type' of the Gene (e.g. EST or exon), the actual sequence data and any references to external genomic databases providing further information.

Each Gene can be linked to by any number of Reporters.

 

The Bio-Materials Section

 

Source

A Source represents the organism, tissue sample or individual from which all the other bio-materials are derived. It is the starting point for a series of processes which result in a LabelledExtract that can be used for Hybridisation.

The Source tables can record a great deal of information, not all of which is relevant to every domain. Possible data that can be captured includes the organism name, the sex, age and developmental stage of the individual, details of the cell type or line used, and information about any known disease carried by the Source.

Where possible, terminology used to describe the Source should come from well known ontologies (i.e. standard dictionaries of well-defined terms, for example the NCBI Taxonomy). More information about providing terms from ontologies can be found here.

 

Sample

A Sample represents the acquistion of material from a Source. The process by which the Sample is obtained is described using a SamplingProtocol.

Information that can be stored about a Sample includes the date on which it was created, the name of the person who did the sampling and the type of material obtained.

The Sample is one suitable point for use in recording the underlying experimental design. In a "treated vs. untreated" style experiment for example, two Samples might be taken from the same Source.

 

TreatedSample

A TreatedSample is a Sample which has had something done to it. Details of the particular treatment performed are recorded in the associated TreatmentProtocol.

The TreatedSample table is unique in that it allows a recursive link, i.e. a TreatedSample can be linked to another TreatedSample. This enables arbitrarily long chains of TreatedSamples to be created, with each step in the chain representing the application of a different treatment. An example of this might be TreatedSample 'X1' was dosed with some compound, resulting in TreatedSample 'X2' which was then incubated for 24 hours, resulting in TreatedSample 'X3' which was then dosed with a different compound resulting in TreatedSample 'X4'.

 

Extract

An Extract is the final stage in bio-material preparation prior to the creation of a LabelledExtract.

The particular extraction method that was used is recorded in the ExtractionProtocol link. The Extract table can also be used to record when the extraction event occured and the name of person who performed it.

An Extract is linked to one or more TreatedSamples. Linking to multiple TreatedSamples enables a 'pooling' operation to be represented, in which material from several TreatedSamples is combined to provide sufficient material for Hybridisation.

 

LabelledExtract

A LabelledExtract is an Extract which has been combined with a compound that is detectable by a scanner, typically a fluorescent dye or radioactive marker.

Details of how the Extract was labelled are recorded in the LabellingProtocol which is linked to each LabelledExtract.

Information about when the labelling was done and who did it can be stored in this table. The quantity of material that was produced must also be recorded.

 

The Protocols Section

The protocol tables record information about a specific task performed as part of the experiment. Typically a given laboratory will have a small number of standardised protocols that are reused between experiments. However, variations from these standard protocols should always be captured by creating a new protocol instance and linking to that rather than the standard one from which it is derived.

All of the protocol tables share a common format, storing a description of the equipment (hardware and/or software) that was used and a reference to a publication or other resource which describes in protocol in more detail.

 

SamplingProtocol

The SamplingProtocol describes the process of preparing a Sample of material from a specific Source.

 

TreatmentProtocol

A TreatmentProtocol captures information about something that was done to a Sample (or TreatedSample) as part of the experiment. Examples might include "exposure to radiation" or "incubation for a time period".

 

ExtractionProtocol

The ExtractionProtocol details how material was extracted from one or more TreatedSamples in preparation for labelling.

 

LabellingProtocol

A LabellingProtocol records the steps taken to attach some form of detectable marker to an Extract so that the expression levels of candidate sequences can subsequently be quantified in the scanning process.

 

HybridisationProtocol

The HybridisationProtocol should provide full details of how the Hybridisation was performed, including the blocking agent(s) used, the wash procedure and all relevant parameters settings of the hybridisation station.

 

ScanningProtocol

The ScanningProtocol describes the hardware and/or software used to create an Image by scanning the Array after the Hybridisation was completed.

The ScanningProtocol should record all of the parameters of the scanning hardware (for example laser power, voltages and gain settings).

 

ImageAnalysisProtocol

An ImageAnalysisProtocol describes the process by which an Image is analysed to generate a series of data which can be stored in the corresponding Measurement. Full details of the software used, and the parameters chosen should be provided.