ENGRAILED Engrailed homeodomain signature
Type of fingerprint: COMPOUND with 2 elements
Links:
PRINTS; PR00024 HOMEOBOX; PR00025 ANTENNAPEDIA; PR00027 PAIREDBOX
PRINTS; PR00028 POUDOMAIN; PR00029 OCTAMER; PR00030 HTHCRO
PRINTS; PR00031 HTHREPRESSR
INTERPRO; IPR000747
PROSITE; PS00033 ENGRAILED
Creation date 08-JUN-1993; UPDATE 14-JUN-1999
1. XUE, Z.G., GEHRING, W.J. AND LE DOUARIN, N.M.
Quox-1, a quail homeobox gene expressed in the embryonic central nervous
system, including the forebrain.
PROC.NATL.ACAD.SCI.U.S.A. 88(6) 2427-2431 (1991).
2. GEHRING, W.J.
Homeo boxes in the study of development.
SCIENCE 236 1245-1252 (1987).
3. ANGERER, L.M., DOLECKI, G.J., GAGNON, M.L., LUM, R., WANG, G., YANG, Q.,
HUMPHREYS, T. AND ANGERER, R.C.
Progressively restricted expression of a homeo box gene within the aboral
ectoderm of developing sea urchin embryos.
GENES DEV. 3(3) 370-383 (1989).
4. SASAKI, H., YOKOYAMA, E. AND KUROIWA, A.
Specific DNA-binding of the 2 chicken deformed family homeodomain proteins,
Chox-1.4 and Chox-A.
NUCLEIC ACIDS RES. 18 1739-1747 (1990).
5. BRENNAN, R.G., TAKEDA, Y., KIM, J., ANDERSON, W.F. AND MATTHEWS, B.W.
Crystallization of a complex of cro repressor with a 17 base pair operator.
J.MOL.BIOL. 188 115-118 (1986).
Organisms develop according to a precise program that specifies the body
plan in intricate detail and also determines the timing of developmental
events [1]. The highly complex nature of these events has suggested that
the process may be regulated by proteins capable of controlling the
temporal and spatial expression of many structural genes. The genes for
this process were first discovered as homeotic mutations in Drosophila [2],
and many similar genes are now known in a wide variety of organisms.
Proteins that regulate developmental gene expression are nuclear proteins
[3] that contain a conserved domain known as the homeobox, the flanking
sequences of which differ considerably among different proteins. The
homeodomain includes the helix-turn-helix (HTH) motif, which binds to DNA in a
sequence-specific manner to exert a temporal and spatial regulation of
developmental gene expression [4]. The second helix of this motif binds
to DNA via a number of hydrogen bonds and hydrophobic interactions, which
occur between specific side chains and the exposed bases and thymine methyl
groups within the major groove. The first helix may help to stabilise the
structure [5].
Many homeodomain-containing proteins have now been sequenced and, while
the homeodomain-flanking regions vary, characteristic conserved sequences
upstream of the domain allow the proteins to be grouped into 3 subfamilies:
the so-called antennapedia, engrailed and 'paired box' proteins. Engrailed
plays an important role in Drosophila segmentation and neurogenesis,
affecting genes in posterior compartments of the developing embryo. It is
also required for the development of the central nervous system. Homologues
found in other species may play a role in neurogenesis, possibly in both
the compartmentalisation of the developing neural tube and specification of
particular neuronal populations. Members of the engrailed subfamily of
proteins contain a conserved region of 20 amino acids located C-terminal
to the homeobox, the specific function of which is unclear.
ENGRAILED is a 2-element fingerprint that provides a signature for the
engrailed-type homeobox proteins. The fingerprint was derived from an
initial alignment of 6 sequences: motif 2 encodes the conserved region
C-terminal to the homeobox (cf. PROSITE pattern ENGRAILED (PS00033)).
Two iterations on OWL20.0 were required to reach convergence, at which
point a true set comprising 20 sequences was identified.
An update on SPTR37_9f identified a true set of 25 sequences.
SUMMARY INFORMATION
25 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
   2|  25  25
  --+--------
    |   1   2
True positives..
HME1_CHICK HME1_MOUSE HME1_HUMAN HMEC_XENLA
HME2_CHICK HME2_HUMAN HME2_MOUSE HME2_BRARE
HMED_XENLA HMEN_ARTSF P90688 HME1_BRARE
HME3_BRARE HMIN_DROME HMEN_ANOGA Q26371
HMEN_BOMMO HMEN_DROVI HMEN_DROME HMIN_BOMMO
Q25212 O76848 Q26601 HX11_MOUSE
HX11_HUMAN
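The composite index above can be read as a tally: each row is a number of matched elements, each column is a motif, and each cell counts the sequences with that many matches that include that motif. The Python sketch below rebuilds the table from the true-positive codes under that reading of the layout. The function name `composite_index` and its input shape are illustrative assumptions, not part of the database software.

```python
from collections import Counter

# Codes copied from the "True positives" list of this entry.
TRUE_POSITIVES = """
HME1_CHICK HME1_MOUSE HME1_HUMAN HMEC_XENLA
HME2_CHICK HME2_HUMAN HME2_MOUSE HME2_BRARE
HMED_XENLA HMEN_ARTSF P90688 HME1_BRARE
HME3_BRARE HMIN_DROME HMEN_ANOGA Q26371
HMEN_BOMMO HMEN_DROVI HMEN_DROME HMIN_BOMMO
Q25212 O76848 Q26601 HX11_MOUSE
HX11_HUMAN
""".split()


def composite_index(motif_hits):
    """Tally matches per motif, grouped by how many motifs each code matched.

    motif_hits[i] is the set of sequence codes matched by motif i + 1
    (hypothetical input shape chosen for this sketch).
    Returns {elements_matched: [count for motif 1, count for motif 2, ...]},
    with rows from all elements down to 2, as in the table above.
    """
    n_motifs = len(motif_hits)
    matches_per_code = Counter(code for hits in motif_hits for code in hits)
    table = {}
    for k in range(n_motifs, 1, -1):
        codes_with_k = {c for c, m in matches_per_code.items() if m == k}
        table[k] = [len(codes_with_k & hits) for hits in motif_hits]
    return table


# Every true positive matches both motifs in this entry.
index = composite_index([set(TRUE_POSITIVES), set(TRUE_POSITIVES)])
print(index)  # {2: [25, 25]}
```

With both motifs present in all 25 codes, the single row `2| 25 25` is reproduced.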
PROTEIN TITLES
HME1_CHICK HOMEOBOX PROTEIN ENGRAILED-1 (GG-EN-1) - GALLUS GALLUS (CHIC
HME1_MOUSE HOMEOBOX PROTEIN ENGRAILED-1 (MO-EN-1) - MUS MUSCULUS (MOUSE
HME1_HUMAN HOMEOBOX PROTEIN ENGRAILED-1 (HU-EN-1) - HOMO SAPIENS (HUMAN
HMEC_XENLA HOMEOBOX PROTEIN ENGRAILED-2A (EN-2A) (EN2 1.4) - XENOPUS LA
HME2_CHICK HOMEOBOX PROTEIN ENGRAILED-2 (GG-EN-2) - GALLUS GALLUS (CHIC
HME2_HUMAN HOMEOBOX PROTEIN ENGRAILED-2 (HU-EN-2) - HOMO SAPIENS (HUMAN
HME2_MOUSE HOMEOBOX PROTEIN ENGRAILED-2 (MO-EN-2) - MUS MUSCULUS (MOUSE
HME2_BRARE HOMEOBOX PROTEIN ENGRAILED-2 (ZF-EN-2) - BRACHYDANIO RERIO (
HMED_XENLA HOMEOBOX PROTEIN ENGRAILED-2B (EN-2B) (EN2 MABEN) - XENOPUS
HMEN_ARTSF HOMEOBOX PROTEIN ENGRAILED - ARTEMIA SANFRANCISCANA (BRINE S
P90688 ENGRAILED PROTEIN - BRANCHIOSTOMA FLORIDAE (FLORIDA LANCELET
HME1_BRARE HOMEOBOX PROTEIN ENGRAILED-1 - BRACHYDANIO RERIO (ZEBRAFISH)
HME3_BRARE HOMEOBOX PROTEIN ENGRAILED-3 (ZF-EN-1) - BRACHYDANIO RERIO (
HMIN_DROME HOMEOBOX PROTEIN INVECTED - DROSOPHILA MELANOGASTER (FRUIT F
HMEN_ANOGA SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - ANOPHELES
Q26371 ENGRAILED HOMOLOG - TRIBOLIUM CASTANEUM (RED FLOUR BEETLE).
HMEN_BOMMO SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - BOMBYX MO
HMEN_DROVI SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - DROSOPHIL
HMEN_DROME SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - DROSOPHIL
HMIN_BOMMO HOMEOBOX PROTEIN INVECTED - BOMBYX MORI (SILK MOTH).
Q25212 INVECTED HOMEODOMAIN PROTEIN - JUNONIA COENIA (PEACOCK BUTTE
O76848 HOMEOBOX PROTEIN - CUPIENNIUS SALEI.
Q26601 HOMEOBOX PROTEIN ENGRAILED-LIKE SMOX-2 - SCHISTOSOMA MANSONI
HX11_MOUSE HOMEOBOX PROTEIN HOX-11 (T-CELL LEUKEMIA HOMEOBOX 1) (HOMEOB
HX11_HUMAN HOMEOBOX PROTEIN HOX-11 (TCL-3 PROTO-ONCOGENE) - HOMO SAPIEN
SCAN HISTORY
OWL20_0 2 100 NSINGLE
OWL26_0 2 300 NSINGLE
SPTR37_9f 2 300 NSINGLE
INITIAL MOTIF SETS
ENGRAILED1 Length of motif = 18 Motif number = 1
Engrailed motif I - 1
PCODE ST INT
EEKRPRTAFSGAQLARLK HMEN_BOMMO 279 279
DEKRPRTAFSGPQLARLK HMIN_BOMMO 371 371
EDKRPRTAFTAEQLQRLK HME1_MOUSE 34 34
EEKRPRTAFSAEQLARLK HME3_APIME 18 18
EDKRPRTAFSGTQLARLK HMIN_DROMO 470 470
DEKRPRTAFSASQLQRLK HMEN_TRIGR 36 36
ENGRAILED2 Length of motif = 18 Motif number = 2
Engrailed motif II - 1
PCODE ST INT
RNPLALQLMAQGLYNHST HMEN_BOMMO 342 45
RNPLALQLMAQGLYNHST HMIN_BOMMO 434 45
KNGLALHLMAQGLYNHST HME1_MOUSE 97 45
KNPLALQLMAQGLYNHST HME3_APIME 81 45
KNPLALQLMAQGLYNHST HMIN_DROMO 533 45
KNDLARQLMAQGLYNHST HMEN_TRIGR 99 45
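To make the variability of the seed alignment concrete, the Python sketch below counts residues per column of initial motif 2 and uses those counts to score another 18-residue window. It is only an unweighted illustration: the scanning software behind the database is not reproduced here, and the helper names `column_counts` and `score` are assumptions introduced for this example.

```python
from collections import Counter

# Initial motif 2 (ENGRAILED2), copied from the table above.
INITIAL_MOTIF_2 = [
    "RNPLALQLMAQGLYNHST",  # HMEN_BOMMO
    "RNPLALQLMAQGLYNHST",  # HMIN_BOMMO
    "KNGLALHLMAQGLYNHST",  # HME1_MOUSE
    "KNPLALQLMAQGLYNHST",  # HME3_APIME
    "KNPLALQLMAQGLYNHST",  # HMIN_DROMO
    "KNDLARQLMAQGLYNHST",  # HMEN_TRIGR
]


def column_counts(aligned):
    """Return one Counter of residues per alignment column."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all motif rows must have the same length")
    return [Counter(column) for column in zip(*aligned)]


def score(window, counts):
    """Sum, over positions, how often the window's residue occurs in the seed."""
    if len(window) != len(counts):
        raise ValueError("window length must equal motif length")
    return sum(col[res] for res, col in zip(window, counts))


counts = column_counts(INITIAL_MOTIF_2)

# 1-based positions where all six seed sequences agree.
invariant = [i + 1 for i, col in enumerate(counts) if len(col) == 1]
print(invariant)  # [2, 4, 5, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18]

# Score the motif 2 window of HME2_HUMAN from the final set.
print(score("KNTLAVHLMAQGLYNHST", counts))  # 89
```

Fourteen of the 18 columns are invariant in the seed, so most of the score for any engrailed-like window comes from the LMAQGLYNHST tail.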
FINAL MOTIF SETS
ENGRAILED1 Length of motif = 18 Motif number = 1
Engrailed motif I - 2
PCODE ST INT
EDKRPRTAFTAEQLQRLK HME1_CHICK 243 243
EDKRPRTAFTAEQLQRLK HME1_MOUSE 311 311
EDKRPRTAFTAEQLQRLK HME1_HUMAN 301 301
EDKRPRTAFTADQLQRLK HMEC_XENLA 175 175
EDKRPRTAFTAEQLQRLK HME2_CHICK 198 198
EDKRPRTAFTAEQLQRLK HME2_HUMAN 242 242
EDKRPRTAFTAEQLQRLK HME2_MOUSE 234 234
EDKRPRTAFTAEQLQRLK HME2_BRARE 175 175
EDKRPRTAFTAEQLQRLK HMED_XENLA 175 175
DEKRPRTAFTAEQLSRLK HMEN_ARTSF 248 248
EEKRPRTAFTSEQLQRLK P90688 158 158
DDKRPRTAFTAEQLQRLK HME1_BRARE 142 142
EDKRPRTAFTAEQLQRLK HME3_BRARE 171 171
EDKRPRTAFSGTQLARLK HMIN_DROME 470 470
EEKRPRTAFSNAQLQRLK HMEN_ANOGA 497 497
EEKRPRTAFSGAQLARLK Q26371 227 227
EEKRPRTAFSGAQLARLK HMEN_BOMMO 279 279
DEKRPRTAFSSEQLARLK HMEN_DROVI 485 485
DEKRPRTAFSSEQLARLK HMEN_DROME 453 453
DEKRPRTAFSGPQLARLK HMIN_BOMMO 371 371
DEKRPRTAFSGPQLARLK Q25212 38 38
DDKRPRTAFTADQLSRLK O76848 145 145
NLKRPRTSFTVPQLKRLS Q26601 422 422
KKKKPRTSFTRLQICELE HX11_MOUSE 202 202
KKKKPRTSFTRLQICELE HX11_HUMAN 200 200
ENGRAILED2 Length of motif = 18 Motif number = 2
Engrailed motif II - 2
PCODE ST INT
KNGLALHLMAQGLYNHST HME1_CHICK 306 45
KNGLALHLMAQGLYNHST HME1_MOUSE 374 45
KNGLALHLMAQGLYNHST HME1_HUMAN 364 45
KNSLALHLMAQGLYNHST HMEC_XENLA 238 45
KNSLAVHLMAQGLYNHST HME2_CHICK 261 45
KNTLAVHLMAQGLYNHST HME2_HUMAN 305 45
KNTLAVHLMAQGLYNHST HME2_MOUSE 297 45
KNGLAIHLMAQGLYNHST HME2_BRARE 238 45
KNSLALHLMAQGLYNHAT HMED_XENLA 238 45
KNPLALQLMAQGLYNHST HMEN_ARTSF 311 45
RNGLALHLMAQGLYNHST P90688 221 45
KNALAMQLMAQGLYNHST HME1_BRARE 205 45
KNTLAVHLMAQGLYNHAT HME3_BRARE 234 45
KNPLALQLMAQGLYNHST HMIN_DROME 533 45
KNPLALQLMAQGLYNHST HMEN_ANOGA 560 45
KNPLALQLMAQGLYNHST Q26371 290 45
RNPLALQLMAQGLYNHST HMEN_BOMMO 342 45
KNPLALQLMAQGLYNHTT HMEN_DROVI 548 45
KNPLALQLMAQGLYNHTT HMEN_DROME 516 45
RNPLALQLMAQGLYNHST HMIN_BOMMO 434 45
RNPLALQLMAQGLYNHST Q25212 101 45
RSALALQLMAQGLYNHST O76848 208 45
QNCLALHLMAEGLYNHSV Q26601 485 45
QKSLAQPLPADPLCVHNS HX11_MOUSE 286 66
QKSLAQPLPADPLCVHNS HX11_HUMAN 284 66
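In the tables above, the INT value of motif 2 appears to be the number of residues between the end of motif 1 and the start of motif 2, that is, ST(motif 2) minus (ST(motif 1) plus the motif length of 18); for motif 1, INT simply equals ST in every row. The Python sketch below checks that arithmetic on a subset of rows copied from the final motif sets. The helper name `gap_between` is an assumption introduced for this example.

```python
MOTIF_LENGTH = 18  # "Length of motif = 18" for both elements

# (code, ST of motif 1, ST of motif 2, INT listed for motif 2),
# a subset of rows from the final motif sets above.
ROWS = [
    ("HME1_CHICK", 243, 306, 45),
    ("HMEN_DROME", 453, 516, 45),
    ("Q26601", 422, 485, 45),
    ("HX11_MOUSE", 202, 286, 66),
    ("HX11_HUMAN", 200, 284, 66),
]


def gap_between(start_1, start_2, length=MOTIF_LENGTH):
    """Residues separating the end of motif 1 from the start of motif 2."""
    return start_2 - (start_1 + length)


gaps = {code: gap_between(s1, s2) for code, s1, s2, _ in ROWS}
mismatches = [
    code for code, s1, s2, listed in ROWS if gap_between(s1, s2) != listed
]

print(gaps)
print(mismatches)  # [] when every computed gap equals the listed INT
```

For these rows the computed gaps agree with the listed values: 45 residues for the engrailed and invected proteins, and 66 for the two HOX-11 sequences.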