ENGRAILED        Engrailed homeodomain signature
 Type of fingerprint: COMPOUND with 2  elements
Links:
   PRINTS; PR00024 HOMEOBOX; PR00025 ANTENNAPEDIA; PR00027 PAIREDBOX 
   PRINTS; PR00028 POUDOMAIN; PR00029 OCTAMER; PR00030 HTHCRO 
   PRINTS; PR00031 HTHREPRESSR
   INTERPRO; IPR000747
   PROSITE; PS00033 ENGRAILED

 Creation date 08-JUN-1993; UPDATE 14-JUN-1999

   1. XUE, Z.G., GEHRING, W.J. AND LE DOUARIN, N.M.
   Quox-1, a quail homeobox gene expressed in the embryonic central nervous
   system, including the forebrain.
   PROC.NATL.ACAD.SCI.U.S.A. 88(6) 2427-2431 (1991).

   2. GEHRING, W.J.
   Homeo boxes in the study of development.
   SCIENCE 236 1245-1252 (1987).

   3. ANGERER, L.M., DOLECKI, G.J., GAGNON, M.L., LUM, R., WANG, G., YANG, Q.,
   HUMPHREYS, T. AND ANGERER, R.C.
   Progressively restricted expression of a homeo box gene within the aboral
   ectoderm of developing sea urchin embryos.
   GENES DEV. 3(3) 370-383 (1989).

   4. SASAKI, H., YOKOYAMA, E. AND KUROIWA, A.
   Specific DNA-binding of the 2 chicken deformed family homeodomain proteins,
   Chox-1.4 and Chox-A.
   NUCLEIC ACIDS RES. 18 1739-1747 (1990).

   5. BRENNAN, R.G., TAKEDA, Y., KIM, J., ANDERSON, W.F. AND MATTHEWS, B.W.
   Crystallization of a complex of cro repressor with a 17 base pair operator.
   J.MOL.BIOL. 188 115-118 (1986).

   Organisms develop according to a precise program that specifies the body
   plan in intricate detail and also determines the timing of developmental
   events [1]. The highly complex nature of these events has suggested that 
   the process may be regulated by proteins capable of controlling the 
   temporal and spatial expression of many structural genes. The genes for 
   this process were first discovered as homeotic mutations in Drosophila [2],
   and many similar genes are now known in a wide variety of organisms.
   
   Proteins that regulate developmental gene expression are nuclear proteins 
   [3] that contain a conserved domain known as the homeobox, the flanking
   sequences of which differ considerably among different proteins. The
   homeodomain includes the helix-turn-helix (HTH) motif, which binds to DNA in a 
   sequence-specific manner to exert a temporal and spatial regulation of 
   developmental gene expression [4]. The second helix of this motif binds 
   to DNA via a number of hydrogen bonds and hydrophobic interactions, which
   occur between specific side chains and the exposed bases and thymine methyl
   groups within the major groove. The first helix may help to stabilise the 
   structure [5].
   
   Many homeodomain-containing proteins have now been sequenced and, while
   the homeodomain-flanking regions vary, characteristic conserved sequences
   upstream of the domain allow the proteins to be grouped into 3 subfamilies:
   the so-called antennapedia, engrailed and 'paired box' proteins. Engrailed 
   plays an important role in Drosophila segmentation and neurogenesis, 
   affecting genes in posterior compartments of the developing embryo. It is 
   also required for the development of the central nervous system. Homologues
   found in other species may play a role in neurogenesis, possibly in both 
   the compartmentalisation of the developing neural tube and specification of
   particular neuronal populations. Members of the engrailed subfamily of
   proteins contain a conserved region of 20 amino acids located C-terminal
   to the homeobox, the specific function of which is unclear.
  
   ENGRAILED is a 2-element fingerprint that provides a signature for the
   engrailed-type homeobox proteins. The fingerprint was derived from an 
   initial alignment of 6 sequences: motif 2 encodes the conserved region
   C-terminal to the homeobox (cf. PROSITE pattern ENGRAILED (PS00033)).
   Two iterations on OWL20.0 were required to reach convergence, at which 
   point a true set comprising 20 sequences was identified.
  
   An update on SPTR37_9f identified a true set of 25 sequences.
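
   The conservation of motif 2 across the 6-sequence initial alignment can be
   illustrated with a minimal Python sketch. The sequences are copied from the
   initial motif set of this entry; the helper names (column_profile,
   consensus, invariant_positions) are hypothetical, and the simple
   majority-residue consensus is an assumption made for illustration, not the
   scanning method used to build the fingerprint.

```python
from collections import Counter

# Motif 2 rows of the initial motif set, as listed in this entry
# (HMEN_BOMMO, HMIN_BOMMO, HME1_MOUSE, HME3_APIME, HMIN_DROMO, HMEN_TRIGR).
INITIAL_MOTIF_2 = [
    "RNPLALQLMAQGLYNHST",
    "RNPLALQLMAQGLYNHST",
    "KNGLALHLMAQGLYNHST",
    "KNPLALQLMAQGLYNHST",
    "KNPLALQLMAQGLYNHST",
    "KNDLARQLMAQGLYNHST",
]


def column_profile(motifs):
    """Return one Counter of residues per alignment column."""
    return [Counter(column) for column in zip(*motifs)]


def consensus(motifs):
    """Most frequent residue in each column (illustrative assumption)."""
    return "".join(c.most_common(1)[0][0] for c in column_profile(motifs))


def invariant_positions(motifs):
    """1-based positions at which every sequence carries the same residue."""
    return [i + 1 for i, c in enumerate(column_profile(motifs)) if len(c) == 1]


if __name__ == "__main__":
    print(consensus(INITIAL_MOTIF_2))
    print(invariant_positions(INITIAL_MOTIF_2))
```

   Only positions 1, 3, 6 and 7 vary among the six seed sequences, which is
   consistent with motif 2 being described as a conserved region.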

  SUMMARY INFORMATION
     25 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    2|  25   25  
   --+-----------
     |   1    2  

True positives..
 HME1_CHICK     HME1_MOUSE     HME1_HUMAN     HMEC_XENLA     
 HME2_CHICK     HME2_HUMAN     HME2_MOUSE     HME2_BRARE     
 HMED_XENLA     HMEN_ARTSF     P90688         HME1_BRARE     
 HME3_BRARE     HMIN_DROME     HMEN_ANOGA     Q26371         
 HMEN_BOMMO     HMEN_DROVI     HMEN_DROME     HMIN_BOMMO     
 Q25212         O76848         Q26601         HX11_MOUSE     
 HX11_HUMAN     


  PROTEIN TITLES
   HME1_CHICK       HOMEOBOX PROTEIN ENGRAILED-1 (GG-EN-1) - GALLUS GALLUS (CHIC
   HME1_MOUSE       HOMEOBOX PROTEIN ENGRAILED-1 (MO-EN-1) - MUS MUSCULUS (MOUSE
   HME1_HUMAN       HOMEOBOX PROTEIN ENGRAILED-1 (HU-EN-1) - HOMO SAPIENS (HUMAN
   HMEC_XENLA       HOMEOBOX PROTEIN ENGRAILED-2A (EN-2A) (EN2 1.4) - XENOPUS LA
   HME2_CHICK       HOMEOBOX PROTEIN ENGRAILED-2 (GG-EN-2) - GALLUS GALLUS (CHIC
   HME2_HUMAN       HOMEOBOX PROTEIN ENGRAILED-2 (HU-EN-2) - HOMO SAPIENS (HUMAN
   HME2_MOUSE       HOMEOBOX PROTEIN ENGRAILED-2 (MO-EN-2) - MUS MUSCULUS (MOUSE
   HME2_BRARE       HOMEOBOX PROTEIN ENGRAILED-2 (ZF-EN-2) - BRACHYDANIO RERIO (
   HMED_XENLA       HOMEOBOX PROTEIN ENGRAILED-2B (EN-2B) (EN2 MABEN) - XENOPUS 
   HMEN_ARTSF       HOMEOBOX PROTEIN ENGRAILED - ARTEMIA SANFRANCISCANA (BRINE S
   P90688           ENGRAILED PROTEIN - BRANCHIOSTOMA FLORIDAE (FLORIDA LANCELET
   HME1_BRARE       HOMEOBOX PROTEIN ENGRAILED-1 - BRACHYDANIO RERIO (ZEBRAFISH)
   HME3_BRARE       HOMEOBOX PROTEIN ENGRAILED-3 (ZF-EN-1) - BRACHYDANIO RERIO (
   HMIN_DROME       HOMEOBOX PROTEIN INVECTED - DROSOPHILA MELANOGASTER (FRUIT F
   HMEN_ANOGA       SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - ANOPHELES
   Q26371           ENGRAILED HOMOLOG - TRIBOLIUM CASTANEUM (RED FLOUR BEETLE).
   HMEN_BOMMO       SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - BOMBYX MO
   HMEN_DROVI       SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - DROSOPHIL
   HMEN_DROME       SEGMENTATION POLARITY HOMEOBOX PROTEIN ENGRAILED - DROSOPHIL
   HMIN_BOMMO       HOMEOBOX PROTEIN INVECTED - BOMBYX MORI (SILK MOTH).
   Q25212           INVECTED HOMEODOMAIN PROTEIN - JUNONIA COENIA (PEACOCK BUTTE
   O76848           HOMEOBOX PROTEIN - CUPIENNIUS SALEI.
   Q26601           HOMEOBOX PROTEIN ENGRAILED-LIKE SMOX-2 - SCHISTOSOMA MANSONI
   HX11_MOUSE       HOMEOBOX PROTEIN HOX-11 (T-CELL LEUKEMIA HOMEOBOX 1) (HOMEOB
   HX11_HUMAN       HOMEOBOX PROTEIN HOX-11 (TCL-3 PROTO-ONCOGENE) - HOMO SAPIEN

  SCAN HISTORY
   OWL20_0     2  100  NSINGLE
   OWL26_0     2  300  NSINGLE
   SPTR37_9f   2  300  NSINGLE

  INITIAL MOTIF SETS

   ENGRAILED1  Length of motif = 18  Motif number = 1
   Engrailed motif I - 1
                       PCODE        ST   INT
   EEKRPRTAFSGAQLARLK  HMEN_BOMMO  279   279
   DEKRPRTAFSGPQLARLK  HMIN_BOMMO  371   371
   EDKRPRTAFTAEQLQRLK  HME1_MOUSE   34    34
   EEKRPRTAFSAEQLARLK  HME3_APIME   18    18
   EDKRPRTAFSGTQLARLK  HMIN_DROMO  470   470
   DEKRPRTAFSASQLQRLK  HMEN_TRIGR   36    36

   ENGRAILED2  Length of motif = 18  Motif number = 2
   Engrailed motif II - 1
                       PCODE        ST   INT
   RNPLALQLMAQGLYNHST  HMEN_BOMMO  342    45
   RNPLALQLMAQGLYNHST  HMIN_BOMMO  434    45
   KNGLALHLMAQGLYNHST  HME1_MOUSE   97    45
   KNPLALQLMAQGLYNHST  HME3_APIME   81    45
   KNPLALQLMAQGLYNHST  HMIN_DROMO  533    45
   KNDLARQLMAQGLYNHST  HMEN_TRIGR   99    45

  FINAL MOTIF SETS

   ENGRAILED1  Length of motif = 18  Motif number = 1
   Engrailed motif I - 2
                       PCODE        ST   INT
   EDKRPRTAFTAEQLQRLK  HME1_CHICK  243   243
   EDKRPRTAFTAEQLQRLK  HME1_MOUSE  311   311
   EDKRPRTAFTAEQLQRLK  HME1_HUMAN  301   301
   EDKRPRTAFTADQLQRLK  HMEC_XENLA  175   175
   EDKRPRTAFTAEQLQRLK  HME2_CHICK  198   198
   EDKRPRTAFTAEQLQRLK  HME2_HUMAN  242   242
   EDKRPRTAFTAEQLQRLK  HME2_MOUSE  234   234
   EDKRPRTAFTAEQLQRLK  HME2_BRARE  175   175
   EDKRPRTAFTAEQLQRLK  HMED_XENLA  175   175
   DEKRPRTAFTAEQLSRLK  HMEN_ARTSF  248   248
   EEKRPRTAFTSEQLQRLK  P90688      158   158
   DDKRPRTAFTAEQLQRLK  HME1_BRARE  142   142
   EDKRPRTAFTAEQLQRLK  HME3_BRARE  171   171
   EDKRPRTAFSGTQLARLK  HMIN_DROME  470   470
   EEKRPRTAFSNAQLQRLK  HMEN_ANOGA  497   497
   EEKRPRTAFSGAQLARLK  Q26371      227   227
   EEKRPRTAFSGAQLARLK  HMEN_BOMMO  279   279
   DEKRPRTAFSSEQLARLK  HMEN_DROVI  485   485
   DEKRPRTAFSSEQLARLK  HMEN_DROME  453   453
   DEKRPRTAFSGPQLARLK  HMIN_BOMMO  371   371
   DEKRPRTAFSGPQLARLK  Q25212       38    38
   DDKRPRTAFTADQLSRLK  O76848      145   145
   NLKRPRTSFTVPQLKRLS  Q26601      422   422
   KKKKPRTSFTRLQICELE  HX11_MOUSE  202   202
   KKKKPRTSFTRLQICELE  HX11_HUMAN  200   200

   ENGRAILED2  Length of motif = 18  Motif number = 2
   Engrailed motif II - 2
                       PCODE        ST   INT
   KNGLALHLMAQGLYNHST  HME1_CHICK  306    45
   KNGLALHLMAQGLYNHST  HME1_MOUSE  374    45
   KNGLALHLMAQGLYNHST  HME1_HUMAN  364    45
   KNSLALHLMAQGLYNHST  HMEC_XENLA  238    45
   KNSLAVHLMAQGLYNHST  HME2_CHICK  261    45
   KNTLAVHLMAQGLYNHST  HME2_HUMAN  305    45
   KNTLAVHLMAQGLYNHST  HME2_MOUSE  297    45
   KNGLAIHLMAQGLYNHST  HME2_BRARE  238    45
   KNSLALHLMAQGLYNHAT  HMED_XENLA  238    45
   KNPLALQLMAQGLYNHST  HMEN_ARTSF  311    45
   RNGLALHLMAQGLYNHST  P90688      221    45
   KNALAMQLMAQGLYNHST  HME1_BRARE  205    45
   KNTLAVHLMAQGLYNHAT  HME3_BRARE  234    45
   KNPLALQLMAQGLYNHST  HMIN_DROME  533    45
   KNPLALQLMAQGLYNHST  HMEN_ANOGA  560    45
   KNPLALQLMAQGLYNHST  Q26371      290    45
   RNPLALQLMAQGLYNHST  HMEN_BOMMO  342    45
   KNPLALQLMAQGLYNHTT  HMEN_DROVI  548    45
   KNPLALQLMAQGLYNHTT  HMEN_DROME  516    45
   RNPLALQLMAQGLYNHST  HMIN_BOMMO  434    45
   RNPLALQLMAQGLYNHST  Q25212      101    45
   RSALALQLMAQGLYNHST  O76848      208    45
   QNCLALHLMAEGLYNHSV  Q26601      485    45
   QKSLAQPLPADPLCVHNS  HX11_MOUSE  286    66
   QKSLAQPLPADPLCVHNS  HX11_HUMAN  284    66
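
   In the motif sets above, the INT value listed for motif 2 appears to equal
   the number of residues between the end of motif 1 and the start of motif 2
   in the same sequence (for motif 1, INT simply repeats ST). The short Python
   sketch below checks that reading against a few rows copied from the final
   motif sets. The function names are hypothetical and the interpretation of
   the ST and INT columns is an assumption inferred from the tabulated values,
   not a definition taken from the database documentation.

```python
MOTIF_LENGTH = 18  # "Length of motif = 18" for both elements

# (PCODE, motif 1 ST, motif 2 ST, motif 2 INT), copied from the final motif sets.
ROWS = [
    ("HME1_CHICK", 243, 306, 45),
    ("HMEN_DROME", 453, 516, 45),
    ("Q26601", 422, 485, 45),
    ("HX11_MOUSE", 202, 286, 66),
    ("HX11_HUMAN", 200, 284, 66),
]


def gap_between_motifs(start_1, start_2, length=MOTIF_LENGTH):
    """Residues separating the end of motif 1 from the start of motif 2."""
    return start_2 - (start_1 + length)


def inconsistent_rows(rows):
    """Return the PCODEs whose listed INT differs from the computed gap."""
    return [
        pcode
        for pcode, start_1, start_2, interval in rows
        if gap_between_motifs(start_1, start_2) != interval
    ]


if __name__ == "__main__":
    for pcode, start_1, start_2, interval in ROWS:
        print(pcode, gap_between_motifs(start_1, start_2), interval)
    print("inconsistent:", inconsistent_rows(ROWS))
```

   Under this reading, the two HOX-11 sequences stand out with a 66-residue
   spacing where every other member of the true set has 45.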
