WORKLIST ENTRIES (1):

GLIADGLUTEN View alignment      Gliadin and LMW glutenin superfamily signature
 Type of fingerprint: COMPOUND with 3  elements
Links:
   PRINTS; PR00209 GLIADIN; PR00210 GLUTENIN; PR00211 GLUTELIN
   INTERPRO; IPR001954

 Creation date 23-OCT-1992; UPDATE 28-JUL-1999

   1. OKITA, T.W., CHEESBROUGH, V. AND REEVES, C.D.
   Evolution and heterogeneity of the alpha-type, beta-type, and gamma-type
   gliadin DNA sequences.
   J.BIOL.CHEM. 260(13) 8203-8213 (1985).

   2. RAFALSKI, J.A.
   Structure of wheat gamma-gliadin genes.
   GENE 43(3) 221-229 (1986).

   3. THOMPSON, R.D., BARTELS, D. AND HARBERD, N.P.
   Nucleotide sequence of a gene from chromosome 1D of wheat encoding a HMW
   glutenin subunit
   NUCLEIC ACIDS RES. 13(19) 6833-6848 (1985).

   4. ANDERSON, O.D., GREENE, F.C., YIP, R.E., HALFORD, N.G., SHEWRY, P.R. AND 
   MALPICAROMERO, J.M.
   Nucleotide sequences of the 2 high molecular weight glutenin genes from the
   D-genome of a hexaploid bread wheat, Triticum aestivum L-CV Cheyenne.
   NUCLEIC ACIDS RES. 17(1) 461-462 (1989).

   Gluten is the protein component of wheat flour. It consists of numerous
   proteins, which are of 2 different types responsible for different physical
   properties of dough: the glutenins, which are primarily responsible for the
   elasticity, and the gliadins, which contribute to the extensibility. The 
   glutenins themselves are of 2 different types, termed low and high [3] 
   molecular weight subunits. The latter have unusual structures: a central 
   region contains multiple tandem repeats of blocks of amino acids, forming a
   loose helix based on beta reverse turns, and is flanked by globular 
   regions, which can be cross-linked by disulphide bonds. The result is an 
   elastic network in which the elasticity may derive from the cross-linking, 
   the helical structure, or a combination of these [4].
   
   The gliadins are also of different types (e.g., alpha/beta or gamma) and, 
   like the glutenins, contain repetitive sequences [1] that form loose 
   helical structures, but are usually associated with more extensive non-
   repetitive regions, which are compact and globular [2]. 
   
   GLIADGLUTEN is a 3-element fingerprint that provides a signature for the
   gliadins and low molecular weight glutenins. The fingerprint was derived 
   from an initial alignment of 7 sequences: motifs 1 and 2 encode 2 Gln/Cys-
   rich regions, and motif 3 a hydrophobic region. Three iterations on OWL18.0
   were required to reach convergence, at which point a true set comprising
   45 sequences was identified, including seed storage proteins such as
   hordeins, avenins and secalins. A subfamily of sequences matching 2 motifs
   (all matching motif 1) was also identified, this comprising further seed
   storage proteins - these are the prolamins from rice, whose structures are
   different from the storage proteins from the wheat/barley/oat cereal crops.
  
   An update on SPTR37_9f identified a true set of 51 sequences, and 4
   partial matches.

  SUMMARY INFORMATION
     51 codes involving  3 elements
      4 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    3|  51   51   51  
    2|   4    0    4  
   --+----------------
     |   1    2    3  

True positives..
 P93792         GDB1_WHEAT     GLTB_WHEAT     P93791         
 P94021         O22108         GDA9_WHEAT     P93790         
 HOR1_HORVU     GDA7_WHEAT     Q41546         Q41529         
 Q40026         GDA4_WHEAT     GLTC_WHEAT     GDA3_WHEAT     
 GLTA_WHEAT     P93794         Q09114         Q09072         
 Q40021         GDA2_WHEAT     GDBB_WHEAT     AVE3_AVESA     
 Q41545         GDA5_WHEAT     GDA1_WHEAT     Q41509         
 Q41632         GDA6_WHEAT     O22116         Q38794         
 Q41531         GDA0_WHEAT     GDBX_WHEAT     Q41528         
 GDB2_WHEAT     Q41530         P93793         HOG1_HORVU     
 Q09071         PRO7_ORYSA     HOG3_HORVU     GLU2_MAIZE     
 Q41295         Q41506         Q40714         Q42398         
 PRO2_ORYSA     Q00318         ZEB2_MAIZE     
Subfamily:  Codes involving 2 elements
 Subfamily True positives..
 P93413         Q43603         PRO6_ORYSA     PRO4_ORYSA     


  PROTEIN TITLES
   P93792           LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTEIN - TRITICUM AES
   GDB1_WHEAT       GAMMA-GLIADIN B-I PRECURSOR - TRITICUM AESTIVUM (WHEAT).
   GLTB_WHEAT       GLUTENIN, LOW MOLECULAR WEIGHT SUBUNIT 1D1 PRECURSOR - TRITI
   P93791           LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTEIN - TRITICUM AES
   P94021           LMW GLUTENIN (LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTEIN)
   O22108           LMW GLUTENIN - TRITICUM AESTIVUM (WHEAT).
   GDA9_WHEAT       ALPHA/BETA-GLIADIN MM1 PRECURSOR (PROLAMIN) - TRITICUM AESTI
   P93790           LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTEIN - TRITICUM AES
   HOR1_HORVU       B1-HORDEIN PRECURSOR - HORDEUM VULGARE (BARLEY).
   GDA7_WHEAT       ALPHA/BETA-GLIADIN CLONE PW8142 PRECURSOR (PROLAMIN) - TRITI
   Q41546           ALPHA/BETA-GLIADIN STORAGE PROTEIN PRECURSOR - TRITICUM AEST
   Q41529           ALPHA-GLIADIN STORAGE PROTEIN - TRITICUM AESTIVUM (WHEAT).
   Q40026           B HORDEIN PRECURSOR - HORDEUM VULGARE (BARLEY).
   GDA4_WHEAT       ALPHA/BETA-GLIADIN A-IV PRECURSOR (PROLAMIN) - TRITICUM AEST
   GLTC_WHEAT       GLUTENIN, LOW MOLECULAR WEIGHT SUBUNIT PTDUCD1 PRECURSOR - T
   GDA3_WHEAT       ALPHA/BETA-GLIADIN A-III PRECURSOR (PROLAMIN) - TRITICUM AES
   GLTA_WHEAT       GLUTENIN, LOW MOLECULAR WEIGHT SUBUNIT PRECURSOR - TRITICUM 
   P93794           LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTEIN - TRITICUM AES
   Q09114           AVENIN N9 (PROLAMIN) - AVENA SATIVA (OAT).
   Q09072           AVENIN PRECURSOR (PROLAMIN) (CLONE PAV122) - AVENA SATIVA (O
   Q40021           B1 HORDEIN - HORDEUM VULGARE (BARLEY).
   GDA2_WHEAT       ALPHA/BETA-GLIADIN A-II PRECURSOR (PROLAMIN) - TRITICUM AEST
   GDBB_WHEAT       GAMMA-GLIADIN B PRECURSOR - TRITICUM AESTIVUM (WHEAT).
   AVE3_AVESA       AVENIN-3 PRECURSOR (PROLAMIN) - AVENA SATIVA (OAT).
   Q41545           (T. AESTIVUM) ALPHA-TYPE GLIADIN PRECURSOR - TRITICUM AESTIV
   GDA5_WHEAT       ALPHA/BETA-GLIADIN A-V PRECURSOR (PROLAMIN) - TRITICUM AESTI
   GDA1_WHEAT       ALPHA/BETA-GLIADIN A-I PRECURSOR (PROLAMIN) - TRITICUM AESTI
   Q41509           ALPHA-GLIADIN - TRITICUM AESTIVUM (WHEAT).
   Q41632           ALPHA/BETA-TYPE GLIADIN - TRITICUM URARTU.
   GDA6_WHEAT       ALPHA/BETA-GLIADIN CLONE PW1215 PRECURSOR (PROLAMIN) - TRITI
   O22116           LMW GLUTENIN - TRITICUM AESTIVUM (WHEAT).
   Q38794           SEED STORAGE PROTEIN - AVENA SATIVA (OAT).
   Q41531           ALPHA-GLIADIN STORAGE PROTEIN - TRITICUM AESTIVUM (WHEAT).
   GDA0_WHEAT       ALPHA/BETA-GLIADIN PRECURSOR (PROLAMIN) - TRITICUM AESTIVUM 
   GDBX_WHEAT       GAMMA-GLIADIN PRECURSOR - TRITICUM AESTIVUM (WHEAT).
   Q41528           ALPHA-GLIADIN - TRITICUM AESTIVUM (WHEAT).
   GDB2_WHEAT       GAMMA-GLIADIN PRECURSOR - TRITICUM AESTIVUM (WHEAT).
   Q41530           ALPHA-GLIADIN STORAGE PROTEIN - TRITICUM AESTIVUM (WHEAT).
   P93793           LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTEIN - TRITICUM AES
   HOG1_HORVU       GAMMA-HORDEIN 1 PRECURSOR - HORDEUM VULGARE (BARLEY).
   Q09071           AVENIN PRECURSOR (PROLAMIN) (CLONE PAV10) - AVENA SATIVA (OA
   PRO7_ORYSA       PROLAMIN PPROL 17 PRECURSOR - ORYZA SATIVA (RICE).
   HOG3_HORVU       GAMMA-HORDEIN 3 - HORDEUM VULGARE (BARLEY).
   GLU2_MAIZE       GLUTELIN 2 PRECURSOR (ZEIN-GAMMA) (27 KD ZEIN) (ALCOHOL-SOLU
   Q41295           ENDOSPERM TISSUE PRECURSOR - SORGHUM BICOLOR MILO (SORGHUM).
   Q41506           GAMMA-KAFIRIN PREPROTEIN PRECURSOR - SORGHUM VULGARE (SORGHU
   Q40714           PROLAMIN PRECURSOR - ORYZA SATIVA (RICE).
   Q42398           13 KD PROLAMIN PRECURSOR - ORYZA SATIVA (RICE).
   PRO2_ORYSA       13 KD PROLAMIN PRECURSOR - ORYZA SATIVA (RICE).
   Q00318           22 KD GAMMA-COIXIN PRECURSOR - COIX LACHRYMA-JOBI (JOBS'TEAR
   ZEB2_MAIZE       ZEIN-BETA PRECURSOR (ZEIN 2) (16 KD) (ZEIN ZC1) - ZEA MAYS (
 
   P93413           PROLAMIN - ORYZA SATIVA (RICE).
   Q43603           PROLAMIN PRECURSOR - ORYZA SATIVA (RICE).
   PRO6_ORYSA       PROLAMIN PPROL 14 PRECURSOR - ORYZA SATIVA (RICE).
   PRO4_ORYSA       PROLAMIN PPROL 4A PRECURSOR - ORYZA SATIVA (RICE).

SCAN HISTORY OWL18_0 3 300 NSINGLE OWL19_1 1 310 NSINGLE OWL26_0 1 300 NSINGLE SPTR37_9f 2 100 NSINGLE INITIAL MOTIF SETS GLIADGLUTEN1 Length of motif = 18 Motif number = 1 Gliadin/LMW glutenin motif I - 1 PCODE ST INT LNPCKVFLQQQCSPVAMP GDB1_WHEAT 130 130 LNPCKVFLQQQCSPVAMP GLTB_WHEAT 146 146 LNPCKVFLQQQCSPVPVP HOR1_HORVU 139 139 LNPCKVFLQQQCNPVAMP GLTC_WHEAT 123 123 LNPCKVFLQQQCIPVAMQ GLTA_WHEAT 193 193 LNPCKNFLLQQCKPVSLV GDBB_WHEAT 145 145 MIPCQMFLMQQCSPVEMV JQ1048 44 44 GLIADGLUTEN2 Length of motif = 15 Motif number = 2 Gliadin/LMW glutenin motif II - 1 PCODE ST INT CHVMQQQCCQQLQQI GDB1_WHEAT 161 13 CHVMQQQCCQQLPQI GLTB_WHEAT 177 13 CHVLQQQCCQQLPQI HOR1_HORVU 170 13 CHVMQQQCCQQLPQI GLTC_WHEAT 154 13 CHVMQQQCCQQLRQI GLTA_WHEAT 224 13 CQVMRQQCCQQLAQI GDBB_WHEAT 175 12 CHVMRRQCCRQLAQI JQ1048 74 12 GLIADGLUTEN3 Length of motif = 17 Motif number = 3 Gliadin/LMW glutenin motif III - 1 PCODE ST INT EQSRYEAIRAIIYSIIL GDB1_WHEAT 177 1 QQSRYEAIRAIIYSIIL GLTB_WHEAT 193 1 EQFRHEAIRAIVYSIFL HOR1_HORVU 186 1 EQSRYDVIRAITYSIIL GLTC_WHEAT 170 1 EQSRHESIRAIIYSIIL GLTA_WHEAT 240 1 QQLQCAAIHSVVHSIIM GDBB_WHEAT 191 1 RQLRCPAIHSMVHAIIM JQ1048 90 1 FINAL MOTIF SETS GLIADGLUTEN1 Length of motif = 18 Motif number = 1 Gliadin/LMW glutenin motif I - 2 PCODE ST INT LNPCKVFLQQQCSPVAMP P93792 131 131 LNPCKVFLQQQCSPVAMP GDB1_WHEAT 131 131 LNPCKVFLQQQCSPVAMP GLTB_WHEAT 147 147 LNPCKVFLQQQCSPVAMP P93791 147 147 LNPCKVFLQQQCSPVAMP P94021 125 125 LNPCKVFLQQQCSPVAMP O22108 116 116 LQPQNPSQQQPQEQVPLV GDA9_WHEAT 28 28 LNPCKVFLQQQCSPVAMP P93790 128 128 LNPCKVFLQQQCSPVPVP HOR1_HORVU 140 140 LQPKNPSQQQPQEQVPLV GDA7_WHEAT 25 25 LQPKNPSQQQPQEQVPLV Q41546 25 25 LQPKNPSQQQPQEQVPLV Q41529 25 25 LNPCKVFLQQQCSPVRMP Q40026 140 140 LQPQNPSQQQPQKQVPLV GDA4_WHEAT 28 28 LNPCKVFLQQQCNPVAMP GLTC_WHEAT 124 124 LQPQNPSQQQPQEQVPLM GDA3_WHEAT 28 28 LNPCKVFLQQQCIPVAMQ GLTA_WHEAT 194 194 LNPCKVFLQQQCIPVAMQ P93794 192 192 LNPCKQFLVQQCSPVAVV Q09114 52 52 LNPCKQFLVQQCSPVAAV Q09072 71 71 LTPCKVFLQQQCSPVRMP Q40021 117 117 LQLQNPSQQQPQEQVPLV GDA2_WHEAT 28 28 LNPCKNFLLQQCKPVSLV GDBB_WHEAT 146 146 LNPCRQFLVQQCSPVAVV AVE3_AVESA 66 66 LQPQNPSQQQPQEQVPLV Q41545 28 28 LQPQNPSQQQPQEQVPLV GDA5_WHEAT 28 28 LQPQNPSQQQPQEQVPLV GDA1_WHEAT 28 28 LQPQNPSQQQPQEQVPLV Q41509 28 28 PQPQNPSQPQPQRQVPLV Q41632 28 28 PQPQNPSQPQPQGQVPLV GDA6_WHEAT 28 28 LNPCMVFLQQQCIPVAMQ O22116 205 205 LNPCRQFLVQQCSPVAAV Q38794 65 65 LQPQNPSQQQPQEQVPLV Q41531 28 28 LQPQNPSQQLPQEQVPLV GDA0_WHEAT 28 28 MNPCKNFLLQQCNHVSLV GDBX_WHEAT 168 168 LQPQNPSQQLPQEQVPLV Q41528 28 28 LNPCKNILLQQSKPASLV GDB2_WHEAT 181 181 LQPQNPSQQQPQEQVPLV Q41530 28 28 LNPCKVFLQQCSPVAMPQ P93793 147 147 LNPCKEFLLQQCRPVSLL HOG1_HORVU 163 163 MIPCQMFLMQQCSPVEMV Q09071 45 45 LSPCGEFVRQQCSTVATP PRO7_ORYSA 38 38 LNLCKEFLLQQCTLDEKV HOG3_HORVU 139 139 LGQCVEFLRHQCSPTATP GLU2_MAIZE 125 125 LGQCIEFLRHQCSPAATP Q41295 108 108 LGQCIEFLRHQCSPAATP Q41506 107 107 LSPCSEFVRQQHSIVATP Q40714 44 44 LSPCSEFVRQQYSIVATP Q42398 44 44 LSPCSEFVRQQYSIVATP PRO2_ORYSA 44 44 LGQCIEFLRHQCSPAATP Q00318 100 100 LGQCVEFLRHQCSPAATP ZEB2_MAIZE 83 83 GLIADGLUTEN2 Length of motif = 15 Motif number = 2 Gliadin/LMW glutenin motif II - 2 PCODE ST INT CHVMQQQCCQQLQQI P93792 162 13 CHVMQQQCCQQLQQI GDB1_WHEAT 162 13 CHVMQQQCCQQLPQI GLTB_WHEAT 178 13 CHVMQQQCCQQLPQI P93791 178 13 CHVMQQQCCQQLPQI P94021 156 13 CHVMQQQCCQQLSQI O22108 147 13 YQLVQQLCCQQLWQI GDA9_WHEAT 186 140 CNVMQQQCCQQLPRI P93790 159 13 CHVLQQQCCQQLPQI HOR1_HORVU 171 13 YQLLQQLCCQQLLQI GDA7_WHEAT 174 131 YQLLQQLCCQQLLQI Q41546 174 131 YQLLQQLCCQQLLQI Q41529 174 131 CHVLQQQCCQQLPQI Q40026 171 13 YQLVQQFCCQQLWQI GDA4_WHEAT 173 127 CHVMQQQCCQQLPQI GLTC_WHEAT 155 13 YQQLQQLCCQQLFQI GDA3_WHEAT 163 117 CHVMQQQCCQQLRQI GLTA_WHEAT 225 13 CHVMQQQCCQQLRQI P93794 223 13 CQVARQQCCRQLAQI Q09114 82 12 CQVTRQQCCRQLAQI Q09072 101 12 CHVLQQQCCQQLPQI Q40021 148 13 YQLVQQLCCQQLWQI GDA2_WHEAT 166 120 CQVMRQQCCQQLAQI GDBB_WHEAT 176 12 CQVMRQQCCRQLEQI AVE3_AVESA 96 12 YQLLQQLCCQQLLQI Q41545 174 128 YQLLQQLCCQQLLQI GDA5_WHEAT 175 129 YQLLQELCCQHLWQI GDA1_WHEAT 170 124 YQLLQELCCQHLWQI Q41509 170 124 YQPLQQLCCQQLWQI Q41632 178 132 YQPLQQLCCQQLWQI GDA6_WHEAT 178 132 CHVMQRQCCQQLRQI O22116 236 13 CQVMRQQCCRRLEQI Q38794 95 12 YQLLRELCCQHLWQI Q41531 172 126 YQLLQELCCQHLWQI GDA0_WHEAT 169 123 CQVMQQQCCQQLAQI GDBX_WHEAT 198 12 YQLLRELCCQHLWQI Q41528 169 123 CQVMRQQCCQQLAQI GDB2_WHEAT 211 12 YQLLQELCCQHLWQI Q41530 171 125 CHVMQQQCCQQLPQI P93793 177 12 CRVMQQQCCLQLAQI HOG1_HORVU 193 12 CHVMRRQCCRQLAQI Q09071 75 12 CQVMQQQCCQQLRMI PRO7_ORYSA 67 11 CQLKRQQCCQQLANI HOG3_HORVU 176 19 CQSLRQQCCQQLRQV GLU2_MAIZE 148 5 CQALRQQCCQQLRQV Q41295 131 5 CQALRQQCCQQLRQV Q41506 130 5 NQVMQQQCCQQLRLV Q40714 73 11 NQVMQQQCCQQLRLV Q42398 73 11 NQVMQQQCCQQLRLV PRO2_ORYSA 73 11 CQALRQQCCHQLRQV Q00318 123 5 CQALQQQCCHQIRQV ZEB2_MAIZE 106 5 GLIADGLUTEN3 Length of motif = 17 Motif number = 3 Gliadin/LMW glutenin motif III - 2 PCODE ST INT EQSRYEAIRAIIYSIIL P93792 178 1 EQSRYEAIRAIIYSIIL GDB1_WHEAT 178 1 QQSRYEAIRAIIYSIIL GLTB_WHEAT 194 1 QQSRYEAIRAIIYSIIL P93791 194 1 QQSRYEAIRAIIYSIIL P94021 172 1 EQSRYDAIRAITYSIIL O22108 163 1 EQSRCQAIHNVVHAIIL GDA9_WHEAT 202 1 EQSRYEAIRAIIFSIIL P93790 175 1 EQFRHEAIRAIVYSIFL HOR1_HORVU 187 1 EQSRCQAIHNVVHAIIM GDA7_WHEAT 190 1 EQSRCQAIHNVVHAIIM Q41546 190 1 EQSRCQAIHNVVHAIIM Q41529 190 1 EQFRHEAIRAIVYSIFL Q40026 187 1 EQSRCQAIHNVVHAIIL GDA4_WHEAT 189 1 EQSRYDVIRAITYSIIL GLTC_WHEAT 171 1 EQSRCQAIHNVVHAIIL GDA3_WHEAT 179 1 EQSRHESIRAIIYSIIL GLTA_WHEAT 241 1 EQSRHESIRAIIYSIIL P93794 239 1 EQLRCPAIHSVVQAIIL Q09114 98 1 EQLRCPAIHSVVQSIIL Q09072 117 1 EQFRHEAIRAIVYSIFL Q40021 164 1 EQSRCQAIHNVVHAIIL GDA2_WHEAT 182 1 QQLQCAAIHSVVHSIIM GDBB_WHEAT 192 1 EQLRCPAIHSVVQAIIM AVE3_AVESA 112 1 EQSQCQAIHNVAHAIIM Q41545 190 1 EQSQCQAIHNVAHAIIM GDA5_WHEAT 191 1 EQSQCQAIHNVVHAIIL GDA1_WHEAT 186 1 EQSQCQAIHNVVHAIIL Q41509 186 1 EQSRCQAIHNVVHAIIL Q41632 194 1 EQSRCQAIHNVVHAIIL GDA6_WHEAT 194 1 EQSRHESIRAIIYSIIL O22116 252 1 EQLRCPAIHSVVQAIIM Q38794 111 1 EQSQCQAIHNVVHAIIL Q41531 188 1 EQSQCQAIHNVVHAIIL GDA0_WHEAT 185 1 QQLQCAAIHSVAHSIIM GDBX_WHEAT 214 1 EQSQCQAIHNVVHAIIL Q41528 185 1 QQLQCAAIHSVVHSIIM GDB2_WHEAT 227 1 EKLQCQAIHNVVHAIIL Q41530 187 1 QQSRYEAIRAIIYSIIL P93793 193 1 EQYKCTAIDSIVHAIFM HOG1_HORVU 209 1 RQLRCPAIHSMVHAIIM Q09071 91 1 QQSHCQAISSVQAIVQQ PRO7_ORYSA 83 1 EQSRCPAIQTIVHAIVM HOG3_HORVU 192 1 PQHRYQAIFGLVLQSIL GLU2_MAIZE 164 1 PLHRYQAIFGVVLQSIQ Q41295 147 1 PLHRYQAIFGVVLQSIQ Q41506 146 1 QQSHYQAISSVQAIVQQ Q40714 89 1 QQSHYQAISIVQAIVQQ Q42398 89 1 QQSHYQAISIVQAIVQQ PRO2_ORYSA 89 1 PLHRQQAIFGVVLQSIQ Q00318 139 1 PLHRYQATYGVVLQSFL ZEB2_MAIZE 122 1

User query: Display/Full Code "GLIADGLUTEN"