WORKLIST ENTRIES (1):
GLHYDRLASE30 View alignment Glycosyl hydrolase family 30 signature
Type of fingerprint: COMPOUND with 6 elements
Links:
PRINTS; PR00131 GLHYDRLASE1; PR00132 GLHYDRLASE2; PR00133 GLHYDRLASE3
PRINTS; PR00732 GLHYDRLASE4; PR00733 GLHYDRLASE6; PR00734 GLHYDRLASE7
PRINTS; PR00735 GLHYDRLASE8; PR00134 GLHYDRLASE10; PR00911 GLHYDRLASE11
PRINTS; PR00736 GLHYDRLASE15; PR00737 GLHYDRLASE16; PR00738 GLHYDRLASE20
PRINTS; PR00739 GLHYDRLASE26; PR00740 GLHYDRLASE27; PR00741 GLHYDRLASE29
PRINTS; PR00742 GLHYDRLASE35; PR00743 GLHYDRLASE36; PR00744 GLHYDRLASE37
PRINTS; PR00745 GLHYDRLASE39; PR00746 GLHYDRLASE41; PR00747 GLHYDRLASE47
PRINTS; PR00844 GLHYDRLASE48; PR00845 GLHYDRLASE52; PR00846 GLHYDRLASE56
PRINTS; PR00849 GLHYDRLASE58; PR00850 GLHYDRLASE59; PR00748 MELIBIASE
PRINTS; PR00137 LYSOZYME; PR00684 T4LYSOZYME; PR00749 LYSOZYMEG
PRINTS; PR00110 ALPHAAMYLASE; PR00750 BETAAMYLASE
INTERPRO; IPR001139
Creation date 13-FEB-1998; UPDATE 07-JUN-1999
1. HENRISSAT, B.
A classification of glycosyl hydrolases based on amino acid sequence
similarities.
BIOCHEM.J. 280 309-316 (1991).
2. HENRISSAT, B. AND BAIROCH, A.
New families in the classification of glycosyl hydrolases based on amino
acid sequence similarities.
BIOCHEM.J. 293 781-788 (1993).
3. HENRISSAT, B. AND BAIROCH, A.
Updating the sequence-based classification of glycosyl hydrolases.
BIOCHEM.J. 316 695-696 (1996).
4. EL HASSOUNI, M., HENRISSAT, B., CHIPPAUX, M. AND BARRAS, F.
Nucleotide sequences of the Arb genes, which control beta-glucosidase
utilisation in Erwinia chrysanthemi - Comparison with the Escherichia
coli Bgl operon and evidence for a new beta-glycohydrolase family
including enzymes from eubacteria, archaebacteria and humans.
J.BACTERIOL. 174 765-777 (1992).
5. DINUR, T., OSIECKI, K.M., LEGLER, G., GATT, S., DESNICK, R.J.
AND GRABOWSKI, G.A.
Human acid beta-glucosidase: isolation and amino acid sequence of a peptide
containing the catalytic site.
PROC.NATL.ACAD.SCI.U.S.A. 83 1660-1664 (1986).
6. WINFIELD, S.L., TAYEBI, N., MARTIN, B.M., GINNS, E.I. AND SIDRANSKY, E.
Identification of three additional genes contiguous to the glucocerebrosidase
locus on chromosome 1q21: Implications for Gaucher disease.
GENOME RES. 7 1020-1026 (1997).
7. IWASAWA, K., IDA, H. AND ETO, Y.
Differences in origin of the 1448C mutation in patients with Gaucher disease.
ACTA PAEDIATR.JPN. 39 451-453 (1997).
O-Glycosyl hydrolases (EC 3.2.1.-) are a widespread group of enzymes that
hydrolyse the glycosidic bond between two or more carbohydrates, or between
a carbohydrate and a non-carbohydrate moiety. A classification system for
glycosyl hydrolases, based on sequence similarity, has led to the definition
of up to 60 different families [1-4] (http://expasy.hcuge.ch/cgi-bin/lists?
glycosid.txt). Family 30 encompasses the mammalian glucosyl-ceramidases (EC
3.2.1.45).
Human acid beta-glucosidase (D-glucosyl-N-acylsphingosine glucohydrolase),
cleaves the glucosidic bonds of glucosylceramide and synthetic beta-
glucosides [5]. Any one of over 50 different mutations in the gene of
glucocerebrosidase have been found to affect activity of this hydrolase,
producing variants of Gaucher disease, the most prevalent lysosomal
storage disease [5,7].
GLHYDRLASE30 is a 6-element fingerprint that provides a signature for
family 30 glycosyl hydrolases. The fingerprint was derived from an initial
alignment of 5 sequences: the motifs were drawn from conserved regions
spanning the N-terminal half of the alignment - motifs 1 and 4 each include
potential glycosylation sites; and motif 5 encodes a putative proton donor
site. A single iteration on OWL29.6 was required to reach convergence, no
further sequences being identified beyond the starting set. Two partial
matches were found: CEF11E6 is a fragment that lacks the portion of sequence
bearing motifs 5 and 6; and I67792, a putative glucosylceramidase that
matches motifs 4, 5 and 6.
An update on SPTR37_9f identified a true set of 5 sequences.
SUMMARY INFORMATION
5 codes involving 6 elements
0 codes involving 5 elements
0 codes involving 4 elements
0 codes involving 3 elements
0 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
6| 5 5 5 5 5 5
5| 0 0 0 0 0 0
4| 0 0 0 0 0 0
3| 0 0 0 0 0 0
2| 0 0 0 0 0 0
--+-------------------------------
| 1 2 3 4 5 6
True positives..
GLCM_HUMAN Q16545 GLCM_MOUSE O16580
O16581
PROTEIN TITLES
GLCM_HUMAN GLUCOSYLCERAMIDASE PRECURSOR (EC 3.2.1.45) (BETA-GLUCOCEREBR
Q16545 GLUCOCEREBROSIDASE PRECURSOR - HOMO SAPIENS (HUMAN).
GLCM_MOUSE GLUCOSYLCERAMIDASE PRECURSOR (EC 3.2.1.45) (BETA-GLUCOCEREBR
O16580 C33C12.3 PROTEIN - CAENORHABDITIS ELEGANS.
O16581 C33C12.8 PROTEIN - CAENORHABDITIS ELEGANS.
SCAN HISTORY
OWL29_6 1 50 NSINGLE
SPTR37_9f 2 6 NSINGLE
INITIAL MOTIF SETS
GLHYDRLASE301 Length of motif = 20 Motif number = 1
Glycosyl hydrolase family 30 motif I - 1
PCODE ST INT
SSVVCVCNATYCDSFDPPTF GLCM_HUMAN 51 51
SSVVCVCNATYCDSFDPPTF HUMGCBL 31 31
SSVVCVCNASYCDSLDPVTL GLCM_MOUSE 31 31
TGIVCVCNITYCDEIPDINL CELC33C125 78 78
TGTVCVCSLDSCDEIPPLDI CELC33C124 33 33
GLHYDRLASE302 Length of motif = 21 Motif number = 2
Glycosyl hydrolase family 30 motif II - 1
PCODE ST INT
TLQPEQKFQKVKGFGGAMTDA GLCM_HUMAN 107 36
TLQPEQKFQKVKGFGGAMTDA HUMGCBL 87 36
TLQPEKKFQKVKGFGGAMTDA GLCM_MOUSE 87 36
TIDSSKTYQTIQGFGSTFSDA CELC33C125 133 35
TIDSSKKYQTIQGFGSTFSDA CELC33C124 88 35
GLHYDRLASE303 Length of motif = 27 Motif number = 3
Glycosyl hydrolase family 30 motif III - 1
PCODE ST INT
LLLKSYFSEEGIGYNIIRVPMASCDFS GLCM_HUMAN 142 14
LLLKSYFSEEGIGYNIIRVPMASCDFS HUMGCBL 122 14
LLLRSYFSTNGIEYNIIRVPMASCDFS GLCM_MOUSE 122 14
TILRQYFSDSGLNLQFGRVPIASNDFS CELC33C125 168 14
LIMKQYFSDTGLNLQFGRVPIASTDFS CELC33C124 123 14
GLHYDRLASE304 Length of motif = 29 Motif number = 4
Glycosyl hydrolase family 30 motif IV - 1
PCODE ST INT
RTYTYADTPDDFQLHNFSLPEEDTKLKIP GLCM_HUMAN 170 1
RTYTYADTPDDFQLHNFSLPEEDTKLKIP HUMGCBL 150 1
RVYTYADTPNDFQLSNFSLPEEDTKLKIP GLCM_MOUSE 150 1
RVYTYDDNLEDYNMAHFSLQREDYQWKIP CELC33C125 196 1
RVYSYNDVANDYSMQNFNLTKEDFQWKIP CELC33C124 151 1
GLHYDRLASE305 Length of motif = 18 Motif number = 5
Glycosyl hydrolase family 30 motif V - 1
PCODE ST INT
DIYHQTWARYFVKFLDAY GLCM_HUMAN 242 43
DIYHQTWARYFVKFLDAY HUMGCBL 222 43
DIFHQTWANYFVKFLDAY GLCM_MOUSE 222 43
DTYHKSYVTYILHFLEEY CELC33C125 267 42
DNYHQAYAKYFVRFLEEY CELC33C124 222 42
GLHYDRLASE306 Length of motif = 23 Motif number = 6
Glycosyl hydrolase family 30 motif VI - 1
PCODE ST INT
VRLLMLDDQRLLLPHWAKVVLTD GLCM_HUMAN 315 55
VRLLMLDDQRLLLPHWAKVVLTD HUMGCBL 295 55
VKLLMLDDQRLLLPRWAEVVLSD GLCM_MOUSE 294 54
VKILILDDNRGNLPKWADTVLND CELC33C125 341 56
VKLLILDDNRGNLPKWADTVLND CELC33C124 296 56
FINAL MOTIF SETS
GLHYDRLASE301 Length of motif = 20 Motif number = 1
Glycosyl hydrolase family 30 motif I - 2
PCODE ST INT
SSVVCVCNATYCDSFDPPTF GLCM_HUMAN 51 51
SSVVCVCNATYCDSFDPPTF Q16545 51 51
SSVVCVCNASYCDSLDPVTL GLCM_MOUSE 31 31
TGTVCVCSLDSCDEIPPLDI O16580 33 33
TGIVCVCNITYCDEIPDINL O16581 78 78
GLHYDRLASE302 Length of motif = 21 Motif number = 2
Glycosyl hydrolase family 30 motif II - 2
PCODE ST INT
TLQPEQKFQKVKGFGGAMTDA GLCM_HUMAN 107 36
TLQPEQKFQKVKGFGGAMTDA Q16545 107 36
TLQPEKKFQKVKGFGGAMTDA GLCM_MOUSE 87 36
TIDSSKKYQTIQGFGSTFSDA O16580 88 35
TIDSSKTYQTIQGFGSTFSDA O16581 133 35
GLHYDRLASE303 Length of motif = 27 Motif number = 3
Glycosyl hydrolase family 30 motif III - 2
PCODE ST INT
LLLKSYFSEEGIGYNIIRVPMASCDFS GLCM_HUMAN 142 14
LLLKSYFSEEGIGYNIIRVPMASCDFS Q16545 142 14
LLLRSYFSTNGIEYNIIRVPMASCDFS GLCM_MOUSE 122 14
LIMKQYFSDTGLNLQFGRVPIASTDFS O16580 123 14
TILRQYFSDSGLNLQFGRVPIASNDFS O16581 168 14
GLHYDRLASE304 Length of motif = 29 Motif number = 4
Glycosyl hydrolase family 30 motif IV - 2
PCODE ST INT
RTYTYADTPDDFQLHNFSLPEEDTKLKIP GLCM_HUMAN 170 1
RTYTYADTPDDFQLHNFSLPEEDTKLKIP Q16545 170 1
RVYTYADTPNDFQLSNFSLPEEDTKLKIP GLCM_MOUSE 150 1
RVYSYNDVANDYSMQNFNLTKEDFQWKIP O16580 151 1
RVYTYDDNLEDYNMAHFSLQREDYQWKIP O16581 196 1
GLHYDRLASE305 Length of motif = 18 Motif number = 5
Glycosyl hydrolase family 30 motif V - 2
PCODE ST INT
DIYHQTWARYFVKFLDAY GLCM_HUMAN 242 43
DIYHQTWARYFVKFLDAY Q16545 242 43
DIFHQTWANYFVKFLDAY GLCM_MOUSE 222 43
DNYHQAYAKYFVRFLEEY O16580 222 42
DTYHKSYVTYILHFLEEY O16581 267 42
GLHYDRLASE306 Length of motif = 23 Motif number = 6
Glycosyl hydrolase family 30 motif VI - 2
PCODE ST INT
VRLLMLDDQRLLLPHWAKVVLTD GLCM_HUMAN 315 55
VRLLMLDDQRLLLPHWAKVVLTD Q16545 315 55
VKLLMLDDQRLLLPRWAEVVLSD GLCM_MOUSE 294 54
VKLLILDDNRGNLPKWADTVLND O16580 296 56
VKILILDDNRGNLPKWADTVLND O16581 341 56
User query: Display/Full Code "GLHYDRLASE30"