WORKLIST ENTRIES (1):

DISEASERSIST View alignment View Structure    Disease resistance protein signature
 Type of fingerprint: COMPOUND with 4  elements
Links:
   INTERPRO; IPR000767

 Creation date 21-MAY-1995; UPDATE 27-JUN-1999

   1. STASKAWICZ, B.J., AUSUBEL, F.M., BAKER, B.J., ELLIS, J.G.
   AND JONES, J.D.G.
   Molecular genetics of plant disease resistance.
   SCIENCE 268 661-667 (1995).

   Plants are attacked by a range of phytopathegenic organisms, including
   viruses, mycoplasma, bacteria, fungi, nematodes, protozoa and parasites.
   Resistance to a pathogen is manifested in several ways and is often
   correlated with a hypersensitive response (HR), localised induced cell
   death in the host plant at the site of infection [1]. The induction of
   the plant defense response that leads to HR is initiated by the plant's
   recognition of specific signal molecules (elicitors) produced by the
   pathogen; R genes are thought to encode receptors for these elicitors.
  
   RPS2, N and L6 genes confer resistance to bacterial, viral and fungal
   pathogens. Sequence analysis has shown that they contain C-terminal
   leucine-rich repeats, which are characteristic of plant and animal
   proteins involved in protein-protein interactions [1]. In addition,
   the sequences contain a conserved nucleotide-binding site towards
   their N-termini.
  
   DISEASERSIST is a 4-element fingerprint that provides a signature for
   disease resistance proteins. The fingerprint was derived from an initial
   alignment of 2 sequences: the motifs were drawn from short conserved
   regions spanning virtually the full alignment length, motif 1 encoding
   a nucleotide-binding P-loop. Two iterations on OWL26.0 were required to
   reach convergence, at which point a true set comprising 3 sequences was
   identified. A single partial match was also found, AFSR_STRCO, a
   Streptomyces regulatory protein that matches motifs 1 and 3 (and
   also weakly with motif 2).
  
   An update on SPTR37_9f identified a true set of 40 sequences, and 8
   partial matches.

  SUMMARY INFORMATION
     40 codes involving  4 elements
      4 codes involving  3 elements
      4 codes involving  2 elements

   COMPOSITE FINGERPRINT INDEX
  
    4|  40   40   40   40  
    3|   4    4    4    0  
    2|   0    3    4    1  
   --+---------------------
     |   1    2    3    4  

True positives..
 Q40392         O23533         O23532         O23535         
 O23528         O04264         O23529         O23538         
 O81401         O04093         O23001         O24015         
 O64790         O22728         O82484         P93244         
 O64789         Q42484         O81402         O82500         
 O22727         O23293         Q40253         O23317         
 Q40254         O49470         O49468         O04565         
 O23000         O65507         O65506         O49469         
 O04566         O81825         O81136         O48573         
 O81137         O49471         O48894         O50052         
Subfamily:  Codes involving 3 elements
 Subfamily True positives..
 O24016         O48647         Q39214         Q96485         
Subfamily:  Codes involving 2 elements
 Subfamily True positives..
 APAF_HUMAN     O88879         AFSR_STRCO     


  PROTEIN TITLES
   Q40392           VIRUS RESISTANCE (N) - NICOTIANA GLUTINOSA (TOBACCO).
   O23533           RESISTANCE GENE HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CR
   O23532           RESISTANCE GENE - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O23535           RESISTANCE GENE HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CR
   O23528           RESISTANCE GENE - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O04264           DOWNY MILDEW RESISTANCE PROTEIN RPP5 - ARABIDOPSIS THALIANA 
   O23529           SIMILARITY TO REVERSE TRANSCRIPTASE - ARABIDOPSIS THALIANA R
   O23538           RESISTANCE GENE HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CR
   O81401           NBS/LRR DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (M
   O04093           DISEASE RESISTANCE PROTEIN RPM1 ISOLOG - ARABIDOPSIS THALIAN
   O23001           F6P23.9 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O24015           RESISTANCE COMPLEX PROTEIN I2C-1 - LYCOPERSICON ESCULENTUM (
   O64790           T1F9.21 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O22728           SIMILAR TO RPS-2 DISEASE RESISTANCE PROTEIN - ARABIDOPSIS TH
   O82484           T12H20.8 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   P93244           RUST RESISTANCE PROTEIN M - LINUM USITATISSIMUM (FLAX) (LINS
   O64789           T1F9.20 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   Q42484           RPS2 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O81402           RESISTANCE TO PSEUDOMONAS SYRINGAE PROTEIN 5 - ARABIDOPSIS T
   O82500           F2P3.8 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O22727           F11P17.9 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O23293           TMV RESISTANCE PROTEIN HOMOLOG - ARABIDOPSIS THALIANA (MOUSE
   Q40253           ALTERNATIVELY SPLICED RUST RESISTANCE (L6) - LINUM USITATISS
   O23317           DISEASE RESISTANCE PROTEIN RPS2 HOMOLOG - ARABIDOPSIS THALIA
   Q40254           L6TR - LINUM USITATISSIMUM (FLAX) (LINSEED).
   O49470           RESISTANCE PROTEIN RPP5 - LIKE - ARABIDOPSIS THALIANA (MOUSE
   O49468           RESISTANCE PROTEIN-LIKE PROTEIN - ARABIDOPSIS THALIANA (MOUS
   O04565           T7N9.18 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O23000           F6P23.8 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O65507           PUTATIVE DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (
   O65506           PUTATIVE DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (
   O49469           TMV RESISTANCE PROTEIN N-LIKE - ARABIDOPSIS THALIANA (MOUSE-
   O04566           T7N9.19 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
   O81825           HYPOTHETICAL 103.9 KD PROTEIN - ARABIDOPSIS THALIANA (MOUSE-
   O81136           PLANT RESISTANCE PROTEIN (DISEASE RESISTANCE GENE HOMOLOG MI
   O48573           PUTATIVE DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (
   O81137           ROOT-KNOT NEMATODE RESISTANCE PROTEIN - LYCOPERSICON ESCULEN
   O49471           TMV RESISTANCE PROTEIN N - LIKE - ARABIDOPSIS THALIANA (MOUS
   O48894           RESISTANCE PROTEIN CANDIDATE - LACTUCA SATIVA (GARDEN LETTUC
   O50052           HYPOTHETICAL 160.5 KD PROTEIN - ARABIDOPSIS THALIANA (MOUSE-
 
   O24016           RESISTANCE COMPLEX PROTEIN I2C-2 - LYCOPERSICON ESCULENTUM (
   O48647           XA1, COMPLETE CDS - ORYZA SATIVA (RICE).
   Q39214           DISEASE RESISTANCE PROTEIN RPM1 - ARABIDOPSIS THALIANA (MOUS
   Q96485           PRF (PRF) - LYCOPERSICON ESCULENTUM (TOMATO).
 
   APAF_HUMAN       APOPTOTIC PROTEASE ACTIVATING FACTOR 1 (APAF-1) - HOMO SAPIE
   O88879           APOPTOTIC PROTEASE ACTIVATING FACTOR 1 - MUS MUSCULUS (MOUSE
   AFSR_STRCO       REGULATORY PROTEIN AFSR - STREPTOMYCES COELICOLOR.

SCAN HISTORY OWL26_0 2 75 NSINGLE SPTR37_9f 5 270 NSINGLE INITIAL MOTIF SETS DISEASERSIST1 Length of motif = 16 Motif number = 1 RPS2 protein motif I - 1 PCODE ST INT IMGIWGMGGVGKTTIA NGU15605 211 211 IIGVYGPGGVGKTTLM A54809 177 177 DISEASERSIST2 Length of motif = 15 Motif number = 2 RPS2 protein motif II - 1 PCODE ST INT RLRSKKVLIVLDDID NGU15605 290 63 ALRQKRFLLLLDDVW A54809 251 58 DISEASERSIST3 Length of motif = 15 Motif number = 3 RPS2 protein motif III - 1 PCODE ST INT AKGLPLALKVWGSLL NGU15605 382 77 CGGLPLALITLGGAM A54809 345 79 DISEASERSIST4 Length of motif = 17 Motif number = 4 RPS2 protein motif IV - 1 PCODE ST INT EGLHSLEYLNLSYCNLI NGU15605 830 433 DCLRNIRCINISHCNKL A54809 769 409 FINAL MOTIF SETS DISEASERSIST1 Length of motif = 16 Motif number = 1 RPS2 protein motif I - 5 PCODE ST INT IMGIWGMGGVGKTTIA Q40392 211 211 MVGIWGQSGIGKSTIG O23533 241 241 MVGIWGPSGIGKSTIG O23532 207 207 MVGIWGQSGIGKSTIG O23535 46 46 MVGIWGQSGIGKSTIG O23528 209 209 MVGIWGQSGIGKSTIG O04264 211 211 MVGIWGQSGIGKSTIG O23529 47 47 MVGIWGQSGIGKSTIG O23538 211 211 IVGLYGMGGVGKTTLL O81401 178 178 VVSISGMGGIGKTTLA O04093 162 162 IVGVLGMPGIGKTTLV O23001 243 243 VVPIVGMGGMGKTTLA O24015 195 195 IMGLHGMGGVGKTTLF O64790 63 63 IMGLHGMGGVGKTTLF O22728 174 174 TMGLYGMGGVGKTTLL O82484 175 175 MVGLYGMGGIGKTTTA P93244 275 275 IMGLHGMGGVGKTTLF O64789 176 176 IIGVYGPGGVGKTTLM Q42484 177 177 ILGLYGMGGVGKTTLL O81402 178 178 IVGIWGPAGVGKTTIA O82500 207 207 IMGLHGMGGVGKTTLF O22727 175 175 IVGICGPAGIGKTTIA O23293 168 168 MVGLYGMGGIGKTTTA Q40253 260 260 IMGLYGMGGVGKTTLL O23317 151 151 MVGLYGMGGIGKTTTA Q40254 260 260 SLGIWGMAGIGKTTLA O49470 167 167 MVGISGPSGIGKTTIA O49468 206 206 VLGLYGMGGIGKTTLA O04565 360 360 SIGIWGMPGIGKTTLA O23000 48 48 TIGVVGMPGIGKTTLT O65507 238 238 LIGICGLPGSGKTTIA O65506 291 291 VVGVLGMTGIGKTTVA O49469 223 223 VMGLYGMGGIGKTTLA O04566 386 386 KIGVWGMGGVGKTTLV O81825 136 136 VISITGMPGSGKTTLA O81136 544 544 TVGIVGMPGIGKTTLA O48573 278 278 VISITGMPGSGKTTLA O81137 545 545 IVEVVGMPGIGKSTLL O49471 231 231 MIALWGMGGVGKTTMM O48894 174 174 KTVLVGEAGIGKTWLA O50052 28 28 DISEASERSIST2 Length of motif = 15 Motif number = 2 RPS2 protein motif II - 5 PCODE ST INT RLRSKKVLIVLDDID Q40392 290 63 RLKHKKVLILLDDVD O23533 315 58 MLNQKKVLIVLDDVD O23532 274 51 RLKHKKVLILLDDVD O23535 120 58 RLKHKKVLILLDDVD O23528 283 58 RLNHKKVLILLDDVD O04264 285 58 RLKHKKVLILLDDVD O23529 121 58 RLKHKKVLILLDDVD O23538 285 58 VLRRKKFVLLLDDIW O81401 254 60 LLETGRYLVVLDDVW O04093 237 59 ELLKKKVLLVLDDVS O23001 318 59 KLNGKRFLVVLDDVW O24015 282 71 VLKGKRFVLMLDDIW O64790 139 60 VLKGKRFVLMLDDIW O22728 250 60 CLSKKRFVLLLDDIW O82484 251 60 RVSKSKILVVLDDVD P93244 352 61 VLKGKRFVLMLDDIW O64789 252 60 ALRQKRFLLLLDDVW Q42484 251 58 VLRRRKFVLLLDDIW O81402 254 60 RLKSQKVLIILDDVD O82500 285 62 VLKGKRFVLMLDDIW O22727 251 60 RLCDQKVLIVLDDVN O23293 245 61 RVSRFKILVVLDDVD Q40253 338 62 VLRRHKFVLLLDDIW O23317 227 60 RVSRFKILVVLDDVD Q40254 338 62 TLRSKRILLVLDDVR O49470 234 51 SLMHKKVLIILDDVD O49468 279 57 NVHEKKIIVVLDDVD O04565 436 60 KSGQKRLLIVLDNVL O23000 110 46 LLLSKKSLVVLDNVS O65507 312 58 MLKDKKVVLVLDDVD O65506 370 63 FLRNKKLFIVLDNVT O49469 295 56 NVHEKKIIVVLDDVD O04566 463 61 LIDLKNFLLILDDVW O81825 212 60 QLFGKRYLIVLDDVW O81136 617 57 VLLLKKVFLVIDNVS O48573 354 60 QLFGKRYLIVLDDVW O81137 618 57 KLLKNTVFIVLDGIS O49471 303 56 NSGGKKILVILDDVW O48894 250 60 KHKKDNLLLILDDEG O50052 112 68 DISEASERSIST3 Length of motif = 15 Motif number = 3 RPS2 protein motif III - 5 PCODE ST INT AKGLPLALKVWGSLL Q40392 382 77 AGSLPLGLSVLGSSL O23533 408 78 AGNLPLGLSVLGSSL O23532 367 78 AGHLPLGLNVLGSSL O23535 213 78 VGSLPLGLSVLGSSL O23528 376 78 VGSLPLGLSVLGSSL O04264 378 78 VGSLPLGLSVLGSSL O23529 214 78 AGNLPLGLSVLGSSL O23538 378 78 CCGLPLALNVIGETM O81401 348 79 CGGLPLAVKVLGGLL O04093 333 81 ARGNPLALKILGREL O23001 411 78 CKGLPLALKALAGML O24015 378 81 CRGLPLALSVIGETM O64790 233 79 CRGLPLALNVIGETM O22728 344 79 CRGLPLALNVIGETM O82484 345 79 TGGLPLTLKVTGSFL P93244 447 80 CRGLPLALNVIGETM O64789 346 79 CGGLPLALITLGGAM Q42484 345 79 CRGLPLALNVIGEAM O81402 348 79 AGHLPLALRVLGSFM O82500 378 78 CRGLPLALSCIGETM O22727 345 79 FDNLPLGLRVMGSSL O23293 338 78 TAGLPLTLKVIGSLL Q40253 433 80 CRGLPLALNVIGETM O23317 321 79 TAGLPLTLKVIGSLL Q40254 433 80 ANGNPLALSICGKNL O49470 327 78 AGNLPLDLRVLVGLA O49468 372 78 SGLLPLAVEVFGSLL O04565 529 78 FSGNPLALSLYEEML O23000 206 81 AKGNPLALKILGKEL O65507 406 79 ASGNPLALSFYCRVL O65506 772 387 AKGLPLALKLLGKGL O49469 387 77 TGLLPLAVKVFGSHF O04566 557 79 CCGLPLAIITIGRTL O81825 305 78 CKGLPLVADLIAGVI O81136 711 79 AKGNPLALGAFGVEL O48573 446 77 CKGLPLVADLIAGVI O81137 712 79 ARGHPLILKLLGEEL O49471 404 86 CGGLPIAIKTMACTL O48894 346 81 SKGLPAAIVVLIKSL O50052 223 96 DISEASERSIST4 Length of motif = 17 Motif number = 4 RPS2 protein motif IV - 5 PCODE ST INT GSLSSLKKLDLSRNNFE Q40392 855 458 VNLSSLETLDLSGCSSL O23533 974 551 VNLSSLKMLDLSGCSSL O23532 823 441 VNLSSLETLDLSGCSSL O23535 773 545 VNLSSLIILDLSGCSSL O23528 982 591 VNLSSLETLDLSGCSSL O04264 990 597 LPLGSLKKMDLGCSNNL O23529 432 203 QLLGSLKKMILRNSKYL O23538 629 236 RCMPSLAVLDLSENHSL O81401 560 197 RSLPLLRVLDLSRVKFE O04093 551 203 FKLKSLKTLILSHCKNF O23001 747 321 IKLKLLRFLDLSETSIT O24015 606 213 RYMQKLVVLDLSYNRDF O64790 443 195 RYMQKLVVLDLSYNRDF O22728 554 195 RHMRKLVVLDLSENHQL O82484 558 198 GELKNLKTLDLTSCRIQ P93244 740 278 RYMQKLVVLDLSDNRDF O64789 567 206 GNLRKLKHLDLQRTQFL Q42484 601 241 RCMPHLVVLDLSENQSL O81402 560 197 QPLRNLRTMNLNSSRNL O82500 628 235 RYMQKLVVLDLSHNPDF O22727 558 198 QPLTNLKKMDLTRSSHL O23293 565 212 GELKKLKTLVLKFCPIQ Q40253 725 277 RFMPNLVVLDLSWNSSL O23317 443 107 NLLPNLKWLELPFYKHG Q40254 623 175 GDLALLDTLDLKNCNRL O49470 888 546 RNLTELVTLDLENCERL O49468 587 200 FLARQLSVLDLSESGIR O04565 789 245 IHLSSLEVLDLSNCKRL O23000 550 329 KDTQKLKWVDLSHSRKL O65507 649 228 VDFESLKVLNLSGCSDL O65506 1184 397 YKLKSLQELVLSGCSAL O49469 772 370 GNLGKLLQLDLRRCSSL O04566 887 315 QAFPNLRILDLSGVRIR O81825 515 195 RHLRLLRVLDLHTSFIM O81136 922 196 IKVSSLKILILSDCSKL O48573 767 306 ESFPNLEKLKLQECGKL O81137 1185 458 INLRSLKTLILSNCSNL O49471 731 312 GKLKKLRLLDLTNCYGV O48894 618 257 SELSNLKELILRNCSKL O50052 839 601

User query: Display/Full Code "DISEASERSIST"