WORKLIST ENTRIES (1):
DISEASERSIST View alignment View Structure Disease resistance protein signature
Type of fingerprint: COMPOUND with 4 elements
Links:
INTERPRO; IPR000767
Creation date 21-MAY-1995; UPDATE 27-JUN-1999
1. STASKAWICZ, B.J., AUSUBEL, F.M., BAKER, B.J., ELLIS, J.G.
AND JONES, J.D.G.
Molecular genetics of plant disease resistance.
SCIENCE 268 661-667 (1995).
Plants are attacked by a range of phytopathegenic organisms, including
viruses, mycoplasma, bacteria, fungi, nematodes, protozoa and parasites.
Resistance to a pathogen is manifested in several ways and is often
correlated with a hypersensitive response (HR), localised induced cell
death in the host plant at the site of infection [1]. The induction of
the plant defense response that leads to HR is initiated by the plant's
recognition of specific signal molecules (elicitors) produced by the
pathogen; R genes are thought to encode receptors for these elicitors.
RPS2, N and L6 genes confer resistance to bacterial, viral and fungal
pathogens. Sequence analysis has shown that they contain C-terminal
leucine-rich repeats, which are characteristic of plant and animal
proteins involved in protein-protein interactions [1]. In addition,
the sequences contain a conserved nucleotide-binding site towards
their N-termini.
DISEASERSIST is a 4-element fingerprint that provides a signature for
disease resistance proteins. The fingerprint was derived from an initial
alignment of 2 sequences: the motifs were drawn from short conserved
regions spanning virtually the full alignment length, motif 1 encoding
a nucleotide-binding P-loop. Two iterations on OWL26.0 were required to
reach convergence, at which point a true set comprising 3 sequences was
identified. A single partial match was also found, AFSR_STRCO, a
Streptomyces regulatory protein that matches motifs 1 and 3 (and
also weakly with motif 2).
An update on SPTR37_9f identified a true set of 40 sequences, and 8
partial matches.
SUMMARY INFORMATION
40 codes involving 4 elements
4 codes involving 3 elements
4 codes involving 2 elements
COMPOSITE FINGERPRINT INDEX
4| 40 40 40 40
3| 4 4 4 0
2| 0 3 4 1
--+---------------------
| 1 2 3 4
True positives..
Q40392 O23533 O23532 O23535
O23528 O04264 O23529 O23538
O81401 O04093 O23001 O24015
O64790 O22728 O82484 P93244
O64789 Q42484 O81402 O82500
O22727 O23293 Q40253 O23317
Q40254 O49470 O49468 O04565
O23000 O65507 O65506 O49469
O04566 O81825 O81136 O48573
O81137 O49471 O48894 O50052
Subfamily: Codes involving 3 elements
Subfamily True positives..
O24016 O48647 Q39214 Q96485
Subfamily: Codes involving 2 elements
Subfamily True positives..
APAF_HUMAN O88879 AFSR_STRCO
PROTEIN TITLES
Q40392 VIRUS RESISTANCE (N) - NICOTIANA GLUTINOSA (TOBACCO).
O23533 RESISTANCE GENE HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CR
O23532 RESISTANCE GENE - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O23535 RESISTANCE GENE HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CR
O23528 RESISTANCE GENE - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O04264 DOWNY MILDEW RESISTANCE PROTEIN RPP5 - ARABIDOPSIS THALIANA
O23529 SIMILARITY TO REVERSE TRANSCRIPTASE - ARABIDOPSIS THALIANA R
O23538 RESISTANCE GENE HOMOLOG - ARABIDOPSIS THALIANA (MOUSE-EAR CR
O81401 NBS/LRR DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (M
O04093 DISEASE RESISTANCE PROTEIN RPM1 ISOLOG - ARABIDOPSIS THALIAN
O23001 F6P23.9 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O24015 RESISTANCE COMPLEX PROTEIN I2C-1 - LYCOPERSICON ESCULENTUM (
O64790 T1F9.21 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O22728 SIMILAR TO RPS-2 DISEASE RESISTANCE PROTEIN - ARABIDOPSIS TH
O82484 T12H20.8 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
P93244 RUST RESISTANCE PROTEIN M - LINUM USITATISSIMUM (FLAX) (LINS
O64789 T1F9.20 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
Q42484 RPS2 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O81402 RESISTANCE TO PSEUDOMONAS SYRINGAE PROTEIN 5 - ARABIDOPSIS T
O82500 F2P3.8 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O22727 F11P17.9 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O23293 TMV RESISTANCE PROTEIN HOMOLOG - ARABIDOPSIS THALIANA (MOUSE
Q40253 ALTERNATIVELY SPLICED RUST RESISTANCE (L6) - LINUM USITATISS
O23317 DISEASE RESISTANCE PROTEIN RPS2 HOMOLOG - ARABIDOPSIS THALIA
Q40254 L6TR - LINUM USITATISSIMUM (FLAX) (LINSEED).
O49470 RESISTANCE PROTEIN RPP5 - LIKE - ARABIDOPSIS THALIANA (MOUSE
O49468 RESISTANCE PROTEIN-LIKE PROTEIN - ARABIDOPSIS THALIANA (MOUS
O04565 T7N9.18 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O23000 F6P23.8 PROTEIN - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O65507 PUTATIVE DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (
O65506 PUTATIVE DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (
O49469 TMV RESISTANCE PROTEIN N-LIKE - ARABIDOPSIS THALIANA (MOUSE-
O04566 T7N9.19 - ARABIDOPSIS THALIANA (MOUSE-EAR CRESS).
O81825 HYPOTHETICAL 103.9 KD PROTEIN - ARABIDOPSIS THALIANA (MOUSE-
O81136 PLANT RESISTANCE PROTEIN (DISEASE RESISTANCE GENE HOMOLOG MI
O48573 PUTATIVE DISEASE RESISTANCE PROTEIN - ARABIDOPSIS THALIANA (
O81137 ROOT-KNOT NEMATODE RESISTANCE PROTEIN - LYCOPERSICON ESCULEN
O49471 TMV RESISTANCE PROTEIN N - LIKE - ARABIDOPSIS THALIANA (MOUS
O48894 RESISTANCE PROTEIN CANDIDATE - LACTUCA SATIVA (GARDEN LETTUC
O50052 HYPOTHETICAL 160.5 KD PROTEIN - ARABIDOPSIS THALIANA (MOUSE-
O24016 RESISTANCE COMPLEX PROTEIN I2C-2 - LYCOPERSICON ESCULENTUM (
O48647 XA1, COMPLETE CDS - ORYZA SATIVA (RICE).
Q39214 DISEASE RESISTANCE PROTEIN RPM1 - ARABIDOPSIS THALIANA (MOUS
Q96485 PRF (PRF) - LYCOPERSICON ESCULENTUM (TOMATO).
APAF_HUMAN APOPTOTIC PROTEASE ACTIVATING FACTOR 1 (APAF-1) - HOMO SAPIE
O88879 APOPTOTIC PROTEASE ACTIVATING FACTOR 1 - MUS MUSCULUS (MOUSE
AFSR_STRCO REGULATORY PROTEIN AFSR - STREPTOMYCES COELICOLOR.
SCAN HISTORY
OWL26_0 2 75 NSINGLE
SPTR37_9f 5 270 NSINGLE
INITIAL MOTIF SETS
DISEASERSIST1 Length of motif = 16 Motif number = 1
RPS2 protein motif I - 1
PCODE ST INT
IMGIWGMGGVGKTTIA NGU15605 211 211
IIGVYGPGGVGKTTLM A54809 177 177
DISEASERSIST2 Length of motif = 15 Motif number = 2
RPS2 protein motif II - 1
PCODE ST INT
RLRSKKVLIVLDDID NGU15605 290 63
ALRQKRFLLLLDDVW A54809 251 58
DISEASERSIST3 Length of motif = 15 Motif number = 3
RPS2 protein motif III - 1
PCODE ST INT
AKGLPLALKVWGSLL NGU15605 382 77
CGGLPLALITLGGAM A54809 345 79
DISEASERSIST4 Length of motif = 17 Motif number = 4
RPS2 protein motif IV - 1
PCODE ST INT
EGLHSLEYLNLSYCNLI NGU15605 830 433
DCLRNIRCINISHCNKL A54809 769 409
FINAL MOTIF SETS
DISEASERSIST1 Length of motif = 16 Motif number = 1
RPS2 protein motif I - 5
PCODE ST INT
IMGIWGMGGVGKTTIA Q40392 211 211
MVGIWGQSGIGKSTIG O23533 241 241
MVGIWGPSGIGKSTIG O23532 207 207
MVGIWGQSGIGKSTIG O23535 46 46
MVGIWGQSGIGKSTIG O23528 209 209
MVGIWGQSGIGKSTIG O04264 211 211
MVGIWGQSGIGKSTIG O23529 47 47
MVGIWGQSGIGKSTIG O23538 211 211
IVGLYGMGGVGKTTLL O81401 178 178
VVSISGMGGIGKTTLA O04093 162 162
IVGVLGMPGIGKTTLV O23001 243 243
VVPIVGMGGMGKTTLA O24015 195 195
IMGLHGMGGVGKTTLF O64790 63 63
IMGLHGMGGVGKTTLF O22728 174 174
TMGLYGMGGVGKTTLL O82484 175 175
MVGLYGMGGIGKTTTA P93244 275 275
IMGLHGMGGVGKTTLF O64789 176 176
IIGVYGPGGVGKTTLM Q42484 177 177
ILGLYGMGGVGKTTLL O81402 178 178
IVGIWGPAGVGKTTIA O82500 207 207
IMGLHGMGGVGKTTLF O22727 175 175
IVGICGPAGIGKTTIA O23293 168 168
MVGLYGMGGIGKTTTA Q40253 260 260
IMGLYGMGGVGKTTLL O23317 151 151
MVGLYGMGGIGKTTTA Q40254 260 260
SLGIWGMAGIGKTTLA O49470 167 167
MVGISGPSGIGKTTIA O49468 206 206
VLGLYGMGGIGKTTLA O04565 360 360
SIGIWGMPGIGKTTLA O23000 48 48
TIGVVGMPGIGKTTLT O65507 238 238
LIGICGLPGSGKTTIA O65506 291 291
VVGVLGMTGIGKTTVA O49469 223 223
VMGLYGMGGIGKTTLA O04566 386 386
KIGVWGMGGVGKTTLV O81825 136 136
VISITGMPGSGKTTLA O81136 544 544
TVGIVGMPGIGKTTLA O48573 278 278
VISITGMPGSGKTTLA O81137 545 545
IVEVVGMPGIGKSTLL O49471 231 231
MIALWGMGGVGKTTMM O48894 174 174
KTVLVGEAGIGKTWLA O50052 28 28
DISEASERSIST2 Length of motif = 15 Motif number = 2
RPS2 protein motif II - 5
PCODE ST INT
RLRSKKVLIVLDDID Q40392 290 63
RLKHKKVLILLDDVD O23533 315 58
MLNQKKVLIVLDDVD O23532 274 51
RLKHKKVLILLDDVD O23535 120 58
RLKHKKVLILLDDVD O23528 283 58
RLNHKKVLILLDDVD O04264 285 58
RLKHKKVLILLDDVD O23529 121 58
RLKHKKVLILLDDVD O23538 285 58
VLRRKKFVLLLDDIW O81401 254 60
LLETGRYLVVLDDVW O04093 237 59
ELLKKKVLLVLDDVS O23001 318 59
KLNGKRFLVVLDDVW O24015 282 71
VLKGKRFVLMLDDIW O64790 139 60
VLKGKRFVLMLDDIW O22728 250 60
CLSKKRFVLLLDDIW O82484 251 60
RVSKSKILVVLDDVD P93244 352 61
VLKGKRFVLMLDDIW O64789 252 60
ALRQKRFLLLLDDVW Q42484 251 58
VLRRRKFVLLLDDIW O81402 254 60
RLKSQKVLIILDDVD O82500 285 62
VLKGKRFVLMLDDIW O22727 251 60
RLCDQKVLIVLDDVN O23293 245 61
RVSRFKILVVLDDVD Q40253 338 62
VLRRHKFVLLLDDIW O23317 227 60
RVSRFKILVVLDDVD Q40254 338 62
TLRSKRILLVLDDVR O49470 234 51
SLMHKKVLIILDDVD O49468 279 57
NVHEKKIIVVLDDVD O04565 436 60
KSGQKRLLIVLDNVL O23000 110 46
LLLSKKSLVVLDNVS O65507 312 58
MLKDKKVVLVLDDVD O65506 370 63
FLRNKKLFIVLDNVT O49469 295 56
NVHEKKIIVVLDDVD O04566 463 61
LIDLKNFLLILDDVW O81825 212 60
QLFGKRYLIVLDDVW O81136 617 57
VLLLKKVFLVIDNVS O48573 354 60
QLFGKRYLIVLDDVW O81137 618 57
KLLKNTVFIVLDGIS O49471 303 56
NSGGKKILVILDDVW O48894 250 60
KHKKDNLLLILDDEG O50052 112 68
DISEASERSIST3 Length of motif = 15 Motif number = 3
RPS2 protein motif III - 5
PCODE ST INT
AKGLPLALKVWGSLL Q40392 382 77
AGSLPLGLSVLGSSL O23533 408 78
AGNLPLGLSVLGSSL O23532 367 78
AGHLPLGLNVLGSSL O23535 213 78
VGSLPLGLSVLGSSL O23528 376 78
VGSLPLGLSVLGSSL O04264 378 78
VGSLPLGLSVLGSSL O23529 214 78
AGNLPLGLSVLGSSL O23538 378 78
CCGLPLALNVIGETM O81401 348 79
CGGLPLAVKVLGGLL O04093 333 81
ARGNPLALKILGREL O23001 411 78
CKGLPLALKALAGML O24015 378 81
CRGLPLALSVIGETM O64790 233 79
CRGLPLALNVIGETM O22728 344 79
CRGLPLALNVIGETM O82484 345 79
TGGLPLTLKVTGSFL P93244 447 80
CRGLPLALNVIGETM O64789 346 79
CGGLPLALITLGGAM Q42484 345 79
CRGLPLALNVIGEAM O81402 348 79
AGHLPLALRVLGSFM O82500 378 78
CRGLPLALSCIGETM O22727 345 79
FDNLPLGLRVMGSSL O23293 338 78
TAGLPLTLKVIGSLL Q40253 433 80
CRGLPLALNVIGETM O23317 321 79
TAGLPLTLKVIGSLL Q40254 433 80
ANGNPLALSICGKNL O49470 327 78
AGNLPLDLRVLVGLA O49468 372 78
SGLLPLAVEVFGSLL O04565 529 78
FSGNPLALSLYEEML O23000 206 81
AKGNPLALKILGKEL O65507 406 79
ASGNPLALSFYCRVL O65506 772 387
AKGLPLALKLLGKGL O49469 387 77
TGLLPLAVKVFGSHF O04566 557 79
CCGLPLAIITIGRTL O81825 305 78
CKGLPLVADLIAGVI O81136 711 79
AKGNPLALGAFGVEL O48573 446 77
CKGLPLVADLIAGVI O81137 712 79
ARGHPLILKLLGEEL O49471 404 86
CGGLPIAIKTMACTL O48894 346 81
SKGLPAAIVVLIKSL O50052 223 96
DISEASERSIST4 Length of motif = 17 Motif number = 4
RPS2 protein motif IV - 5
PCODE ST INT
GSLSSLKKLDLSRNNFE Q40392 855 458
VNLSSLETLDLSGCSSL O23533 974 551
VNLSSLKMLDLSGCSSL O23532 823 441
VNLSSLETLDLSGCSSL O23535 773 545
VNLSSLIILDLSGCSSL O23528 982 591
VNLSSLETLDLSGCSSL O04264 990 597
LPLGSLKKMDLGCSNNL O23529 432 203
QLLGSLKKMILRNSKYL O23538 629 236
RCMPSLAVLDLSENHSL O81401 560 197
RSLPLLRVLDLSRVKFE O04093 551 203
FKLKSLKTLILSHCKNF O23001 747 321
IKLKLLRFLDLSETSIT O24015 606 213
RYMQKLVVLDLSYNRDF O64790 443 195
RYMQKLVVLDLSYNRDF O22728 554 195
RHMRKLVVLDLSENHQL O82484 558 198
GELKNLKTLDLTSCRIQ P93244 740 278
RYMQKLVVLDLSDNRDF O64789 567 206
GNLRKLKHLDLQRTQFL Q42484 601 241
RCMPHLVVLDLSENQSL O81402 560 197
QPLRNLRTMNLNSSRNL O82500 628 235
RYMQKLVVLDLSHNPDF O22727 558 198
QPLTNLKKMDLTRSSHL O23293 565 212
GELKKLKTLVLKFCPIQ Q40253 725 277
RFMPNLVVLDLSWNSSL O23317 443 107
NLLPNLKWLELPFYKHG Q40254 623 175
GDLALLDTLDLKNCNRL O49470 888 546
RNLTELVTLDLENCERL O49468 587 200
FLARQLSVLDLSESGIR O04565 789 245
IHLSSLEVLDLSNCKRL O23000 550 329
KDTQKLKWVDLSHSRKL O65507 649 228
VDFESLKVLNLSGCSDL O65506 1184 397
YKLKSLQELVLSGCSAL O49469 772 370
GNLGKLLQLDLRRCSSL O04566 887 315
QAFPNLRILDLSGVRIR O81825 515 195
RHLRLLRVLDLHTSFIM O81136 922 196
IKVSSLKILILSDCSKL O48573 767 306
ESFPNLEKLKLQECGKL O81137 1185 458
INLRSLKTLILSNCSNL O49471 731 312
GKLKKLRLLDLTNCYGV O48894 618 257
SELSNLKELILRNCSKL O50052 839 601
User query: Display/Full Code "DISEASERSIST"