|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-125 |
1.10e-88 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 272.76 E-value: 1.10e-88
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 18 FKFTIPESLDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100
....*....|....*....|....*...
gi 1720408127 98 QEHQQQVAQAVERAKQVTMAELNAIIGV 125
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ 108
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
419-704 |
7.43e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 7.43e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 419 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDISHPGNKSPVSqldclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 497
Cdd:COG2319 77 HTAAVLSVAFSPDGRLLASASAdGTVRLWDLATGLLLRTLT-----GHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA- 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 498 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 577
Cdd:COG2319 151 -TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVR 229
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 578 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGKD 655
Cdd:COG2319 230 LWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHSGGVNSVAFSPDGKLLASGSDD 309
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1720408127 656 NLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 704
Cdd:COG2319 310 GTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDL 359
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
419-703 |
1.01e-38 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 145.17 E-value: 1.01e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 419 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCKLLPDGCTLIVGGEASTLSIWDLaa 497
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 498 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 577
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 578 SWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTG 653
Cdd:cd00200 161 LWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 1720408127 654 KDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 703
Cdd:cd00200 239 EDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
541-580 |
2.25e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.69 E-value: 2.25e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720408127 541 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 580
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
531-702 |
7.79e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 52.40 E-value: 7.79e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 531 DGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-CPTGEWL 608
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpSESGRSL 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 609 AVGMESSNVEVLHVNKPdkyQLHL-----HESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSSVLS--- 680
Cdd:PLN00181 634 AFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHSFMGhtn 709
|
170 180
....*....|....*....|....*.
gi 1720408127 681 ----CDISVDDKYIVTGSGDKKATVY 702
Cdd:PLN00181 710 vknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
543-580 |
2.81e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 2.81e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1720408127 543 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 580
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
241-403 |
8.34e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 39.67 E-value: 8.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 241 HEKANTPVLKSSTPTPRSdmPTPGTSATPGLRPGLGKPPAMEPLVNQAAAGLRTPLavPGPYPAPFGMVPHAGMNGELTS 320
Cdd:PHA03378 671 HIPYQPSPTGANTMLPIQ--WAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA--RPPAAAPGRARPPAAAPGRARP 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 321 PGAAYAGLHsmSPQMSAAAAaaaaavvaygRSPMVGFDPPPHMRVPSIPPNLAGIPGGKPAysfhvtadgQMQPVPFPPD 400
Cdd:PHA03378 747 PAAAPGRAR--PPAAAPGRA----------RPPAAAPGAPTPQPPPQAPPAPQQRPRGAPT---------PQPPPQAGPT 805
|
...
gi 1720408127 401 ALI 403
Cdd:PHA03378 806 SMQ 808
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| TLE_N |
pfam03920 |
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor ... |
18-125 |
1.10e-88 |
|
Groucho/TLE N-terminal Q-rich domain; The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerization.
Pssm-ID: 461094 Cd Length: 117 Bit Score: 272.76 E-value: 1.10e-88
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 18 FKFTIPESLDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNTICAQVIPFLS 97
Cdd:pfam03920 1 FKFTVPETCDRIKEEFQFLQAQYHSLKLECEKLASEKTEMQRHYVMYYEMSYGLNIEMHKQTEIAKRLNAICAQVIPFLS 80
|
90 100
....*....|....*....|....*...
gi 1720408127 98 QEHQQQVAQAVERAKQVTMAELNAIIGV 125
Cdd:pfam03920 81 QEHQQQVAQAVERAKQVTMAELNAIIGQ 108
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
419-704 |
7.43e-44 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 162.77 E-value: 7.43e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 419 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDISHPGNKSPVSqldclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 497
Cdd:COG2319 77 HTAAVLSVAFSPDGRLLASASAdGTVRLWDLATGLLLRTLT-----GHTGAVRSVAFSPDGKTLASGSADGTVRLWDLA- 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 498 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 577
Cdd:COG2319 151 -TGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVR 229
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 578 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTGKD 655
Cdd:COG2319 230 LWDLATGKLLRTlTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGElLRTLTGHSGGVNSVAFSPDGKLLASGSDD 309
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1720408127 656 NLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 704
Cdd:COG2319 310 GTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDL 359
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
419-704 |
1.34e-42 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 159.31 E-value: 1.34e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 419 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEASTLSIWDLAa 497
Cdd:COG2319 119 HTGAVRSVAFSPDGKTLASGSAdGTVRLWDLA---TGKLLRTLT--GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLA- 192
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 498 pTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 577
Cdd:COG2319 193 -TGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVR 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 578 SWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK-YQLHLHESCVLSLKFAYCGKWFVSTGKD 655
Cdd:COG2319 272 LWDLATGELLRTlTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLlRTLTGHTGAVRSVAFSPDGKTLASGSDD 351
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1720408127 656 NLLNAWRTPYGASIFQSKE-SSSVLSCDISVDDKYIVTGSGDKKATVYEV 704
Cdd:COG2319 352 GTVRLWDLATGELLRTLTGhTGAVTSVAFSPDGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
419-703 |
1.01e-38 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 145.17 E-value: 1.01e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 419 HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLdCLNRDNyIRSCKLLPDGCTLIVGGEASTLSIWDLaa 497
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGdGTIKVWDLE---TGELLRTL-KGHTGP-VRDVAASADGTYLASGSSDKTIRLWDL-- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 498 PTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVR 577
Cdd:cd00200 81 ETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIK 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 578 SWDLREGR---QLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAYCGKWFVSTG 653
Cdd:cd00200 161 LWDLRTGKcvaTLTGH--TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKcLGTLRGHENGVNSVAFSPDGYLLASGS 238
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 1720408127 654 KDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYE 703
Cdd:cd00200 239 EDGTIRVWDLRTGECVQTlSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
412-664 |
6.13e-38 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 145.82 E-value: 6.13e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 412 RQINTLN-HGEVVCAVTISNPTRHVYTGGK-GCVKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEAST 489
Cdd:COG2319 153 KLLRTLTgHSGAVTSVAFSPDGKLLASGSDdGTVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKLLASGSADGT 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 490 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWT 569
Cdd:COG2319 228 VRLWDLA--TGKLLRTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLAS 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 570 GGLDNTVRSWDLREGRQLQQHD-FTSQIFSLGYCPTGEWLAVGMESSNVEVLHVN-KPDKYQLHLHESCVLSLKFAYCGK 647
Cdd:COG2319 306 GSDDGTVRLWDLATGKLLRTLTgHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAtGELLRTLTGHTGAVTSVAFSPDGR 385
|
250
....*....|....*..
gi 1720408127 648 WFVSTGKDNLLNAWRTP 664
Cdd:COG2319 386 TLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
469-704 |
3.46e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.48 E-value: 3.46e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 469 IRSCKLLPDGCTLIVGGEASTLSIWDLAAPTPRIKAELTSSAPacYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQF 548
Cdd:COG2319 39 VASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAV--LSVAFSPDGRLLASASADGTVRLWDLATGLLLRTL 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 549 QGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK 627
Cdd:COG2319 117 TGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTlTGHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLATGKL 196
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408127 628 YQ-LHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQSK-ESSSVLSCDISVDDKYIVTGSGDKKATVYEV 704
Cdd:COG2319 197 LRtLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTgHSGSVRSVAFSPDGRLLASGSADGTVRLWDL 275
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
513-704 |
2.01e-26 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 109.73 E-value: 2.01e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 513 CYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREGRQLQ-QHD 591
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRtLTG 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 592 FTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPD-KYQLHLHESCVLSLKFAyCGKWFVSTGK-DNLLNAWRTPYGASI 669
Cdd:cd00200 92 HTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKcLTTLRGHTDWVNSVAFS-PDGTFVASSSqDGTIKLWDLRTGKCV 170
|
170 180 190
....*....|....*....|....*....|....*..
gi 1720408127 670 --FQSkESSSVLSCDISVDDKYIVTGSGDKKATVYEV 704
Cdd:cd00200 171 atLTG-HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
544-704 |
2.38e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 92.01 E-value: 2.38e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 544 LVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWDLREG---RQLQQHdfTSQIFSLGYCPTGEWLAVGMESSNVEVL 620
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGellRTLKGH--TGPVRDVAASADGTYLASGSSDKTIRLW 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 621 HVNKPDK-YQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASI--FQSKEsSSVLSCDISVDDKYIVTGSGDK 697
Cdd:cd00200 79 DLETGECvRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLttLRGHT-DWVNSVAFSPDGTFVASSSQDG 157
|
....*..
gi 1720408127 698 KATVYEV 704
Cdd:cd00200 158 TIKLWDL 164
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
487-704 |
2.64e-19 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 90.74 E-value: 2.64e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 487 ASTLSIWDLAAPTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASCIDISNDGTK 566
Cdd:COG2319 13 SADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 567 LWTGGLDNTVRSWDLREGRQLQQ-HDFTSQIFSLGYCPTGEWLAVGMESSNVEVLHVNKPDK-YQLHLHESCVLSLKFAY 644
Cdd:COG2319 93 LASASADGTVRLWDLATGLLLRTlTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLlRTLTGHSGAVTSVAFSP 172
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720408127 645 CGKWFVSTGKDNLLNAWRTPYGASIFQ-SKESSSVLSCDISVDDKYIVTGSGDKKATVYEV 704
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLRTlTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDL 233
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
412-541 |
4.45e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 68.40 E-value: 4.45e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 412 RQINTLN-HGEVVCAVTISNPTRHVYTGGKGC-VKVWDIShpgNKSPVSQLDclNRDNYIRSCKLLPDGCTLIVGGEAST 489
Cdd:COG2319 279 ELLRTLTgHSGGVNSVAFSPDGKLLASGSDDGtVRLWDLA---TGKLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGT 353
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1720408127 490 LSIWDLAapTPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWDLHN 541
Cdd:COG2319 354 VRLWDLA--TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
541-580 |
2.25e-07 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 47.69 E-value: 2.25e-07
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720408127 541 NQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 580
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
531-702 |
7.79e-07 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 52.40 E-value: 7.79e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 531 DGNIAVWDLHNQTLVRQFQGHTDGASCIDISN-DGTKLWTGGLDNTVRSWDLREGRQLQQHDFTSQIFSLGY-CPTGEWL 608
Cdd:PLN00181 554 EGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSaDPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFpSESGRSL 633
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 609 AVGMESSNVEVLHVNKPdkyQLHL-----HESCVLSLKFAYCGKwFVSTGKDNLLNAWRTPYGASIFQSKESSSVLS--- 680
Cdd:PLN00181 634 AFGSADHKVYYYDLRNP---KLPLctmigHSKTVSYVRFVDSST-LVSSSTDNTLKLWDLSMSISGINETPLHSFMGhtn 709
|
170 180
....*....|....*....|....*.
gi 1720408127 681 ----CDISVDDKYIVTGSGDKKATVY 702
Cdd:PLN00181 710 vknfVGLSVSDGYIATGSETNEVFVY 735
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
543-580 |
2.81e-06 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 44.26 E-value: 2.81e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1720408127 543 TLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRSWD 580
Cdd:pfam00400 2 KLLKTLEGHTGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
499-538 |
5.92e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 5.92e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720408127 499 TPRIKAELTSSAPACYALAISPDSKVCFSCCSDGNIAVWD 538
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| NBCH_WD40 |
pfam20426 |
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at ... |
504-585 |
7.91e-04 |
|
Neurobeachin beta propeller domain; This entry represents the beta propeller domain found at the C-terminus of neurobeachin-like proteins.
Pssm-ID: 466575 [Multi-domain] Cd Length: 350 Bit Score: 42.37 E-value: 7.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 504 AELTSSAPACYALAISPDSKVCFSCcsdGNiavWD-------LHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTV 576
Cdd:pfam20426 75 AENVELGAQCFATLQTPSENFLISC---GN---WEnsfqvisLNDGRMVQSIRQHKDVVSCVAVTSDGSILATGSYDTTV 148
|
....*....
gi 1720408127 577 RSWDLREGR 585
Cdd:pfam20426 149 MVWEVLRGR 157
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
514-538 |
6.45e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 35.01 E-value: 6.45e-03
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
241-403 |
8.34e-03 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 39.67 E-value: 8.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 241 HEKANTPVLKSSTPTPRSdmPTPGTSATPGLRPGLGKPPAMEPLVNQAAAGLRTPLavPGPYPAPFGMVPHAGMNGELTS 320
Cdd:PHA03378 671 HIPYQPSPTGANTMLPIQ--WAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA--RPPAAAPGRARPPAAAPGRARP 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408127 321 PGAAYAGLHsmSPQMSAAAAaaaaavvaygRSPMVGFDPPPHMRVPSIPPNLAGIPGGKPAysfhvtadgQMQPVPFPPD 400
Cdd:PHA03378 747 PAAAPGRAR--PPAAAPGRA----------RPPAAAPGAPTPQPPPQAPPAPQQRPRGAPT---------PQPPPQAGPT 805
|
...
gi 1720408127 401 ALI 403
Cdd:PHA03378 806 SMQ 808
|
|
|