|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
1.85e-39 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.76 E-value: 1.85e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 223005862 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
1.37e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 137.85 E-value: 1.37e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 223005862 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1471-1956 |
4.30e-32 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 131.19 E-value: 4.30e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1471 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1550
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1551 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1627
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1628 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1707
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1708 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1785
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1786 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1864
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1865 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1944
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 223005862 1945 tGGDDCSVFVWR 1956
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
880-1266 |
1.29e-26 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.62 E-value: 1.29e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.47e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.47e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 223005862 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.68e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.68e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 223005862 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
1.96e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.96e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 223005862 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| HELP super family |
cl04081 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1350-1401 |
7.88e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. The actual alignment was detected with superfamily member pfam03451:
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 7.88e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 223005862 1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| COG4946 super family |
cl27624 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
5.55e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown]; The actual alignment was detected with superfamily member COG4946:
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 5.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 223005862 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
1.85e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.76 E-value: 1.85e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 223005862 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
1.11e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 138.24 E-value: 1.11e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 223005862 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
1.37e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 137.85 E-value: 1.37e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 223005862 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1140 |
2.71e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.35 E-value: 2.71e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 708 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 788 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVkcnphhvdklvtvgikhikfwqqagggftskrgTFGSVGKLetmm 867
Cdd:COG2319 136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV---------------------------------AFSPDGKL---- 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 868 cvsygrmedlVFSGAATGDIFIWkDIL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 942
Cdd:COG2319 177 ----------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 943 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 1019
Cdd:COG2319 233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1020 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 1099
Cdd:COG2319 268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 223005862 1100 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1140
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1471-1956 |
4.30e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 131.19 E-value: 4.30e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1471 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1550
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1551 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1627
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1628 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1707
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1708 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1785
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1786 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1864
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1865 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1944
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 223005862 1945 tGGDDCSVFVWR 1956
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
880-1266 |
1.29e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.62 E-value: 1.29e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1413-1763 |
1.24e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.41 E-value: 1.24e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1413 HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDamtKHTLSMLRCF--HSKGVNYINFSATGKLLVSVGVDpeHTITV 1490
Cdd:cd00200 8 HTGGVTCVAFSPDGKL--LATGSGDGT---IKVWD---LETGELLRTLkgHTGPVRDVAASADGTYLASGSSD--KTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1491 WRWQEGAKVASRGGHLERIFVVEFRPDSdtQFVSVGVKH--MKFWTLAgsallykKGVIGSLGAAKMQTMLSVAF-GANN 1567
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE-------TGKCLTTLRGHTDWVNSVAFsPDGT 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1568 LTFTGAINGDVYVW--KDHFLIRlVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRCRAfqletgql 1645
Cdd:cd00200 149 FVASSSQDGTIKLWdlRTGKCVA-TLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLG-------- 213
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1646 vecvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKV 1725
Cdd:cd00200 214 -----------------------------------TLRGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 223005862 1726 SlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1763
Cdd:cd00200 258 S-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.47e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.47e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 223005862 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.68e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.68e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 223005862 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1221 |
1.95e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.78 E-value: 1.95e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 1045
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1046 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1125
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1126 SYITHIDWDSRGKLLqvnSGAreqlffeaprgkrhiirpSEIEKIE-WDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKD 1204
Cdd:cd00200 220 NGVNSVAFSPDGYLL---ASG------------------SEDGTIRvWDLRTGECVQTLSG----HT--NSVTSLAWSPD 272
|
330
....*....|....*..
gi 223005862 1205 CSLLATGDDFGFVKLFS 1221
Cdd:cd00200 273 GKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
1.96e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.96e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 223005862 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1350-1401 |
7.88e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 7.88e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 223005862 1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
5.55e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 5.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 223005862 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1682-1715 |
2.58e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.58e-04
10 20 30
....*....|....*....|....*....|....
gi 223005862 1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
3.40e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 3.40e-04
10 20 30
....*....|....*....|....*....|..
gi 223005862 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1682-1715 |
4.24e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 4.24e-04
10 20 30
....*....|....*....|....*....|....
gi 223005862 1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
7.88e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 7.88e-04
10 20 30
....*....|....*....|....*....|....*....
gi 223005862 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
1.25e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 1.25e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 223005862 760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
1.85e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.76 E-value: 1.85e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 223005862 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-466 |
2.63e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.37 E-value: 2.63e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpitkidlreteqgykglsirsvcwk 292
Cdd:COG2319 183 DGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA--------------------------- 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 293 adrllagtqdseifevivreRDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEE-AVRS 371
Cdd:COG2319 235 --------------------TGKLLRTLTGH-SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSgGVNS 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 372 VAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECSKSL 451
Cdd:COG2319 294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT-----GELLRTL 368
|
410
....*....|....*....
gi 223005862 452 ----SFITHIDWSLDSKYL 466
Cdd:COG2319 369 tghtGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-438 |
7.42e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 145.05 E-value: 7.42e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 32 LLGLAAAVASLAASPDGARLAAGAGDLT--LLLLDAAAGALLATLLG-HTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWtlcgNALTAKRgIFGKTGDLQTILCLAC-AKEDITYSG 214
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSAdGTVRLW----DLATGKL-LRTLTGHSGAVTSVAFsPDGKLLASG 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 215 ALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVC 290
Cdd:COG2319 181 SDDGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVA 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 291 WKAD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA 368
Cdd:COG2319 254 FSPDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG 331
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 223005862 369 -VRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319 332 aVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
1.11e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 138.24 E-value: 1.11e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 223005862 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
1.37e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 137.85 E-value: 1.37e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 223005862 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1140 |
2.71e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 137.35 E-value: 2.71e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 708 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 788 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVkcnphhvdklvtvgikhikfwqqagggftskrgTFGSVGKLetmm 867
Cdd:COG2319 136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV---------------------------------AFSPDGKL---- 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 868 cvsygrmedlVFSGAATGDIFIWkDIL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 942
Cdd:COG2319 177 ----------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 943 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 1019
Cdd:COG2319 233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1020 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 1099
Cdd:COG2319 268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 223005862 1100 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1140
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1471-1956 |
4.30e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 131.19 E-value: 4.30e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1471 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1550
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1551 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1627
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1628 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1707
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1708 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1785
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1786 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1864
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1865 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1944
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 223005862 1945 tGGDDCSVFVWR 1956
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1420-1800 |
4.01e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.11 E-value: 4.01e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1420 LTVNQHPKYRNVVATSQIGTT-------PSIHIWDAMTKHTLSMLRcFHSKGVNYINFSATGKLLVSVGVDpeHTITVWR 1492
Cdd:COG2319 72 ATLLGHTAAVLSVAFSPDGRLlasasadGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1493 WQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSLGAAkmqtmLSVAFGANNLTF- 1570
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLa 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1571 TGAINGDVYVW--KDHFLIRLVaKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVE 1647
Cdd:COG2319 221 SGSADGTVRLWdlATGKLLRTL-TGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR 282
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1648 cvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSl 1727
Cdd:COG2319 283 ---------------------------------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT- 327
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1728 GHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEHTVDFYDLT 1800
Cdd:COG2319 328 GHTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
718-1071 |
2.50e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 125.79 E-value: 2.50e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaikrsalst 948
Cdd:COG2319 259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWD------------------- 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 949 sskglllednpsiraitlghghilvgTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELS 1028
Cdd:COG2319 317 --------------------------LATGKLLRT---------LTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLA 360
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 223005862 1029 AQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADT 1071
Cdd:COG2319 361 TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
274-844 |
2.07e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 123.10 E-value: 2.07e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 274 LRETEQGYKGLSIRSVCWKADRLLAGTQDSEIFEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:COG2319 28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 354 LADHALIARCNM-EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPV 432
Cdd:COG2319 107 LATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 433 DVYAVAQrykkiGECSKSL----SFITHIDWSLDSKYLqtndgagerlfyrmpsgkpltskeeikgipwaswtcvkgpev 508
Cdd:COG2319 187 RLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLL------------------------------------------ 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 509 sgiwpkytevtdinsvdanynssvlVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSV 588
Cdd:COG2319 220 -------------------------ASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTV 270
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 589 FQWRfipegvsngmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflk 668
Cdd:COG2319 271 RLWD---------------------------------------------------------------------------- 274
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 669 rekapedslklqfihgyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvG 748
Cdd:COG2319 275 ----------------------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--S 307
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 749 RDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHH 828
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG 384
|
570
....*....|....*..
gi 223005862 829 vDKLVTVGI-KHIKFWQ 844
Cdd:COG2319 385 -RTLASGSAdGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
880-1266 |
1.29e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.62 E-value: 1.29e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
943-1266 |
1.27e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.85 E-value: 1.27e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 943 RSALSTSSKGLLLEDNPSIRAITLGHGHILVGTKNGEILEIDKSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTL 1022
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1023 RIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVAS 1102
Cdd:COG2319 103 RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPD-GKLLASGS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1103 HDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLLQVNSGAREqlffeaprgkrhiIRPseiekieWDtwtcVLGPT 1182
Cdd:COG2319 182 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT-------------VRL-------WD----LATGK 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1183 CEGIWPAHSDitDVNAASLTKDCSLLATGDDFGFVKLFSyPVKGQHARFKKyvGHSAHVTNVRWLHNDSVLLTvGGADTA 1262
Cdd:COG2319 238 LLRTLTGHSG--SVRSVAFSPDGRLLASGSADGTVRLWD-LATGELLRTLT--GHSGGVNSVAFSPDGKLLAS-GSDDGT 311
|
....
gi 223005862 1263 LMIW 1266
Cdd:COG2319 312 VRLW 315
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1401-1718 |
1.09e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 105.76 E-value: 1.09e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1401 NLSTGSQSFYLE-HTDDILCLTVnqHPKYRNVVATSQIGTtpsIHIWDAMTKHTLSMLRCfHSKGVNYINFSATGKLLVS 1479
Cdd:COG2319 106 DLATGLLLRTLTgHTGAVRSVAF--SPDGKTLASGSADGT---VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLAS 179
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1480 VGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTqFVSVGV-KHMKFWTLAGSALLYKKGviGSLGAAkmqtm 1558
Cdd:COG2319 180 GSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL-LASGSAdGTVRLWDLATGKLLRTLT--GHSGSV----- 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1559 LSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDqemkrcr 1636
Cdd:COG2319 250 RSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGVNSVAFSPDGKLLASGS------DDGTVRLWD------- 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1637 afqLETGQLV-------ECVRSVC-RGKGKILV-GTKDGEI----IEVGEKNAAsnilIDGHmEGEIWGLATHPSKDLFI 1703
Cdd:COG2319 317 ---LATGKLLrtltghtGAVRSVAfSPDGKTLAsGSDDGTVrlwdLATGELLRT----LTGH-TGAVTSVAFSPDGRTLA 388
|
330
....*....|....*
gi 223005862 1704 SASNDGTARIWDLAD 1718
Cdd:COG2319 389 SGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
723-929 |
7.53e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 100.87 E-value: 7.53e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 723 YLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:cd00200 89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWD 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 803 WKKGEKIATTRGHKDKIFVVKCNPHHVDKLVTVGIKHIKFWQQAGGgftSKRGTFgsVGKLETMMCVSYGRMEDLVFSGA 882
Cdd:cd00200 164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL--RGHENGVNSVAFSPDGYLLASGS 238
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 223005862 883 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 929
Cdd:cd00200 239 EDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1413-1763 |
1.24e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 97.41 E-value: 1.24e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1413 HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDamtKHTLSMLRCF--HSKGVNYINFSATGKLLVSVGVDpeHTITV 1490
Cdd:cd00200 8 HTGGVTCVAFSPDGKL--LATGSGDGT---IKVWD---LETGELLRTLkgHTGPVRDVAASADGTYLASGSSD--KTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1491 WRWQEGAKVASRGGHLERIFVVEFRPDSdtQFVSVGVKH--MKFWTLAgsallykKGVIGSLGAAKMQTMLSVAF-GANN 1567
Cdd:cd00200 78 WDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE-------TGKCLTTLRGHTDWVNSVAFsPDGT 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1568 LTFTGAINGDVYVW--KDHFLIRlVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRCRAfqletgql 1645
Cdd:cd00200 149 FVASSSQDGTIKLWdlRTGKCVA-TLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLG-------- 213
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1646 vecvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKV 1725
Cdd:cd00200 214 -----------------------------------TLRGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
|
330 340 350 360
....*....|....*....|....*....|....*....|
gi 223005862 1726 SlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1763
Cdd:cd00200 258 S-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
228-466 |
1.05e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 94.71 E-value: 1.05e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 228 LVRTIQGaHSAGIFSM--YACEEGFATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWKAD--RLLAGTQDS 303
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADgtYLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 304 EIFevIVRERDKPML-ILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVAFSPDGSQL 381
Cdd:cd00200 74 TIR--LWDLETGECVrTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 382 ALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGECSKSLSFITHIDWSL 461
Cdd:cd00200 151 ASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSP 229
|
....*
gi 223005862 462 DSKYL 466
Cdd:cd00200 230 DGYLL 234
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
521-1112 |
2.60e-20 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 95.75 E-value: 2.60e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipegvsn 600
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLWD-------- 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 601 gmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflkrekapedslklq 680
Cdd:COG2319 --------------------------------------------------------------------------------
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 681 fihgyrgydcrnnlfyTQAGEVVYHIAavavvynrqqhsqrlylGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQT 760
Cdd:COG2319 149 ----------------LATGKLLRTLT-----------------GHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLAT 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 761 LKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFvvkcnphhvdklvtvgikhi 840
Cdd:COG2319 194 GKLLRTLTG-HTGAVRSVAFSPDGKLLASGSAD--GTVRLWDLATGKLLRTLTGHSGSVR-------------------- 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 841 kfwqqagggftskrgtfgsvgkletmmcvsygrmeDLVFSgaatgdifiwkdilllktvkaHDGPVFAmyaldkgfvTGG 920
Cdd:COG2319 251 -----------------------------------SVAFS---------------------PDGRLLA---------SGS 265
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 921 KDGIVELWDdmferclktyaikrsalstsskglllednpsiraitlghghilvgTKNGEILEidksgpmtlLVQGHmEGE 1000
Cdd:COG2319 266 ADGTVRLWD---------------------------------------------LATGELLR---------TLTGH-SGG 290
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1001 VWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHH 1080
Cdd:COG2319 291 VNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG 370
|
570 580 590
....*....|....*....|....*....|..
gi 223005862 1081 RKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 1112
Cdd:COG2319 371 HTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.47e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.47e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 223005862 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.68e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.68e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 223005862 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1461-1798 |
1.17e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 91.63 E-value: 1.17e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1461 HSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1539
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1540 LLYkkgvigSLGAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGKEr 1617
Cdd:cd00200 85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASSSQ- 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1618 ptkeGGAVKLWDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHP 1697
Cdd:cd00200 156 ----DGTIKLWD----------LRTGKCVA---------------------------------TLTGH-TGEVNSVAFSP 187
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1698 SKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMKNGefvillvnSLKVW-GKKRDRK---- 1770
Cdd:cd00200 188 DGEKLLSSSSDGTIKLWDLSTGKCL-GTLRGHenGVNSVAFSPDGYLLASGSEDG--------TIRVWdLRTGECVqtls 258
|
330 340 350
....*....|....*....|....*....|.
gi 223005862 1771 ---SAIQDIRISPDNRFLAVGSSEHTVDFYD 1798
Cdd:cd00200 259 ghtNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1592-1956 |
2.75e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.47 E-value: 2.75e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1592 KAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRC-RAFQLETGQlVECVRSVCRGKgKILVGTKDGEIIE 1670
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGS------GDGTIKVWDLETGELlRTLKGHTGP-VRDVAASADGT-YLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1671 VGEKNAASNILIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMK 1748
Cdd:cd00200 78 WDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCL-TTLRGHtdWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1749 NGefvillvnSLKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQ 1820
Cdd:cd00200 156 DG--------TIKLWdlrtGKCVATltghTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNS 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1821 MDFSADGKYIqvstgaykrqvhevplgkqvteavviekitwaswTSVLGDEVIGIWprnadkadvncacvthaglNIVTG 1900
Cdd:cd00200 225 VAFSPDGYLL----------------------------------ASGSEDGTIRVW-------------------DLRTG 251
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 223005862 1901 DdfglvklfdfpctekfaKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1956
Cdd:cd00200 252 E-----------------CVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1221 |
1.95e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.78 E-value: 1.95e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 1045
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1046 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1125
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1126 SYITHIDWDSRGKLLqvnSGAreqlffeaprgkrhiirpSEIEKIE-WDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKD 1204
Cdd:cd00200 220 NGVNSVAFSPDGYLL---ASG------------------SEDGTIRvWDLRTGECVQTLSG----HT--NSVTSLAWSPD 272
|
330
....*....|....*..
gi 223005862 1205 CSLLATGDDFGFVKLFS 1221
Cdd:cd00200 273 GKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
291-929 |
7.87e-18 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 88.04 E-value: 7.87e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 291 WKADRLLAGTQDSEIFEVIVRERDKPMLILQGHCEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEAVR 370
Cdd:COG2319 3 SADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 371 SVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVyavaqrykkigecsks 450
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRL---------------- 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 451 lsfithidWSLDSkylqtndgagerlfyrmpsGKPLtskeeikgipwaswtcvkgpevsgiwpkytevtdinsvdanyns 530
Cdd:COG2319 147 --------WDLAT-------------------GKLL-------------------------------------------- 155
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 531 svlvsgddfglvklfkfpclkrgakfRKYVGHSAHVTNVRWSHDFQWvLSTGGADHSVFQWRfipegvsngmletapqeg 610
Cdd:COG2319 156 --------------------------RTLTGHSGAVTSVAFSPDGKL-LASGSDDGTVRLWD------------------ 190
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 611 gadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflkrekapedslklqfihgyrgydc 690
Cdd:COG2319 --------------------------------------------------------------------------------
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 691 rnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGq 770
Cdd:COG2319 191 ------LATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLWDLATGKLLRTLTG- 244
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 771 HQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVTVGI-KHIKFWQQAG 847
Cdd:COG2319 245 HSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP---DgkLLASGSDdGTVRLWDLAT 319
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 848 GgftSKRGTFGsvGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDG 923
Cdd:COG2319 320 G---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWdlATGELLRTLTGHTGAVTSVAFSPDGrtLASGSADG 394
|
....*.
gi 223005862 924 IVELWD 929
Cdd:COG2319 395 TVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1682-1956 |
9.87e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 85.85 E-value: 9.87e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNgefvillvNS 1759
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK-GHTGpvRDVAASADGTYLASGSSD--------KT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1760 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQMDFSADGKYiq 1831
Cdd:cd00200 75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF-- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1832 VSTGAYKRQVH--EVPLGKqvteavVIEKITwaswtsvlgdevigiwprnADKADVNCACVTHAGLNIVTGDDFGLVKLF 1909
Cdd:cd00200 150 VASSSQDGTIKlwDLRTGK------CVATLT-------------------GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 223005862 1910 DFpctEKFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1956
Cdd:cd00200 205 DL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-177 |
4.14e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 80.84 E-value: 4.14e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 46 VYNTREHS-QKFFLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASV 124
Cdd:cd00200 161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 223005862 125 GLDakNTVCIWDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFW 177
Cdd:cd00200 238 SED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP-DGKRLASGSAdGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
55-179 |
4.59e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.65 E-value: 4.59e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 55 KFFLGHNDDIISLALHPDKTLVATGqvGKEPYICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCI 134
Cdd:COG2319 282 RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRL 356
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 223005862 135 WDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWTL 179
Cdd:COG2319 357 WDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSAdGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
1.96e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.96e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 223005862 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1398-1629 |
1.78e-13 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 73.14 E-value: 1.78e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1398 IVQNLSTGSQSFYLE-HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDAMTKHTLSMLRCfHSKGVNYINFSATGKL 1476
Cdd:cd00200 76 RLWDLETGECVRTLTgHTSYVSSVAFSPDGRI--LSSSSRDKT---IKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTF 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1477 LVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYkkgvigsLGAAKMQ 1556
Cdd:cd00200 150 VASSSQD--GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLG-------TLRGHEN 220
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1557 TMLSVAFGANNLTFTGA-INGDVYVWK-DHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWD 1629
Cdd:cd00200 221 GVNSVAFSPDGYLLASGsEDGTIRVWDlRTGECVQTLSGHTNSVTSLAWSPDGKRLASGS------ADGTIRIWD 289
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1350-1401 |
7.88e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 7.88e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 223005862 1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1040-1266 |
5.46e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 68.52 E-value: 5.46e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1040 KKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVG 1119
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1120 ICKGASSYITHIDWDSRGKLLqvnSGAREQlffeaprgKRHIIrpseiekieWDTWTcvlgPTCEGIWPAHSDitDVNAA 1199
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRIL---SSSSRD--------KTIKV---------WDVET----GKCLTTLRGHTD--WVNSV 141
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 223005862 1200 SLTKDCSLLATGDDFGFVKLFSyPVKGQhaRFKKYVGHSAHVTNVRWlHNDSVLLTVGGADTALMIW 1266
Cdd:cd00200 142 AFSPDGTFVASSSQDGTIKLWD-LRTGK--CVATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLW 204
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1123-1629 |
6.46e-09 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 60.31 E-value: 6.46e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1123 GASSYITHIDWDSRGKLLQVNSGAREQLFFEA-PRGKRHIIRPSEIEKIE-WDtwtcVLGPTCEGIWPAHSDitDVNAAS 1200
Cdd:COG2319 54 GAGDLTLLLLDAAAGALLATLLGHTAAVLSVAfSPDGRLLASASADGTVRlWD----LATGLLLRTLTGHTG--AVRSVA 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1201 LTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWlHNDSVLLTVGGADTALMIWtrefvgtqesklvd 1280
Cdd:COG2319 128 FSPDGKTLASGSADGTVRLWDLATGKLLRTLT---GHSGAVTSVAF-SPDGKLLASGSDDGTVRLW-------------- 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1281 seesdtdveedggydsdvarekaidyttkiyavsiremegtkphqqlkevsveerppvsraapqpeklqknnitkkkklv 1360
Cdd:COG2319 --------------------------------------------------------------------------------
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1361 eelaldhvfgyrgfdcrnnlhylndgadiifhtaaagivqNLSTGSQSFYLE-HTDDILCLTVNQHPKYrnVVATSQIGT 1439
Cdd:COG2319 190 ----------------------------------------DLATGKLLRTLTgHTGAVRSVAFSPDGKL--LASGSADGT 227
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1440 tpsIHIWDAMTKHTLSMLRcFHSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSD 1519
Cdd:COG2319 228 ---VRLWDLATGKLLRTLT-GHSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGK 301
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1520 TqFVSVGV-KHMKFWTLAGSALLykKGVIGSLGAAkmqtmLSVAFGANNLT-FTGAINGDVYVWK-DHFLIRLVAKAHTG 1596
Cdd:COG2319 302 L-LASGSDdGTVRLWDLATGKLL--RTLTGHTGAV-----RSVAFSPDGKTlASGSDDGTVRLWDlATGELLRTLTGHTG 373
|
490 500 510
....*....|....*....|....*....|....
gi 223005862 1597 PVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWD 1629
Cdd:COG2319 374 AVTSV-AFSPDGrTLASGS------ADGTVRLWD 400
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1699-1830 |
3.23e-05 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 47.38 E-value: 3.23e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1699 KDLFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAI-GMKNGEFVILLVNSLKVWGKKRDRKSAiQDIR 1777
Cdd:COG3391 80 RRLYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLYVaDSGNGRVSVIDTATGKVVATIPVGAGP-HGIA 158
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 223005862 1778 ISPDNRFLAVGSSE-HTVDFY----DLTQGTNLNRIgyckDIPSFVIQMDFSADGKYI 1830
Cdd:COG3391 159 VDPDGKRLYVANSGsNTVSVIvsviDTATGKVVATI----PVGGGPVGVAVSPDGRRL 212
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
5.55e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 5.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 223005862 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1691-1802 |
1.87e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 45.07 E-value: 1.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1691 WGLATHPSKD-LFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAIGMKNGEFVILLV-----NSLKVWg 1764
Cdd:COG3391 113 RGLAVDPDGGrLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVsvidtATGKVV- 191
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 223005862 1765 KKRDRKSAIQDIRISPDNRFLAV--------GSSEHTVDFYDLTQG 1802
Cdd:COG3391 192 ATIPVGGGPVGVAVSPDGRRLYVanrgsntsNGGSNTVSVIDLATL 237
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1682-1715 |
2.58e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.58e-04
10 20 30
....*....|....*....|....*....|....
gi 223005862 1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
3.40e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 3.40e-04
10 20 30
....*....|....*....|....*....|..
gi 223005862 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1682-1715 |
4.24e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 4.24e-04
10 20 30
....*....|....*....|....*....|....
gi 223005862 1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
7.88e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 7.88e-04
10 20 30
....*....|....*....|....*....|....*....
gi 223005862 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
1.25e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 1.25e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 223005862 760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
1.81e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 1.81e-03
10 20 30
....*....|....*....|....*....|....*....
gi 223005862 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
105-136 |
1.95e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.71 E-value: 1.95e-03
10 20 30
....*....|....*....|....*....|..
gi 223005862 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:pfam00400 10 HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1734-1798 |
2.54e-03 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 38.80 E-value: 2.54e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 223005862 1734 AAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRD-RKSAIQDIRISPDNRFLAVGSSEHTVDFYD 1798
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDkEDLEVTSLAWRPDGKLLAVGYSDGTVRLLD 66
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1921-1955 |
2.95e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.32 E-value: 2.95e-03
10 20 30
....*....|....*....|....*....|....*
gi 223005862 1921 KRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1955
Cdd:pfam00400 5 KTLEGHTGSVTSLAFSPDGKLLAS-GSDDGTVKVW 38
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
339-430 |
5.00e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 40.83 E-value: 5.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 339 LAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQL-ALGMKDGSFIVLRVRDMTEVVHIKDRKEViHEMKFS 417
Cdd:COG3391 82 LYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLyVADSGNGRVSVIDTATGKVVATIPVGAGP-HGIAVD 160
|
90
....*....|...
gi 223005862 418 PDGSYLAVGSNDG 430
Cdd:COG3391 161 PDGKRLYVANSGS 173
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1916-1955 |
5.57e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 5.57e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 223005862 1916 KFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1955
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS-GSDDGTIKLW 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
329-425 |
6.43e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 40.45 E-value: 6.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 329 WALALHPK-KPLAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQLALGMKDGSFI-----VLRVRDMTEVV 402
Cdd:COG3391 113 RGLAVDPDgGRLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVsvivsVIDTATGKVVA 192
|
90 100
....*....|....*....|...
gi 223005862 403 HIkDRKEVIHEMKFSPDGSYLAV 425
Cdd:COG3391 193 TI-PVGGGPVGVAVSPDGRRLYV 214
|
|
|