|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
2.58e-39 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.37 E-value: 2.58e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 2217327924 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
1.33e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 137.85 E-value: 1.33e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1438-1923 |
5.63e-32 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.80 E-value: 5.63e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1438 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1517
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1518 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1594
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1595 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1674
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1675 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1752
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1753 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1831
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1832 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1911
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 2217327924 1912 tGGDDCSVFVWR 1923
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
880-1266 |
1.82e-26 |
|
WD40 repeat [General function prediction only]; :
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 1.82e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217327924 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.37e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.37e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2217327924 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.59e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. :
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.59e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2217327924 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 super family |
cl29593 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
1.93e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment. The actual alignment was detected with superfamily member cd00200:
Pssm-ID: 475233 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.93e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217327924 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| HELP super family |
cl04081 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1350-1401 |
7.75e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons. The actual alignment was detected with superfamily member pfam03451:
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 7.75e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| COG4946 super family |
cl27624 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
5.78e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown]; The actual alignment was detected with superfamily member COG4946:
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 5.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2217327924 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
2.58e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.37 E-value: 2.58e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 2217327924 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
1.09e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 138.24 E-value: 1.09e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217327924 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
1.33e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 137.85 E-value: 1.33e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1140 |
3.62e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 136.96 E-value: 3.62e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 708 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 788 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVkcnphhvdklvtvgikhikfwqqagggftskrgTFGSVGKLetmm 867
Cdd:COG2319 136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV---------------------------------AFSPDGKL---- 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 868 cvsygrmedlVFSGAATGDIFIWkDIL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 942
Cdd:COG2319 177 ----------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 943 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 1019
Cdd:COG2319 233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1020 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 1099
Cdd:COG2319 268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 2217327924 1100 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1140
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1438-1923 |
5.63e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.80 E-value: 5.63e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1438 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1517
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1518 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1594
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1595 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1674
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1675 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1752
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1753 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1831
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1832 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1911
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 2217327924 1912 tGGDDCSVFVWR 1923
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
880-1266 |
1.82e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 1.82e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217327924 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1428-1730 |
4.15e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 4.15e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1428 HSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1506
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1507 LLYkkgvigSLGAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGker 1584
Cdd:cd00200 85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASS--- 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1585 ptKEGGAVKLWD-QEMKRCRAFQLETGQlvecVRSVC--RGKGKILVGTKDGEIIeVGEKNAASNI-LIDGHmEGEIWGL 1660
Cdd:cd00200 154 --SQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIK-LWDLSTGKCLgTLRGH-ENGVNSV 225
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217327924 1661 ATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1730
Cdd:cd00200 226 AFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.37e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.37e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2217327924 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.59e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.59e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2217327924 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1221 |
1.92e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.78 E-value: 1.92e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 1045
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1046 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1125
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1126 SYITHIDWDSRGKLLqvnSGAreqlffeaprgkrhiirpSEIEKIE-WDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKD 1204
Cdd:cd00200 220 NGVNSVAFSPDGYLL---ASG------------------SEDGTIRvWDLRTGECVQTLSG----HT--NSVTSLAWSPD 272
|
330
....*....|....*..
gi 2217327924 1205 CSLLATGDDFGFVKLFS 1221
Cdd:cd00200 273 GKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
1.93e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.93e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217327924 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1350-1401 |
7.75e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 7.75e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
5.78e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 5.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2217327924 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1649-1682 |
2.54e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.54e-04
10 20 30
....*....|....*....|....*....|....
gi 2217327924 1649 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1682
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
3.34e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 3.34e-04
10 20 30
....*....|....*....|....*....|..
gi 2217327924 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1649-1682 |
4.09e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 4.09e-04
10 20 30
....*....|....*....|....*....|....
gi 2217327924 1649 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1682
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
7.67e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 7.67e-04
10 20 30
....*....|....*....|....*....|....*....
gi 2217327924 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
1.21e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 1.21e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2217327924 760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-397 |
2.58e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 152.37 E-value: 2.58e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319 191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319 225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319 298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
|
330 340 350
....*....|....*....|....*....|
gi 2217327924 368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-466 |
3.26e-39 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 151.99 E-value: 3.26e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319 149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpitkidlreteqgykglsirsvcwk 292
Cdd:COG2319 183 DGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA--------------------------- 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 293 adrllagtqdseifevivreRDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEE-AVRS 371
Cdd:COG2319 235 --------------------TGKLLRTLTGH-SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSgGVNS 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 372 VAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECSKSL 451
Cdd:COG2319 294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT-----GELLRTL 368
|
410
....*....|....*....
gi 2217327924 452 ----SFITHIDWSLDSKYL 466
Cdd:COG2319 369 tghtGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
57-438 |
9.92e-37 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 144.67 E-value: 9.92e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319 32 LLGLAAAVASLAASPDGARLAAGAGDLT--LLLLDAAAGALLATLLG-HTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWtlcgNALTAKRgIFGKTGDLQTILCLAC-AKEDITYSG 214
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSAdGTVRLW----DLATGKL-LRTLTGHSGAVTSVAFsPDGKLLASG 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 215 ALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVC 290
Cdd:COG2319 181 SDDGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVA 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 291 WKAD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA 368
Cdd:COG2319 254 FSPDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG 331
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2217327924 369 -VRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319 332 aVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
57-353 |
1.09e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 138.24 E-value: 1.09e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200 5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200 152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2217327924 289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200 225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
725-1026 |
1.33e-35 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 137.85 E-value: 1.33e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200 7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200 82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200 154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200 234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
708-1140 |
3.62e-34 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 136.96 E-value: 3.62e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 708 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 787
Cdd:COG2319 59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 788 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVkcnphhvdklvtvgikhikfwqqagggftskrgTFGSVGKLetmm 867
Cdd:COG2319 136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV---------------------------------AFSPDGKL---- 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 868 cvsygrmedlVFSGAATGDIFIWkDIL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 942
Cdd:COG2319 177 ----------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 943 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 1019
Cdd:COG2319 233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1020 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 1099
Cdd:COG2319 268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 2217327924 1100 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1140
Cdd:COG2319 347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1438-1923 |
5.63e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 130.80 E-value: 5.63e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1438 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1517
Cdd:COG2319 1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1518 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1594
Cdd:COG2319 80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1595 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1674
Cdd:COG2319 147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1675 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1752
Cdd:COG2319 183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1753 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1831
Cdd:COG2319 262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1832 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1911
Cdd:COG2319 323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
|
490
....*....|..
gi 2217327924 1912 tGGDDCSVFVWR 1923
Cdd:COG2319 390 -GSADGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1402-1767 |
3.93e-31 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 128.11 E-value: 3.93e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1402 LSTGTTPSIHIWDAMTKHTLSMLRcFHSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEF 1481
Cdd:COG2319 94 ASASADGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSVAF 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1482 RPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSLGAAkmqtmLSVAFGANNLTF-TGAINGDVYVW--KDHFLIRLV 1557
Cdd:COG2319 171 SPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLaSGSADGTVRLWdlATGKLLRTL 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1558 aKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeii 1636
Cdd:COG2319 243 -TGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR---------------------- 282
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1637 evgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGM 1714
Cdd:COG2319 283 -----------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKTLASGS 349
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 1715 KNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEHTVDFYDLT 1767
Cdd:COG2319 350 DDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
718-1071 |
3.15e-30 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 125.41 E-value: 3.15e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319 186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaikrsalst 948
Cdd:COG2319 259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWD------------------- 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 949 sskglllednpsiraitlghghilvgTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELS 1028
Cdd:COG2319 317 --------------------------LATGKLLRT---------LTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLA 360
|
330 340 350 360
....*....|....*....|....*....|....*....|...
gi 2217327924 1029 AQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADT 1071
Cdd:COG2319 361 TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
274-844 |
2.69e-29 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 122.71 E-value: 2.69e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 274 LRETEQGYKGLSIRSVCWKADRLLAGTQDSEIFEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:COG2319 28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTVRLWD 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 354 LADHALIARCNM-EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPV 432
Cdd:COG2319 107 LATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 433 DVYAVAQrykkiGECSKSL----SFITHIDWSLDSKYLqtndgagerlfyrmpsgkpltskeeikgipwaswtcvkgpev 508
Cdd:COG2319 187 RLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLL------------------------------------------ 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 509 sgiwpkytevtdinsvdanynssvlVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSV 588
Cdd:COG2319 220 -------------------------ASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTV 270
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 589 FQWRfipegvsngmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflk 668
Cdd:COG2319 271 RLWD---------------------------------------------------------------------------- 274
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 669 rekapedslklqfihgyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvG 748
Cdd:COG2319 275 ----------------------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--S 307
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 749 RDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHH 828
Cdd:COG2319 308 DDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG 384
|
570
....*....|....*..
gi 2217327924 829 vDKLVTVGI-KHIKFWQ 844
Cdd:COG2319 385 -RTLASGSAdGTVRLWD 400
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
880-1266 |
1.82e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 114.24 E-value: 1.82e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319 55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319 120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319 196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2217327924 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319 331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
943-1266 |
1.45e-24 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 108.46 E-value: 1.45e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 943 RSALSTSSKGLLLEDNPSIRAITLGHGHILVGTKNGEILEIDKSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTL 1022
Cdd:COG2319 24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTV 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1023 RIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVAS 1102
Cdd:COG2319 103 RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPD-GKLLASGS 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1103 HDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLLQVNSGAREqlffeaprgkrhiIRPseiekieWDtwtcVLGPT 1182
Cdd:COG2319 182 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT-------------VRL-------WD----LATGK 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1183 CEGIWPAHSDitDVNAASLTKDCSLLATGDDFGFVKLFSyPVKGQHARFKKyvGHSAHVTNVRWLHNDSVLLTvGGADTA 1262
Cdd:COG2319 238 LLRTLTGHSG--SVRSVAFSPDGRLLASGSADGTVRLWD-LATGELLRTLT--GHSGGVNSVAFSPDGKLLAS-GSDDGT 311
|
....
gi 2217327924 1263 LMIW 1266
Cdd:COG2319 312 VRLW 315
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1402-1685 |
5.65e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 103.84 E-value: 5.65e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1402 LSTGTTPSIHIWDAMTKHTLSMLRCfHSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEF 1481
Cdd:COG2319 136 ASGSADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAF 212
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1482 RPDSDTqFVSVGV-KHMKFWTLAGSALLYKKGviGSLGAAkmqtmLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVA 1558
Cdd:COG2319 213 SPDGKL-LASGSAdGTVRLWDLATGKLLRTLT--GHSGSV-----RSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTL 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1559 KAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLV-------ECVRSVC-RGKGKILV-G 1629
Cdd:COG2319 285 TGHSGGVNSVAFSPDGKLLASGS------DDGTVRLWD----------LATGKLLrtltghtGAVRSVAfSPDGKTLAsG 348
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1630 TKDGEI----IEVGEKNAAsnilIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLAD 1685
Cdd:COG2319 349 SDDGTVrlwdLATGELLRT----LTGH-TGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
723-929 |
7.39e-23 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 100.87 E-value: 7.39e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 723 YLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:cd00200 89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWD 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 803 WKKGEKIATTRGHKDKIFVVKCNPHHVDKLVTVGIKHIKFWQQAGGgftSKRGTFgsVGKLETMMCVSYGRMEDLVFSGA 882
Cdd:cd00200 164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL--RGHENGVNSVAFSPDGYLLASGS 238
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 2217327924 883 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 929
Cdd:cd00200 239 EDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1428-1730 |
4.15e-21 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 95.86 E-value: 4.15e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1428 HSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1506
Cdd:cd00200 8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1507 LLYkkgvigSLGAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGker 1584
Cdd:cd00200 85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASS--- 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1585 ptKEGGAVKLWD-QEMKRCRAFQLETGQlvecVRSVC--RGKGKILVGTKDGEIIeVGEKNAASNI-LIDGHmEGEIWGL 1660
Cdd:cd00200 154 --SQDGTIKLWDlRTGKCVATLTGHTGE----VNSVAfsPDGEKLLSSSSDGTIK-LWDLSTGKCLgTLRGH-ENGVNSV 225
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2217327924 1661 ATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1730
Cdd:cd00200 226 AFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
228-466 |
1.03e-20 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 94.71 E-value: 1.03e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 228 LVRTIQGaHSAGIFSM--YACEEGFATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWKAD--RLLAGTQDS 303
Cdd:cd00200 1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADgtYLASGSSDK 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 304 EIFevIVRERDKPML-ILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVAFSPDGSQL 381
Cdd:cd00200 74 TIR--LWDLETGECVrTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFV 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 382 ALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGECSKSLSFITHIDWSL 461
Cdd:cd00200 151 ASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSP 229
|
....*
gi 2217327924 462 DSKYL 466
Cdd:cd00200 230 DGYLL 234
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
521-1112 |
3.39e-20 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 95.36 E-value: 3.39e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipegvsn 600
Cdd:COG2319 81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLWD-------- 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 601 gmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflkrekapedslklq 680
Cdd:COG2319 --------------------------------------------------------------------------------
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 681 fihgyrgydcrnnlfyTQAGEVVYHIAavavvynrqqhsqrlylGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQT 760
Cdd:COG2319 149 ----------------LATGKLLRTLT-----------------GHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLAT 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 761 LKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFvvkcnphhvdklvtvgikhi 840
Cdd:COG2319 194 GKLLRTLTG-HTGAVRSVAFSPDGKLLASGSAD--GTVRLWDLATGKLLRTLTGHSGSVR-------------------- 250
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 841 kfwqqagggftskrgtfgsvgkletmmcvsygrmeDLVFSgaatgdifiwkdilllktvkaHDGPVFAmyaldkgfvTGG 920
Cdd:COG2319 251 -----------------------------------SVAFS---------------------PDGRLLA---------SGS 265
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 921 KDGIVELWDdmferclktyaikrsalstsskglllednpsiraitlghghilvgTKNGEILEidksgpmtlLVQGHmEGE 1000
Cdd:COG2319 266 ADGTVRLWD---------------------------------------------LATGELLR---------TLTGH-SGG 290
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1001 VWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHH 1080
Cdd:COG2319 291 VNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG 370
|
570 580 590
....*....|....*....|....*....|..
gi 2217327924 1081 RKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 1112
Cdd:COG2319 371 HTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
2-48 |
5.37e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.37e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2217327924 2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
668-715 |
5.59e-20 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 85.68 E-value: 5.59e-20
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 2217327924 668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451 25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1559-1923 |
2.70e-19 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 90.47 E-value: 2.70e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1559 KAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRC-RAFQLETGQlVECVRSVCRGKgKILVGTKDGEIIE 1637
Cdd:cd00200 6 KGHTGGVTCVAFSPDGKLLATGS------GDGTIKVWDLETGELlRTLKGHTGP-VRDVAASADGT-YLASGSSDKTIRL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1638 VGEKNAASNILIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMK 1715
Cdd:cd00200 78 WDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCL-TTLRGHtdWVNSVAFSPDGTFVASSSQ 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1716 NGefvillvnSLKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQ 1787
Cdd:cd00200 156 DG--------TIKLWdlrtGKCVATltghTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNS 224
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1788 MDFSADGKYIqvstgaykrqvhevplgkqvteavviekitwaswTSVLGDEVIGIWprnadkadvncacvthaglNIVTG 1867
Cdd:cd00200 225 VAFSPDGYLL----------------------------------ASGSEDGTIRVW-------------------DLRTG 251
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 2217327924 1868 DdfglvklfdfpctekfaKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1923
Cdd:cd00200 252 E-----------------CVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
895-1221 |
1.92e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 87.78 E-value: 1.92e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200 1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 1045
Cdd:cd00200 77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1046 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1125
Cdd:cd00200 141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1126 SYITHIDWDSRGKLLqvnSGAreqlffeaprgkrhiirpSEIEKIE-WDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKD 1204
Cdd:cd00200 220 NGVNSVAFSPDGYLL---ASG------------------SEDGTIRvWDLRTGECVQTLSG----HT--NSVTSLAWSPD 272
|
330
....*....|....*..
gi 2217327924 1205 CSLLATGDDFGFVKLFS 1221
Cdd:cd00200 273 GKRLASGSADGTIRIWD 289
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1649-1923 |
9.68e-18 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 85.85 E-value: 9.68e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1649 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNgefvillvNS 1726
Cdd:cd00200 5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK-GHTGpvRDVAASADGTYLASGSSD--------KT 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1727 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQMDFSADGKYiq 1798
Cdd:cd00200 75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF-- 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1799 VSTGAYKRQVH--EVPLGKqvteavVIEKITwaswtsvlgdevigiwprnADKADVNCACVTHAGLNIVTGDDFGLVKLF 1876
Cdd:cd00200 150 VASSSQDGTIKlwDLRTGK------CVATLT-------------------GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 2217327924 1877 DFpctEKFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1923
Cdd:cd00200 205 DL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
291-929 |
1.08e-17 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 87.66 E-value: 1.08e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 291 WKADRLLAGTQDSEIFEVIVRERDKPMLILQGHCEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEAVR 370
Cdd:COG2319 3 SADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 371 SVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVyavaqrykkigecsks 450
Cdd:COG2319 83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRL---------------- 146
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 451 lsfithidWSLDSkylqtndgagerlfyrmpsGKPLtskeeikgipwaswtcvkgpevsgiwpkytevtdinsvdanyns 530
Cdd:COG2319 147 --------WDLAT-------------------GKLL-------------------------------------------- 155
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 531 svlvsgddfglvklfkfpclkrgakfRKYVGHSAHVTNVRWSHDFQWvLSTGGADHSVFQWRfipegvsngmletapqeg 610
Cdd:COG2319 156 --------------------------RTLTGHSGAVTSVAFSPDGKL-LASGSDDGTVRLWD------------------ 190
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 611 gadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflkrekapedslklqfihgyrgydc 690
Cdd:COG2319 --------------------------------------------------------------------------------
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 691 rnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGq 770
Cdd:COG2319 191 ------LATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLWDLATGKLLRTLTG- 244
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 771 HQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVTVGI-KHIKFWQQAG 847
Cdd:COG2319 245 HSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP---DgkLLASGSDdGTVRLWDLAT 319
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 848 GgftSKRGTFGsvGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDG 923
Cdd:COG2319 320 G---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWdlATGELLRTLTGHTGAVTSVAFSPDGrtLASGSADG 394
|
....*.
gi 2217327924 924 IVELWD 929
Cdd:COG2319 395 TVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
46-177 |
4.06e-16 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 80.84 E-value: 4.06e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 46 VYNTREHS-QKFFLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASV 124
Cdd:cd00200 161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 2217327924 125 GLDakNTVCIWDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFW 177
Cdd:cd00200 238 SED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP-DGKRLASGSAdGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
55-179 |
5.24e-16 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 82.65 E-value: 5.24e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 55 KFFLGHNDDIISLALHPDKTLVATGqvGKEPYICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCI 134
Cdd:COG2319 282 RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRL 356
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 2217327924 135 WDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWTL 179
Cdd:COG2319 357 WDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSAdGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
521-826 |
1.93e-14 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 75.83 E-value: 1.93e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200 81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200 137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2217327924 753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200 201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
|
|
| HELP |
pfam03451 |
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ... |
1350-1401 |
7.75e-13 |
|
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family, identified to-date, are constructed with an amino terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.
Pssm-ID: 460922 Cd Length: 72 Bit Score: 65.27 E-value: 7.75e-13
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 2217327924 1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451 20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
1123-1596 |
4.02e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 70.33 E-value: 4.02e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1123 GASSYITHIDWDSRGKLLQVNSGAREQLFFEA-PRGKRHIIRPSEIEKIE-WDtwtcVLGPTCEGIWPAHSDitDVNAAS 1200
Cdd:COG2319 54 GAGDLTLLLLDAAAGALLATLLGHTAAVLSVAfSPDGRLLASASADGTVRlWD----LATGLLLRTLTGHTG--AVRSVA 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1201 LTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWlHNDSVLLTVGGADTALMIWtrefvgtqesklvd 1280
Cdd:COG2319 128 FSPDGKTLASGSADGTVRLWDLATGKLLRTLT---GHSGAVTSVAF-SPDGKLLASGSDDGTVRLW-------------- 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1281 seesdtdveedggydsDVAREKAidyttkiyavsIREMEGtkpHQQlkevsveerpPVSRAAPQPeklqknnitkkkklv 1360
Cdd:COG2319 190 ----------------DLATGKL-----------LRTLTG---HTG----------AVRSVAFSP--------------- 214
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1361 eelaldhvfgyrgfdcrnnlhylnDGadiifHTAAAGivqnlSTGTTpsIHIWDAMTKHTLSMLRcFHSKGVNYINFSAT 1440
Cdd:COG2319 215 ------------------------DG-----KLLASG-----SADGT--VRLWDLATGKLLRTLT-GHSGSVRSVAFSPD 257
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1441 GKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTqFVSVGV-KHMKFWTLAGSALLykKGVIGSLGA 1519
Cdd:COG2319 258 GRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKL-LASGSDdGTVRLWDLATGKLL--RTLTGHTGA 332
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1520 AkmqtmLSVAFGANNLT-FTGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWD 1596
Cdd:COG2319 333 V-----RSVAFSPDGKTlASGSDDGTVRLWDlATGELLRTLTGHTGAVTSV-AFSPDGrTLASGS------ADGTVRLWD 400
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1040-1266 |
5.35e-12 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 68.52 E-value: 5.35e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1040 KKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVG 1119
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLETGECVR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1120 ICKGASSYITHIDWDSRGKLLqvnSGAREQlffeaprgKRHIIrpseiekieWDTWTcvlgPTCEGIWPAHSDitDVNAA 1199
Cdd:cd00200 88 TLTGHTSYVSSVAFSPDGRIL---SSSSRD--------KTIKV---------WDVET----GKCLTTLRGHTD--WVNSV 141
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2217327924 1200 SLTKDCSLLATGDDFGFVKLFSyPVKGQhaRFKKYVGHSAHVTNVRWlHNDSVLLTVGGADTALMIW 1266
Cdd:cd00200 142 AFSPDGTFVASSSQDGTIKLWD-LRTGK--CVATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLW 204
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
1402-1596 |
9.37e-09 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 58.89 E-value: 9.37e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1402 LSTGTTPSIHIWDAMTKHTLSMLRCfHSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEF 1481
Cdd:cd00200 109 SSSSRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWDLRTGKCVATLTGHTGEVNSVAF 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1482 RPDSDTQFVSVGVKHMKFWTLAGSALLYkkgvigsLGAAKMQTMLSVAFGANNLTFTGA-INGDVYVWK-DHFLIRLVAK 1559
Cdd:cd00200 186 SPDGEKLLSSSSDGTIKLWDLSTGKCLG-------TLRGHENGVNSVAFSPDGYLLASGsEDGTIRVWDlRTGECVQTLS 258
|
170 180 190
....*....|....*....|....*....|....*..
gi 2217327924 1560 AHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWD 1596
Cdd:cd00200 259 GHTNSVTSLAWSPDGKRLASGS------ADGTIRIWD 289
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1666-1797 |
3.20e-05 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 47.38 E-value: 3.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1666 KDLFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAI-GMKNGEFVILLVNSLKVWGKKRDRKSAiQDIR 1744
Cdd:COG3391 80 RRLYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLYVaDSGNGRVSVIDTATGKVVATIPVGAGP-HGIA 158
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 2217327924 1745 ISPDNRFLAVGSSE-HTVDFY----DLTQGTNLNRIgyckDIPSFVIQMDFSADGKYI 1797
Cdd:COG3391 159 VDPDGKRLYVANSGsNTVSVIvsviDTATGKVVATI----PVGGGPVGVAVSPDGRRL 212
|
|
| COG4946 |
COG4946 |
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ... |
368-488 |
5.78e-05 |
|
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];
Pssm-ID: 443973 [Multi-domain] Cd Length: 1072 Bit Score: 48.50 E-value: 5.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946 344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 2217327924 443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946 424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
1658-1769 |
1.85e-04 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 45.07 E-value: 1.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 1658 WGLATHPSKD-LFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAIGMKNGEFVILLV-----NSLKVWg 1731
Cdd:COG3391 113 RGLAVDPDGGrLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVsvidtATGKVV- 191
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 2217327924 1732 KKRDRKSAIQDIRISPDNRFLAV--------GSSEHTVDFYDLTQG 1769
Cdd:COG3391 192 ATIPVGGGPVGVAVSPDGRRLYVanrgsntsNGGSNTVSVIDLATL 237
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1649-1682 |
2.54e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 2.54e-04
10 20 30
....*....|....*....|....*....|....
gi 2217327924 1649 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1682
Cdd:smart00320 8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
105-136 |
3.34e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 39.99 E-value: 3.34e-04
10 20 30
....*....|....*....|....*....|..
gi 2217327924 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320 11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1649-1682 |
4.09e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 39.64 E-value: 4.09e-04
10 20 30
....*....|....*....|....*....|....
gi 2217327924 1649 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1682
Cdd:pfam00400 7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
315-353 |
7.67e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.87 E-value: 7.67e-04
10 20 30
....*....|....*....|....*....|....*....
gi 2217327924 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400 2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
760-802 |
1.21e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 1.21e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2217327924 760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320 1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
315-353 |
1.76e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 37.68 E-value: 1.76e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2217327924 315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:smart00320 3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
105-136 |
1.90e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.71 E-value: 1.90e-03
10 20 30
....*....|....*....|....*....|..
gi 2217327924 105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:pfam00400 10 HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
|
|
| ANAPC4_WD40 |
pfam12894 |
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ... |
1701-1765 |
2.57e-03 |
|
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain.The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,
Pssm-ID: 403945 [Multi-domain] Cd Length: 91 Bit Score: 38.80 E-value: 2.57e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2217327924 1701 AAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRD-RKSAIQDIRISPDNRFLAVGSSEHTVDFYD 1765
Cdd:pfam12894 1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDkEDLEVTSLAWRPDGKLLAVGYSDGTVRLLD 66
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
1888-1922 |
2.90e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 37.32 E-value: 2.90e-03
10 20 30
....*....|....*....|....*....|....*
gi 2217327924 1888 KRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1922
Cdd:pfam00400 5 KTLEGHTGSVTSLAFSPDGKLLAS-GSDDGTVKVW 38
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
339-430 |
4.96e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 40.83 E-value: 4.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 339 LAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQL-ALGMKDGSFIVLRVRDMTEVVHIKDRKEViHEMKFS 417
Cdd:COG3391 82 LYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLyVADSGNGRVSVIDTATGKVVATIPVGAGP-HGIAVD 160
|
90
....*....|...
gi 2217327924 418 PDGSYLAVGSNDG 430
Cdd:COG3391 161 PDGKRLYVANSGS 173
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
1883-1922 |
5.42e-03 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 36.52 E-value: 5.42e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 2217327924 1883 KFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1922
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS-GSDDGTIKLW 39
|
|
| YncE |
COG3391 |
DNA-binding beta-propeller fold protein YncE [General function prediction only]; |
329-425 |
6.37e-03 |
|
DNA-binding beta-propeller fold protein YncE [General function prediction only];
Pssm-ID: 442618 [Multi-domain] Cd Length: 237 Bit Score: 40.45 E-value: 6.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2217327924 329 WALALHPK-KPLAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQLALGMKDGSFI-----VLRVRDMTEVV 402
Cdd:COG3391 113 RGLAVDPDgGRLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVsvivsVIDTATGKVVA 192
|
90 100
....*....|....*....|...
gi 2217327924 403 HIkDRKEVIHEMKFSPDGSYLAV 425
Cdd:COG3391 193 TI-PVGGGPVGVAVSPDGRRLYV 214
|
|
|