Conserved domains on  [gi|223005862|ref|NP_001034842|]

echinoderm microtubule-associated protein-like 6 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
57-397 1.85e-39

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.76  E-value: 1.85e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319   116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319   191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319   225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319   298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
                         330       340       350
                  ....*....|....*....|....*....|
gi 223005862  368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319   374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
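Each alignment block pairs a row of the query (`gi 223005862`) with a row of the domain model (`Cdd:…`). The sketch below is one way to summarize such a pair by counting aligned and identical columns. It assumes that `-` marks a gap and that lowercase residues are unaligned insertions, which is how these blocks appear to be laid out; the function name `alignment_identity` is illustrative and not part of any NCBI tool.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for one pair of alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Count only columns where both rows hold an uppercase residue.
        # Gap columns ('-') and lowercase columns (assumed unaligned) are skipped.
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# First 30 columns of the first alignment block above (query row, then COG2319 row).
query_row = "FLGHNDDIISLALHPDKTLVATGQVGKEpy"
model_row = "LTGHTGAVRSVAFSPDGKTLASGSADGT--"
identical, aligned = alignment_identity(query_row, model_row)
print(f"{identical} identical of {aligned} aligned columns")
```

Percent identity over aligned columns is then `100 * identical / aligned`. Whether unaligned insertions should count in the denominator is a design choice; this sketch excludes them.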
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1026 1.37e-35

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 137.85  E-value: 1.37e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200    82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200   154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 223005862  955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200   234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
1471-1956 4.30e-32

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 131.19  E-value: 4.30e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1471 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1550
Cdd:COG2319     1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1551 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1627
Cdd:COG2319    80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1628 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1707
Cdd:COG2319   147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1708 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1785
Cdd:COG2319   183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1786 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1864
Cdd:COG2319   262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1865 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1944
Cdd:COG2319   323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
                         490
                  ....*....|..
gi 223005862 1945 tGGDDCSVFVWR 1956
Cdd:COG2319   390 -GSADGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
880-1266 1.29e-26

WD40 repeat [General function prediction only];



Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.62  E-value: 1.29e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319    55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319   120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319   196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319   275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319   331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-48 5.47e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.



Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.47e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 223005862     2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
668-715 5.68e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.



Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.68e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 223005862   668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
WD40 super family cl29593
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-826 1.96e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


The actual alignment was detected with superfamily member cd00200:

Pssm-ID: 475233 [Multi-domain]  Cd Length: 289  Bit Score: 75.83  E-value: 1.96e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200    12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200    81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200   137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 223005862  753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200   201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
HELP super family cl04081
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
1350-1401 7.88e-13

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


The actual alignment was detected with superfamily member pfam03451:

Pssm-ID: 460922  Cd Length: 72  Bit Score: 65.27  E-value: 7.88e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 223005862  1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451   20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
COG4946 super family cl27624
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
368-488 5.55e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


The actual alignment was detected with superfamily member COG4946:

Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 48.50  E-value: 5.55e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946   344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 223005862  443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946   424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
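
Every hit above carries a summary line of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, with an optional `[Multi-domain]` tag. A minimal sketch of pulling those fields out of the plain-text listing follows. The `HitSummary` type and `parse_summary` function are hypothetical helpers written for this page's layout, not an NCBI API, and the regular expression assumes the field order shown here.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Field order and labels are taken from the summary lines in this listing.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<bits>[0-9.]+)"
    r"\s+E-value:\s*(?P<evalue>[0-9.]+(?:[eE][-+]?\d+)?)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Parse one 'Pssm-ID: ...' summary line; return None if the line does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        return None
    return HitSummary(
        pssm_id=int(match.group("pssm")),
        multi_domain=match.group("multi") is not None,
        cd_length=int(match.group("length")),
        bit_score=float(match.group("bits")),
        e_value=float(match.group("evalue")),
    )


hit = parse_summary(
    "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.76  E-value: 1.85e-39"
)
print(hit)
```

Returning `None` for non-matching lines lets the same function be mapped over every line of the listing, keeping only the summaries.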
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
57-397 1.85e-39

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.76  E-value: 1.85e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319   116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319   191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319   225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319   298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
                         330       340       350
                  ....*....|....*....|....*....|
gi 223005862  368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319   374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
57-353 1.11e-35

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 138.24  E-value: 1.11e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200     5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200    80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200   152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 223005862  289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200   225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1026 1.37e-35

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 137.85  E-value: 1.37e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200    82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200   154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 223005862  955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200   234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
708-1140 2.71e-34

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 137.35  E-value: 2.71e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  708 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 787
Cdd:COG2319    59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  788 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVkcnphhvdklvtvgikhikfwqqagggftskrgTFGSVGKLetmm 867
Cdd:COG2319   136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV---------------------------------AFSPDGKL---- 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  868 cvsygrmedlVFSGAATGDIFIWkDIL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 942
Cdd:COG2319   177 ----------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  943 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 1019
Cdd:COG2319   233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1020 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 1099
Cdd:COG2319   268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|.
gi 223005862 1100 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1140
Cdd:COG2319   347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
WD40 COG2319
WD40 repeat [General function prediction only];
1471-1956 4.30e-32

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 131.19  E-value: 4.30e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1471 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1550
Cdd:COG2319     1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1551 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1627
Cdd:COG2319    80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1628 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1707
Cdd:COG2319   147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1708 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1785
Cdd:COG2319   183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1786 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1864
Cdd:COG2319   262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1865 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1944
Cdd:COG2319   323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
                         490
                  ....*....|..
gi 223005862 1945 tGGDDCSVFVWR 1956
Cdd:COG2319   390 -GSADGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
880-1266 1.29e-26

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.62  E-value: 1.29e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319    55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319   120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319   196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319   275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319   331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
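Each hit in this listing carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch of how such a line could be pulled into structured fields. It assumes the line layout seen on this page; the names `HitSummary` and `parse_summary` are hypothetical and are not part of any NCBI tool.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Layout assumed from the summary lines shown on this page; the
# "[Multi-domain]" tag is optional because some hits omit it.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary, or None if the line does not match."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


if __name__ == "__main__":
    line = ("Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  "
            "Bit Score: 114.62  E-value: 1.29e-26")
    print(parse_summary(line))
```

Parsing the E-value as a float makes it straightforward to sort or filter hits numerically rather than as text.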
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1413-1763 1.24e-21

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 97.41  E-value: 1.24e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1413 HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDamtKHTLSMLRCF--HSKGVNYINFSATGKLLVSVGVDpeHTITV 1490
Cdd:cd00200     8 HTGGVTCVAFSPDGKL--LATGSGDGT---IKVWD---LETGELLRTLkgHTGPVRDVAASADGTYLASGSSD--KTIRL 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1491 WRWQEGAKVASRGGHLERIFVVEFRPDSdtQFVSVGVKH--MKFWTLAgsallykKGVIGSLGAAKMQTMLSVAF-GANN 1567
Cdd:cd00200    78 WDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE-------TGKCLTTLRGHTDWVNSVAFsPDGT 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1568 LTFTGAINGDVYVW--KDHFLIRlVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRCRAfqletgql 1645
Cdd:cd00200   149 FVASSSQDGTIKLWdlRTGKCVA-TLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLG-------- 213
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1646 vecvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKV 1725
Cdd:cd00200   214 -----------------------------------TLRGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 223005862 1726 SlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1763
Cdd:cd00200   258 S-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
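The WD40 description above characterizes a repeat as roughly 40 residues long, with a GH dipeptide near the N-terminus and a WD dipeptide at the C-terminus. As a rough illustration only, the sketch below scans a sequence for GH...WD spans. The allowed spacing of 10 to 30 residues between the two dipeptides is an assumption chosen for this example, not a value taken from CDD, and the function name `find_gh_wd_spans` is hypothetical. Real WD40 detection relies on profile models such as the PSSMs listed here, since many genuine repeats lack an exact GH or WD.

```python
import re
from typing import List, Tuple

# Assumed spacing between the GH and WD dipeptides (illustrative only).
# The lookahead lets overlapping candidates be reported as well.
_GH_WD_RE = re.compile(r"(?=(GH[A-Z]{10,30}?WD))")


def find_gh_wd_spans(sequence: str) -> List[Tuple[int, int]]:
    """Return 0-based, half-open (start, end) spans of GH...WD candidates.

    The sequence is upper-cased first, so lowercase (unaligned) residues
    copied from an alignment row are treated like any other residue.
    """
    seq = sequence.upper()
    return [(m.start(1), m.end(1)) for m in _GH_WD_RE.finditer(seq)]


if __name__ == "__main__":
    # Query residues 1682-1715, copied from the smart00320 hit on this page.
    repeat = "IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD"
    for start, end in find_gh_wd_spans(repeat):
        print(start, end, repeat[start:end])
```

A lazy quantifier is used so that each GH is paired with the nearest WD inside the allowed window.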
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-48 5.47e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.47e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 223005862     2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
668-715 5.68e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.68e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 223005862   668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
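The paired rows in each alignment (the query `gi 223005862` above the `Cdd:` row) can be compared column by column. The sketch below computes percent identity over aligned columns for two such rows. It assumes that lowercase letters mark unaligned residues and dashes mark gaps, so only columns where both rows carry an uppercase letter are counted; the function name `percent_identity` is hypothetical, and this is not the scoring used to produce the bit scores on this page.

```python
def percent_identity(query_row: str, subject_row: str) -> float:
    """Percent of aligned columns in which both rows have the same residue.

    A column counts as aligned only when both characters are uppercase
    letters; lowercase residues and '-' gaps are skipped (assumption).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")

    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1

    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


if __name__ == "__main__":
    # Rows copied from the first pfam03451 (HELP) hit, residues 2-48.
    query = "ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN"
    subject = "QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD"
    print(f"{percent_identity(query, subject):.1f}% identity")
```

For alignments that span several blocks, the sequence portions of each block would need to be concatenated (without the coordinate columns) before calling the function.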
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
895-1221 1.95e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 87.78  E-value: 1.95e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 1045
Cdd:cd00200    77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1046 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1125
Cdd:cd00200   141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1126 SYITHIDWDSRGKLLqvnSGAreqlffeaprgkrhiirpSEIEKIE-WDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKD 1204
Cdd:cd00200   220 NGVNSVAFSPDGYLL---ASG------------------SEDGTIRvWDLRTGECVQTLSG----HT--NSVTSLAWSPD 272
                         330
                  ....*....|....*..
gi 223005862 1205 CSLLATGDDFGFVKLFS 1221
Cdd:cd00200   273 GKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-826 1.96e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 75.83  E-value: 1.96e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200    12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200    81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200   137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 223005862  753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200   201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
1350-1401 7.88e-13

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so-named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP Protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 65.27  E-value: 7.88e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 223005862  1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451   20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
368-488 5.55e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 48.50  E-value: 5.55e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946   344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 223005862  443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946   424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1682-1715 2.58e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 2.58e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 223005862   1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:smart00320    8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
105-136 3.40e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 3.40e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 223005862    105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320   11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1682-1715 4.24e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 39.64  E-value: 4.24e-04
                           10        20        30
                   ....*....|....*....|....*....|....
gi 223005862  1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:pfam00400    7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
315-353 7.88e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 7.88e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 223005862   315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400    2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
760-802 1.25e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.06  E-value: 1.25e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 223005862    760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320    1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
 
Name Accession Description Interval E-value
WD40 COG2319
WD40 repeat [General function prediction only];
57-397 1.85e-39

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.76  E-value: 1.85e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319   116 LTGHTGAVRSVAFSPDGKTLASGSADGT--VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GTVRLWD 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgktgDLQTILclacakeditySGAL 216
Cdd:COG2319   191 LATGKLLRTLTGHTGAVRSVAFSP-----------------------------------DGKLLA-----------SGSA 224
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWK 292
Cdd:COG2319   225 DGTVRLWdlATGKLLRTLTG-HSGSVRSVAFSPDGrlLASGSADGTVRLWDLATG-----ELLRTLTGHSG-GVNSVAFS 297
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  293 AD--RLLAGTQDSEI--FEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EE 367
Cdd:COG2319   298 PDgkLLASGSDDGTVrlWDL---ATGKLLRTLTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGhTG 373
                         330       340       350
                  ....*....|....*....|....*....|
gi 223005862  368 AVRSVAFSPDGSQLALGMKDGSFIVLRVRD 397
Cdd:COG2319   374 AVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
WD40 COG2319
WD40 repeat [General function prediction only];
57-466 2.63e-39

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 152.37  E-value: 2.63e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319    74 LLGHTAAVLSVAFSPDGRLLASASADGT--VRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPyqpnrvvscgvkhikfwtlcgnaltakrgifgkTGDLqtilcLAcakeditySGAL 216
Cdd:COG2319   149 LATGKLLRTLTGHSGAVTSVAFSP---------------------------------DGKL-----LA--------SGSD 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  217 NGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpitkidlreteqgykglsirsvcwk 292
Cdd:COG2319   183 DGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA--------------------------- 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  293 adrllagtqdseifevivreRDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEE-AVRS 371
Cdd:COG2319   235 --------------------TGKLLRTLTGH-SGSVRSVAFSPDGRLLASGSADGTVRLWDLATGELLRTLTGHSgGVNS 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  372 VAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQrykkiGECSKSL 451
Cdd:COG2319   294 VAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLAT-----GELLRTL 368
                         410
                  ....*....|....*....
gi 223005862  452 ----SFITHIDWSLDSKYL 466
Cdd:COG2319   369 tghtGAVTSVAFSPDGRTL 387
WD40 COG2319
WD40 repeat [General function prediction only];
57-438 7.42e-37

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 145.05  E-value: 7.42e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:COG2319    32 LLGLAAAVASLAASPDGARLAAGAGDLT--LLLLDAAAGALLATLLG-HTAAVLSVAFSPDGRLLASASAD--GTVRLWD 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWtlcgNALTAKRgIFGKTGDLQTILCLAC-AKEDITYSG 214
Cdd:COG2319   107 LATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSAdGTVRLW----DLATGKL-LRTLTGHSGAVTSVAFsPDGKLLASG 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  215 ALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDfkpiTKiDLRETEQGYKGlSIRSVC 290
Cdd:COG2319   181 SDDGTVRLWdlATGKLLRTLTG-HTGAVRSVAFSPDGklLASGSADGTVRLWDLA----TG-KLLRTLTGHSG-SVRSVA 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  291 WKAD--RLLAGTQDSEIfEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEA 368
Cdd:COG2319   254 FSPDgrLLASGSADGTV-RLWDLATGELLRTLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTG 331
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 223005862  369 -VRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA 438
Cdd:COG2319   332 aVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
57-353 1.11e-35

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 138.24  E-value: 1.11e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   57 FLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:cd00200     5 LKGHTGGVTCVAFSPDGKLLATGSGDGT--IKVWDLETGELLRTLK-GHTGPVRDVAASADGTYLASGSSD--KTIRLWD 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  137 WRKGKLLASATGHSDRIFDISWDPYqpNRVVSCGVKH--IKFWTL-CGNALTAKRGIFGktgdlqTILCLA-CAKEDITY 212
Cdd:cd00200    80 LETGECVRTLTGHTSYVSSVAFSPD--GRILSSSSRDktIKVWDVeTGKCLTTLRGHTD------WVNSVAfSPDGTFVA 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  213 SGALNGDIYVW--KGLNLVRTIQGaHSAGIFSMYACEEG--FATGGRDGCIRLWDTDFKPITKIdLRETEQGykglsIRS 288
Cdd:cd00200   152 SSSQDGTIKLWdlRTGKCVATLTG-HTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGT-LRGHENG-----VNS 224
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 223005862  289 VCWKADRLL--AGTQDS--EIFEVivrERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:cd00200   225 VAFSPDGYLlaSGSEDGtiRVWDL---RTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
725-1026 1.37e-35

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 137.85  E-value: 1.37e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  725 GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK 804
Cdd:cd00200     7 GHTGGVTCVAFSPDGKLLATG--SGDGTIKVWDLETGELLRTLKG-HTGPVRDVAASADGTYLASGSSD--KTIRLWDLE 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  805 KGEKIATTRGHKDKIFVVKCNPHHvdKLVTVGIKH--IKFWQ-QAGGGFTSKRGTFGSVgkletmMCVSYGRMEDLVFSG 881
Cdd:cd00200    82 TGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDvETGKCLTTLRGHTDWV------NSVAFSPDGTFVASS 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  882 AATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWDDMFERCLKTYAIKRS---ALSTSSKGLL 954
Cdd:cd00200   154 SQDGTIKLWdlRTGKCVATLTGHTGEVNSVAFSPDGekLLSSSSDGTIKLWDLSTGKCLGTLRGHENgvnSVAFSPDGYL 233
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 223005862  955 LednpsiraitlghghiLVGTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWE 1026
Cdd:cd00200   234 L----------------ASGSEDGTIRVWDlRTGECVQTLSGH-TNSVTSLAWSPDGKRLASGSADGTIRIWD 289
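The cd00200 description above characterizes a WD40 repeat by a GH dipeptide near its start and a WD dipeptide at its end. As a rough illustration only, the Python sketch below scans a sequence for GH...WD spans. The function name `find_gh_wd_candidates` and the gap bounds (20 to 35 residues between the two dipeptides) are assumptions chosen for this example, not values taken from CDD, and a plain pattern match like this is far less sensitive than the profile-based search that produced the hits on this page.

```python
import re


def find_gh_wd_candidates(seq, min_gap=20, max_gap=35):
    """Return 0-based, half-open (start, end) spans that begin with GH and
    end with the nearest WD lying min_gap..max_gap residues downstream.

    The gap bounds are illustrative assumptions, not CDD parameters.
    """
    # A lookahead lets candidate spans overlap; the lazy quantifier picks
    # the closest WD that satisfies the gap bounds.
    pattern = re.compile(r"(?=(GH[A-Z]{%d,%d}?WD))" % (min_gap, max_gap))
    return [(m.start(1), m.end(1)) for m in pattern.finditer(seq.upper())]


# Query residues 725-804, copied from the first row of the cd00200
# alignment above (the query row contains no gap characters).
fragment = (
    "GHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWK"
)

for start, end in find_gh_wd_candidates(fragment):
    print(start, end, fragment[start:end])
```

On this fragment the strict pattern reports only the first repeat. The next repeat also ends in WD, but its histidine follows Q rather than G, which is one reason a literal dipeptide scan misses repeats that a profile search still detects.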
WD40 COG2319
WD40 repeat [General function prediction only];
708-1140 2.71e-34

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 137.35  E-value: 2.71e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  708 AVAVVYNRQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCL 787
Cdd:COG2319    59 TLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASA--SADGTVRLWDLATGLLLRTLTG-HTGAVRSVAFSPDGKTL 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  788 VSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVkcnphhvdklvtvgikhikfwqqagggftskrgTFGSVGKLetmm 867
Cdd:COG2319   136 ASGSAD--GTVRLWDLATGKLLRTLTGHSGAVTSV---------------------------------AFSPDGKL---- 176
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  868 cvsygrmedlVFSGAATGDIFIWkDIL---LLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaik 942
Cdd:COG2319   177 ----------LASGSDDGTVRLW-DLAtgkLLRTLTGHTGAVRSVaFSPDgKLLASGSADGTVRLWD------------- 232
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  943 rsalstsskglllednpsiraitlghghilvgTKNGEILeidksgpmtLLVQGHmEGEVWGLAAHP---LLpicATVSDD 1019
Cdd:COG2319   233 --------------------------------LATGKLL---------RTLTGH-SGSVRSVAFSPdgrLL---ASGSAD 267
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1020 KTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLA 1099
Cdd:COG2319   268 GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPD-GKTLA 346
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|.
gi 223005862 1100 VASHDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLL 1140
Cdd:COG2319   347 SGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTL 387
WD40 COG2319
WD40 repeat [General function prediction only];
1471-1956 4.30e-32

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 131.19  E-value: 4.30e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1471 SATGKLLVSVGVDPEHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYKKGVIGSl 1550
Cdd:COG2319     1 ALSADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTA- 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1551 gaakmqTMLSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKL 1627
Cdd:COG2319    80 ------AVLSVAFSPDGRLLaSASADGTVRLWDlATGLLLRTLTGHTGAVRSV-AFSPDGkTLASGS------ADGTVRL 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1628 WDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASN 1707
Cdd:COG2319   147 WD----------LATGKLLR---------------------------------TLTGH-SGAVTSVAFSPDGKLLASGSD 182
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1708 DGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFL 1785
Cdd:COG2319   183 DGTVRLWDLATGKLLRTLT-GHTGavRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPDGRLL 261
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1786 AVGSSEHTVDFYDLTQGTNLNRIGyckDIPSFVIQMDFSADGKYIqVSTGAYKR-QVHEVPLGKQvteavviekitwasw 1864
Cdd:COG2319   262 ASGSADGTVRLWDLATGELLRTLT---GHSGGVNSVAFSPDGKLL-ASGSDDGTvRLWDLATGKL--------------- 322
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1865 tsvlgdevigIWPRNADKADVNCACVTHAGLNIVTGDDFGLVKLFDfpcTEKFAKHKRYFGHSAHVTNIRFSYDDKYVVS 1944
Cdd:COG2319   323 ----------LRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWD---LATGELLRTLTGHTGAVTSVAFSPDGRTLAS 389
                         490
                  ....*....|..
gi 223005862 1945 tGGDDCSVFVWR 1956
Cdd:COG2319   390 -GSADGTVRLWD 400
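Each hit record on this page carries a one-line statistics header (Pssm-ID, optional tag, Cd Length, Bit Score, E-value). The following Python sketch shows one way such a line could be turned into typed fields. The regular expression and the name `parse_stats` are assumptions written against the lines as they appear in this text dump, not an official NCBI format definition.

```python
import re

# Pattern for the statistics line as it appears in this text dump.
# The bracketed tag (e.g. "[Multi-domain]") is optional.
STATS = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_stats(line):
    """Parse one 'Pssm-ID: ...' line into a dict of typed values."""
    match = STATS.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "tag": match["tag"],  # None when no bracketed tag is present
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


line = "Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 131.19  E-value: 4.30e-32"
print(parse_stats(line))
```

Parsing the E-value as a float keeps hits sortable by significance without string comparisons.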
WD40 COG2319
WD40 repeat [General function prediction only];
1420-1800 4.01e-31

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 128.11  E-value: 4.01e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1420 LTVNQHPKYRNVVATSQIGTT-------PSIHIWDAMTKHTLSMLRcFHSKGVNYINFSATGKLLVSVGVDpeHTITVWR 1492
Cdd:COG2319    72 ATLLGHTAAVLSVAFSPDGRLlasasadGTVRLWDLATGLLLRTLT-GHTGAVRSVAFSPDGKTLASGSAD--GTVRLWD 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1493 WQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGV-KHMKFWTLAGSALLYKkgVIGSLGAAkmqtmLSVAFGANNLTF- 1570
Cdd:COG2319   149 LATGKLLRTLTGHSGAVTSVAFSPDG-KLLASGSDdGTVRLWDLATGKLLRT--LTGHTGAV-----RSVAFSPDGKLLa 220
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1571 TGAINGDVYVW--KDHFLIRLVaKAHTGPVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWDqemkrcrafqLETGQLVE 1647
Cdd:COG2319   221 SGSADGTVRLWdlATGKLLRTL-TGHSGSVRSV-AFSPDGrLLASGS------ADGTVRLWD----------LATGELLR 282
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1648 cvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSl 1727
Cdd:COG2319   283 ---------------------------------TLTGH-SGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLT- 327
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1728 GHAA--RCAAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRDRKSAIQDIRISPDNRFLAVGSSEHTVDFYDLT 1800
Cdd:COG2319   328 GHTGavRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
WD40 COG2319
WD40 repeat [General function prediction only];
718-1071 2.50e-30

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 125.79  E-value: 2.50e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  718 HSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHS 797
Cdd:COG2319   111 LLLRTLTGHTGAVRSVAFSPDGKTLASG--SADGTVRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLASGSDD--GT 185
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  798 IVFWDWKKGEKIATTRGHKDKIFVVKCNPHHvDKLVTVGI-KHIKFWQQAGGGF-TSKRGTFGSVgkletmMCVSY---G 872
Cdd:COG2319   186 VRLWDLATGKLLRTLTGHTGAVRSVAFSPDG-KLLASGSAdGTVRLWDLATGKLlRTLTGHSGSV------RSVAFspdG 258
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  873 RmedLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAM-YALD-KGFVTGGKDGIVELWDdmferclktyaikrsalst 948
Cdd:COG2319   259 R---LLASGSADGTVRLWdlATGELLRTLTGHSGGVNSVaFSPDgKLLASGSDDGTVRLWD------------------- 316
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  949 sskglllednpsiraitlghghilvgTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELS 1028
Cdd:COG2319   317 --------------------------LATGKLLRT---------LTGH-TGAVRSVAFSPDGKTLASGSDDGTVRLWDLA 360
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|...
gi 223005862 1029 AQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADT 1071
Cdd:COG2319   361 TGELLRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLAT 403
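Several of the COG2319 hits listed above cover overlapping stretches of the query (for example 708-1140 and 718-1071). A minimal Python sketch of collapsing such intervals into non-overlapping regions is shown here. The helper name `merge_intervals` is an assumption for illustration, and merging by coordinates alone ignores the long unaligned stretches inside some hits, so the result is only a coarse summary.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) intervals.

    Intervals that merely touch end-to-start (e.g. 1-10 and 11-20)
    are kept separate in this sketch.
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


# Query intervals of the four COG2319 hits listed directly above.
hits = [(708, 1140), (1471, 1956), (1420, 1800), (718, 1071)]
print(merge_intervals(hits))
```

For these four hits the merge leaves two regions, consistent with the page reporting separate groups of WD40 hits along the protein.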
WD40 COG2319
WD40 repeat [General function prediction only];
274-844 2.07e-29

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 123.10  E-value: 2.07e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  274 LRETEQGYKGLSIRSVCWKADRLLAGTQDSEIFEVIVRERDKPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:COG2319    28 LLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTVRLWD 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  354 LADHALIARCNM-EEAVRSVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPV 432
Cdd:COG2319   107 LATGLLLRTLTGhTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  433 DVYAVAQrykkiGECSKSL----SFITHIDWSLDSKYLqtndgagerlfyrmpsgkpltskeeikgipwaswtcvkgpev 508
Cdd:COG2319   187 RLWDLAT-----GKLLRTLtghtGAVRSVAFSPDGKLL------------------------------------------ 219
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  509 sgiwpkytevtdinsvdanynssvlVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSV 588
Cdd:COG2319   220 -------------------------ASGSADGTVRLWD---LATGKLLRTLTGHSGSVRSVAFSPDGRLLAS-GSADGTV 270
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  589 FQWRfipegvsngmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflk 668
Cdd:COG2319   271 RLWD---------------------------------------------------------------------------- 274
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  669 rekapedslklqfihgyrgydcrnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvG 748
Cdd:COG2319   275 ----------------------------LATGELL-----------------RTLTGHSGGVNSVAFSPDGKLLASG--S 307
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  749 RDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPHH 828
Cdd:COG2319   308 DDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRLWDLATGELLRTLTGHTGAVTSVAFSPDG 384
                         570
                  ....*....|....*..
gi 223005862  829 vDKLVTVGI-KHIKFWQ 844
Cdd:COG2319   385 -RTLASGSAdGTVRLWD 400
WD40 COG2319
WD40 repeat [General function prediction only];
880-1266 1.29e-26

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 114.62  E-value: 1.29e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  880 SGAATGDIFIWKDILLLKTVKAHDGPVF--AMYALDKGFVTGGKDGIVELWDdmferclktyaikrsALSTSSKGLLLED 957
Cdd:COG2319    55 AGDLTLLLLDAAAGALLATLLGHTAAVLsvAFSPDGRLLASASADGTVRLWD---------------LATGLLLRTLTGH 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  958 NPSIRAITLGH-GHILV-GTKNGEILEID-KSGPMTLLVQGHmEGEVWGLAAHP---LLpicATVSDDKTLRIWELSAQH 1031
Cdd:COG2319   120 TGAVRSVAFSPdGKTLAsGSADGTVRLWDlATGKLLRTLTGH-SGAVTSVAFSPdgkLL---ASGSDDGTVRLWDLATGK 195
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1032 RMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYN 1111
Cdd:COG2319   196 LLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSPD-GRLLASGSADGTVRLWD 274
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1112 VLTSKRVGICKGASSYITHIDWDSRGKLLqVnSGAREqlffeaprgkrHIIRPseiekieWDTWTcvlgPTCEGIWPAHS 1191
Cdd:COG2319   275 LATGELLRTLTGHSGGVNSVAFSPDGKLL-A-SGSDD-----------GTVRL-------WDLAT----GKLLRTLTGHT 330
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1192 DitDVNAASLTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWLHNDSVLLTvGGADTALMIW 1266
Cdd:COG2319   331 G--AVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---GHTGAVTSVAFSPDGRTLAS-GSADGTVRLW 399
WD40 COG2319
WD40 repeat [General function prediction only];
943-1266 1.27e-24

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 108.85  E-value: 1.27e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  943 RSALSTSSKGLLLEDNPSIRAITLGHGHILVGTKNGEILEIDKSGPMTLLVQGHmEGEVWGLAAHPLLPICATVSDDKTL 1022
Cdd:COG2319    24 ALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGH-TAAVLSVAFSPDGRLLASASADGTV 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1023 RIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVAS 1102
Cdd:COG2319   103 RLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSPD-GKLLASGS 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1103 HDNFVDIYNVLTSKRVGICKGASSYITHIDWDSRGKLLQVNSGAREqlffeaprgkrhiIRPseiekieWDtwtcVLGPT 1182
Cdd:COG2319   182 DDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGT-------------VRL-------WD----LATGK 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1183 CEGIWPAHSDitDVNAASLTKDCSLLATGDDFGFVKLFSyPVKGQHARFKKyvGHSAHVTNVRWLHNDSVLLTvGGADTA 1262
Cdd:COG2319   238 LLRTLTGHSG--SVRSVAFSPDGRLLASGSADGTVRLWD-LATGELLRTLT--GHSGGVNSVAFSPDGKLLAS-GSDDGT 311

                  ....
gi 223005862 1263 LMIW 1266
Cdd:COG2319   312 VRLW 315
WD40 COG2319
WD40 repeat [General function prediction only];
1401-1718 1.09e-23

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 105.76  E-value: 1.09e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1401 NLSTGSQSFYLE-HTDDILCLTVnqHPKYRNVVATSQIGTtpsIHIWDAMTKHTLSMLRCfHSKGVNYINFSATGKLLVS 1479
Cdd:COG2319   106 DLATGLLLRTLTgHTGAVRSVAF--SPDGKTLASGSADGT---VRLWDLATGKLLRTLTG-HSGAVTSVAFSPDGKLLAS 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1480 VGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTqFVSVGV-KHMKFWTLAGSALLYKKGviGSLGAAkmqtm 1558
Cdd:COG2319   180 GSDD--GTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKL-LASGSAdGTVRLWDLATGKLLRTLT--GHSGSV----- 249
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1559 LSVAFGANNLTF-TGAINGDVYVWK-DHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDqemkrcr 1636
Cdd:COG2319   250 RSVAFSPDGRLLaSGSADGTVRLWDlATGELLRTLTGHSGGVNSVAFSPDGKLLASGS------DDGTVRLWD------- 316
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1637 afqLETGQLV-------ECVRSVC-RGKGKILV-GTKDGEI----IEVGEKNAAsnilIDGHmEGEIWGLATHPSKDLFI 1703
Cdd:COG2319   317 ---LATGKLLrtltghtGAVRSVAfSPDGKTLAsGSDDGTVrlwdLATGELLRT----LTGH-TGAVTSVAFSPDGRTLA 388
                         330
                  ....*....|....*
gi 223005862 1704 SASNDGTARIWDLAD 1718
Cdd:COG2319   389 SGSADGTVRLWDLAT 403
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
723-929 7.53e-23

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 100.87  E-value: 7.53e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  723 YLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:cd00200    89 LTGHTSYVSSVAFSPDGRILSSS--SRDKTIKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTFVASSSQD--GTIKLWD 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  803 WKKGEKIATTRGHKDKIFVVKCNPHHVDKLVTVGIKHIKFWQQAGGgftSKRGTFgsVGKLETMMCVSYGRMEDLVFSGA 882
Cdd:cd00200   164 LRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG---KCLGTL--RGHENGVNSVAFSPDGYLLASGS 238
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 223005862  883 ATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD 929
Cdd:cd00200   239 EDGTIRVWdlRTGECVQTLSGHTNSVTSLAWSPDGkrLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1413-1763 1.24e-21

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 97.41  E-value: 1.24e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1413 HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDamtKHTLSMLRCF--HSKGVNYINFSATGKLLVSVGVDpeHTITV 1490
Cdd:cd00200     8 HTGGVTCVAFSPDGKL--LATGSGDGT---IKVWD---LETGELLRTLkgHTGPVRDVAASADGTYLASGSSD--KTIRL 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1491 WRWQEGAKVASRGGHLERIFVVEFRPDSdtQFVSVGVKH--MKFWTLAgsallykKGVIGSLGAAKMQTMLSVAF-GANN 1567
Cdd:cd00200    78 WDLETGECVRTLTGHTSYVSSVAFSPDG--RILSSSSRDktIKVWDVE-------TGKCLTTLRGHTDWVNSVAFsPDGT 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1568 LTFTGAINGDVYVW--KDHFLIRlVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRCRAfqletgql 1645
Cdd:cd00200   149 FVASSSQDGTIKLWdlRTGKCVA-TLTGHTGEVNSVAFSPDGEKLLSSS------SDGTIKLWDLSTGKCLG-------- 213
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1646 vecvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKV 1725
Cdd:cd00200   214 -----------------------------------TLRGH-ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 223005862 1726 SlGHAAR--CAAYSPDGEMVAIGMKNgefvillvNSLKVW 1763
Cdd:cd00200   258 S-GHTNSvtSLAWSPDGKRLASGSAD--------GTIRIW 288
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
228-466 1.05e-20

Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 94.71  E-value: 1.05e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  228 LVRTIQGaHSAGIFSM--YACEEGFATGGRDGCIRLWDTDFKpitkiDLRETEQGYKGlSIRSVCWKAD--RLLAGTQDS 303
Cdd:cd00200     1 LRRTLKG-HTGGVTCVafSPDGKLLATGSGDGTIKVWDLETG-----ELLRTLKGHTG-PVRDVAASADgtYLASGSSDK 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  304 EIFevIVRERDKPML-ILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNM-EEAVRSVAFSPDGSQL 381
Cdd:cd00200    74 TIR--LWDLETGECVrTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGhTDWVNSVAFSPDGTFV 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  382 ALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVAQRyKKIGECSKSLSFITHIDWSL 461
Cdd:cd00200   151 ASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTG-KCLGTLRGHENGVNSVAFSP 229

                  ....*
gi 223005862  462 DSKYL 466
Cdd:cd00200   230 DGYLL 234
WD40 COG2319
WD40 repeat [General function prediction only];
521-1112 2.60e-20

Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 95.75  E-value: 2.60e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRfipegvsn 600
Cdd:COG2319    81 VLSVAFSPDGRLLASASADGTVRLWD---LATGLLLRTLTGHTGAVRSVAFSPDGKTLAS-GSADGTVRLWD-------- 148
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  601 gmletapqeggadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflkrekapedslklq 680
Cdd:COG2319       --------------------------------------------------------------------------------
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  681 fihgyrgydcrnnlfyTQAGEVVYHIAavavvynrqqhsqrlylGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQT 760
Cdd:COG2319   149 ----------------LATGKLLRTLT-----------------GHSGAVTSVAFSPDGKLLASG--SDDGTVRLWDLAT 193
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  761 LKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFvvkcnphhvdklvtvgikhi 840
Cdd:COG2319   194 GKLLRTLTG-HTGAVRSVAFSPDGKLLASGSAD--GTVRLWDLATGKLLRTLTGHSGSVR-------------------- 250
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  841 kfwqqagggftskrgtfgsvgkletmmcvsygrmeDLVFSgaatgdifiwkdilllktvkaHDGPVFAmyaldkgfvTGG 920
Cdd:COG2319   251 -----------------------------------SVAFS---------------------PDGRLLA---------SGS 265
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  921 KDGIVELWDdmferclktyaikrsalstsskglllednpsiraitlghghilvgTKNGEILEidksgpmtlLVQGHmEGE 1000
Cdd:COG2319   266 ADGTVRLWD---------------------------------------------LATGELLR---------TLTGH-SGG 290
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1001 VWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHH 1080
Cdd:COG2319   291 VNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLTG 370
                         570       580       590
                  ....*....|....*....|....*....|..
gi 223005862 1081 RKEMISDIKFSKDtGKYLAVASHDNFVDIYNV 1112
Cdd:COG2319   371 HTGAVTSVAFSPD-GRTLASGSADGTVRLWDL 401
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
2-48 5.47e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved among metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.47e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 223005862     2 ADRTAPRCQLRLEWVYGYRGHQCRNNLYYTAGKEVVYFVAGVGVVYN 48
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYD 71
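Each hit on this page carries a one-line summary of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", optionally with a bracketed flag such as [Multi-domain]. The following is a minimal sketch of how such a line could be read into typed fields, assuming the layout seen on this page. The names SUMMARY and parse_summary are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears on this page (assumed layout).
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional flag, e.g. [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "flag": m.group("flag"),              # None when no bracketed flag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    print(parse_summary("Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.47e-20"))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance, although very small values may underflow to 0.0 in other outputs.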
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
668-715 5.68e-20

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 85.68  E-value: 5.68e-20
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 223005862   668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715
Cdd:pfam03451   25 QKKEPPDKKLKLEWVYGYRGKDCRSNLYYLPTGEIVYFTAAVVVLYDV 72
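In the alignment blocks, each row gives a label, a start coordinate, the aligned residues (with "-" for gaps and lowercase for unaligned query residues), and an end coordinate. A row can be sanity-checked by confirming that the number of residue letters equals end - start + 1. The sketch below assumes the row layout seen on this page; ROW, parse_row and span_is_consistent are hypothetical names used only for illustration.

```python
import re

# Row layout assumed from this page: "<label> <start> <aligned sequence> <end>".
ROW = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line):
    """Parse one alignment row; return None for rulers or rows without coordinates."""
    m = ROW.match(line)
    if m is None:
        return None
    seq = m.group("seq")
    return {
        "label": m.group("label"),
        "start": int(m.group("start")),
        "end": int(m.group("end")),
        "seq": seq,
        # Gap characters are excluded; upper- and lowercase letters both count.
        "residues": sum(ch.isalpha() for ch in seq),
    }


def span_is_consistent(row):
    """True when the coordinate span matches the number of residues in the row."""
    return row["end"] - row["start"] + 1 == row["residues"]


if __name__ == "__main__":
    row = parse_row("gi 223005862   668 KREKAPEDSLKLQFIHGYRGYDCRNNLFYTQAGEVVYHIAAVAVVYNR 715")
    print(row["residues"], span_is_consistent(row))
```

Rows that consist only of gaps carry no coordinates on this page, so parse_row returns None for them rather than guessing a position.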
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1461-1798 1.17e-19

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 91.63  E-value: 1.17e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1461 HSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSdTQFVSVGVKHM-KFWTLAGSA 1539
Cdd:cd00200     8 HTGGVTCVAFSPDGKLLATGSGD--GTIKVWDLETGELLRTLKGHTGPVRDVAASADG-TYLASGSSDKTiRLWDLETGE 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1540 LLYkkgvigSLGAAKmQTMLSVAFGANNLTFTGAI-NGDVYVWK-DHFLIRLVAKAHTGPVFTMyTTLRDGLIVTGGKEr 1617
Cdd:cd00200    85 CVR------TLTGHT-SYVSSVAFSPDGRILSSSSrDKTIKVWDvETGKCLTTLRGHTDWVNSV-AFSPDGTFVASSSQ- 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1618 ptkeGGAVKLWDqemkrcrafqLETGQLVEcvrsvcrgkgkilvgtkdgeiievgeknaasniLIDGHmEGEIWGLATHP 1697
Cdd:cd00200   156 ----DGTIKLWD----------LRTGKCVA---------------------------------TLTGH-TGEVNSVAFSP 187
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1698 SKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMKNGefvillvnSLKVW-GKKRDRK---- 1770
Cdd:cd00200   188 DGEKLLSSSSDGTIKLWDLSTGKCL-GTLRGHenGVNSVAFSPDGYLLASGSEDG--------TIRVWdLRTGECVqtls 258
                         330       340       350
                  ....*....|....*....|....*....|.
gi 223005862 1771 ---SAIQDIRISPDNRFLAVGSSEHTVDFYD 1798
Cdd:cd00200   259 ghtNSVTSLAWSPDGKRLASGSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1592-1956 2.75e-19

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 90.47  E-value: 2.75e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1592 KAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWDQEMKRC-RAFQLETGQlVECVRSVCRGKgKILVGTKDGEIIE 1670
Cdd:cd00200     6 KGHTGGVTCVAFSPDGKLLATGS------GDGTIKVWDLETGELlRTLKGHTGP-VRDVAASADGT-YLASGSSDKTIRL 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1671 VGEKNAASNILIDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLnKVSLGH--AARCAAYSPDGEMVAIGMK 1748
Cdd:cd00200    78 WDLETGECVRTLTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCL-TTLRGHtdWVNSVAFSPDGTFVASSSQ 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1749 NGefvillvnSLKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQ 1820
Cdd:cd00200   156 DG--------TIKLWdlrtGKCVATltghTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN---GVNS 224
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1821 MDFSADGKYIqvstgaykrqvhevplgkqvteavviekitwaswTSVLGDEVIGIWprnadkadvncacvthaglNIVTG 1900
Cdd:cd00200   225 VAFSPDGYLL----------------------------------ASGSEDGTIRVW-------------------DLRTG 251
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 223005862 1901 DdfglvklfdfpctekfaKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1956
Cdd:cd00200   252 E-----------------CVQTLSGHTNSVTSLAWSPDGKRLAS-GSADGTIRIWD 289
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
895-1221 1.95e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 87.78  E-value: 1.95e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  895 LLKTVKAHDGPVFAMYALDKG--FVTGGKDGIVELWD---DMFERCLKTYAIKRSALSTSSKGLLL----EDNpSIRait 965
Cdd:cd00200     1 LRRTLKGHTGGVTCVAFSPDGklLATGSGDGTIKVWDletGELLRTLKGHTGPVRDVAASADGTYLasgsSDK-TIR--- 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  966 lghghiLVGTKNGEILEIdksgpmtllVQGHmEGEVWGLAAHPLLPICATVSDDKTLRIWELSAQHRMLAVRKLKKGGRC 1045
Cdd:cd00200    77 ------LWDLETGECVRT---------LTGH-TSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNS 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1046 CAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVGICKGAS 1125
Cdd:cd00200   141 VAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPD-GEKLLSSSSDGTIKLWDLSTGKCLGTLRGHE 219
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1126 SYITHIDWDSRGKLLqvnSGAreqlffeaprgkrhiirpSEIEKIE-WDTWTCVLGPTCEGiwpaHSdiTDVNAASLTKD 1204
Cdd:cd00200   220 NGVNSVAFSPDGYLL---ASG------------------SEDGTIRvWDLRTGECVQTLSG----HT--NSVTSLAWSPD 272
                         330
                  ....*....|....*..
gi 223005862 1205 CSLLATGDDFGFVKLFS 1221
Cdd:cd00200   273 GKRLASGSADGTIRIWD 289
WD40 COG2319
WD40 repeat [General function prediction only];
291-929 7.87e-18

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 88.04  E-value: 7.87e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  291 WKADRLLAGTQDSEIFEVIVRERDKPMLILQGHCEGELWALALHPKKPLAVTGSDDRSVRLWSLADHALIARCNMEEAVR 370
Cdd:COG2319     3 SADGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVL 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  371 SVAFSPDGSQLALGMKDGSFIVLRVRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVyavaqrykkigecsks 450
Cdd:COG2319    83 SVAFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRL---------------- 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  451 lsfithidWSLDSkylqtndgagerlfyrmpsGKPLtskeeikgipwaswtcvkgpevsgiwpkytevtdinsvdanyns 530
Cdd:COG2319   147 --------WDLAT-------------------GKLL-------------------------------------------- 155
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  531 svlvsgddfglvklfkfpclkrgakfRKYVGHSAHVTNVRWSHDFQWvLSTGGADHSVFQWRfipegvsngmletapqeg 610
Cdd:COG2319   156 --------------------------RTLTGHSGAVTSVAFSPDGKL-LASGSDDGTVRLWD------------------ 190
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  611 gadsyseesdsdlsdvpeldsdieqeaqinydrqvykedlpqlkqqskeknhavpflkrekapedslklqfihgyrgydc 690
Cdd:COG2319       --------------------------------------------------------------------------------
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  691 rnnlfyTQAGEVVyhiaavavvynrqqhsqRLYLGHDDDILSLTIHPVKDYVATGqvGRDAAIHVWDTQTLKCLSLLKGq 770
Cdd:COG2319   191 ------LATGKLL-----------------RTLTGHTGAVRSVAFSPDGKLLASG--SADGTVRLWDLATGKLLRTLTG- 244
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  771 HQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNPhhvD--KLVTVGI-KHIKFWQQAG 847
Cdd:COG2319   245 HSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSP---DgkLLASGSDdGTVRLWDLAT 319
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  848 GgftSKRGTFGsvGKLETMMCVSYGRMEDLVFSGAATGDIFIW--KDILLLKTVKAHDGPVFAMYALDKG--FVTGGKDG 923
Cdd:COG2319   320 G---KLLRTLT--GHTGAVRSVAFSPDGKTLASGSDDGTVRLWdlATGELLRTLTGHTGAVTSVAFSPDGrtLASGSADG 394

                  ....*.
gi 223005862  924 IVELWD 929
Cdd:COG2319   395 TVRLWD 400
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1682-1956 9.87e-18

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 85.85  E-value: 9.87e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWDLADKKLLNKVSlGHAA--RCAAYSPDGEMVAIGMKNgefvillvNS 1759
Cdd:cd00200     5 LKGH-TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLK-GHTGpvRDVAASADGTYLASGSSD--------KT 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1760 LKVW----GKKRDR----KSAIQDIRISPDNRFLAVGSSEHTVDFYDLTQGTNLNRIGYCKDipsFVIQMDFSADGKYiq 1831
Cdd:cd00200    75 IRLWdletGECVRTltghTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTD---WVNSVAFSPDGTF-- 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1832 VSTGAYKRQVH--EVPLGKqvteavVIEKITwaswtsvlgdevigiwprnADKADVNCACVTHAGLNIVTGDDFGLVKLF 1909
Cdd:cd00200   150 VASSSQDGTIKlwDLRTGK------CVATLT-------------------GHTGEVNSVAFSPDGEKLLSSSSDGTIKLW 204
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 223005862 1910 DFpctEKFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVWR 1956
Cdd:cd00200   205 DL---STGKCLGTLRGHENGVNSVAFSPDGYLLAS-GSEDGTIRVWD 247
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
46-177 4.14e-16

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 80.84  E-value: 4.14e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   46 VYNTREHS-QKFFLGHNDDIISLALHPDKTLVATGQVGKEpyICIWDSYNVQTVSLLKdVHTHGVACLAFDSDGQRLASV 124
Cdd:cd00200   161 LWDLRTGKcVATLTGHTGEVNSVAFSPDGEKLLSSSSDGT--IKLWDLSTGKCLGTLR-GHENGVNSVAFSPDGYLLASG 237
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 223005862  125 GLDakNTVCIWDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFW 177
Cdd:cd00200   238 SED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP-DGKRLASGSAdGTIRIW 288
WD40 COG2319
WD40 repeat [General function prediction only];
55-179 4.59e-16

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 82.65  E-value: 4.59e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862   55 KFFLGHNDDIISLALHPDKTLVATGqvGKEPYICIWDSYNVQTVSLLKDvHTHGVACLAFDSDGQRLASVGLDakNTVCI 134
Cdd:COG2319   282 RTLTGHSGGVNSVAFSPDGKLLASG--SDDGTVRLWDLATGKLLRTLTG-HTGAVRSVAFSPDGKTLASGSDD--GTVRL 356
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 223005862  135 WDWRKGKLLASATGHSDRIFDISWDPyQPNRVVSCGV-KHIKFWTL 179
Cdd:COG2319   357 WDLATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSAdGTVRLWDL 401
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
521-826 1.96e-14

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 75.83  E-value: 1.96e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  521 INSVDANYNSSVLVSGDDFGLVKLFKfpcLKRGAKFRKYVGHSAHVTNVRWSHDFQWVLStGGADHSVFQWRFipegvsn 600
Cdd:cd00200    12 VTCVAFSPDGKLLATGSGDGTIKVWD---LETGELLRTLKGHTGPVRDVAASADGTYLAS-GSSDKTIRLWDL------- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  601 gmletapqEGGADSYS-EESDSDLSDVpeldsDIEQEAQI----NYDRQVYKEDLPQLKQQSKEKNHavpflkrekapED 675
Cdd:cd00200    81 --------ETGECVRTlTGHTSYVSSV-----AFSPDGRIlsssSRDKTIKVWDVETGKCLTTLRGH-----------TD 136
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  676 SLklqfihgyrgydcrNNLFYTQAGEVVYHIAA--VAVVYN-RQQHSQRLYLGHDDDILSLTIHPVKDYVATGqvGRDAA 752
Cdd:cd00200   137 WV--------------NSVAFSPDGTFVASSSQdgTIKLWDlRTGKCVATLTGHTGEVNSVAFSPDGEKLLSS--SSDGT 200
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 223005862  753 IHVWDTQTLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWDWKKGEKIATTRGHKDKIFVVKCNP 826
Cdd:cd00200   201 IKLWDLSTGKCLGTLRG-HENGVNSVAFSPDGYLLASGSED--GTIRVWDLRTGECVQTLSGHTNSVTSLAWSP 271
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1398-1629 1.78e-13

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 73.14  E-value: 1.78e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1398 IVQNLSTGSQSFYLE-HTDDILCLTVNQHPKYrnVVATSQIGTtpsIHIWDAMTKHTLSMLRCfHSKGVNYINFSATGKL 1476
Cdd:cd00200    76 RLWDLETGECVRTLTgHTSYVSSVAFSPDGRI--LSSSSRDKT---IKVWDVETGKCLTTLRG-HTDWVNSVAFSPDGTF 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1477 LVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSDTQFVSVGVKHMKFWTLAGSALLYkkgvigsLGAAKMQ 1556
Cdd:cd00200   150 VASSSQD--GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLG-------TLRGHEN 220
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 223005862 1557 TMLSVAFGANNLTFTGA-INGDVYVWK-DHFLIRLVAKAHTGPVFTMYTTLRDGLIVTGGkerptkEGGAVKLWD 1629
Cdd:cd00200   221 GVNSVAFSPDGYLLASGsEDGTIRVWDlRTGECVQTLSGHTNSVTSLAWSPDGKRLASGS------ADGTIRIWD 289
HELP pfam03451
HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm ...
1350-1401 7.88e-13

HELP motif; The founding member of the EMAP protein family is the 75 kDa Echinoderm Microtubule-Associated Protein, so named for its abundance in sea urchin, sand dollar and starfish eggs. The Hydrophobic EMAP-Like Protein (HELP) motif was identified initially in the human EMAP-Like Protein 2 (EML2) and subsequently in the entire EMAP protein family. The HELP motif is approximately 60-70 amino acids in length and is conserved amongst metazoans. Although the HELP motif is hydrophobic, there is no evidence that EMAP-Like Proteins are membrane-associated. All members of the EMAP-Like Protein family identified to date are constructed with an amino-terminal HELP motif followed by a WD domain. In C. elegans, EMAP-Like Protein-1 (ELP-1) is required for touch sensation, indicating that ELP-1 may play a role in mechanosensation. The localization of ELP-1 to microtubules and adhesion sites implies that ELP-1 may transmit forces between the body surface and the touch receptor neurons.


Pssm-ID: 460922  Cd Length: 72  Bit Score: 65.27  E-value: 7.88e-13
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 223005862  1350 KNNITKKKKLVEE-LALDHVFGYRGFDCRNNLHYLNDGaDIIFHTAAAGIVQN 1401
Cdd:pfam03451   20 KDDLDQKKEPPDKkLKLEWVYGYRGKDCRSNLYYLPTG-EIVYFTAAVVVLYD 71
WD40 cd00200
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ...
1040-1266 5.46e-12

WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel β-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed-ring propeller structure; residues on the top and bottom surfaces of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.


Pssm-ID: 238121 [Multi-domain]  Cd Length: 289  Bit Score: 68.52  E-value: 5.46e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1040 KKGGRCCAFSPDGKALAVGLNDGSFLVVNADTVEDMVSFHHRKEMISDIKFSKDtGKYLAVASHDNFVDIYNVLTSKRVG 1119
Cdd:cd00200     9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLETGECVR 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1120 ICKGASSYITHIDWDSRGKLLqvnSGAREQlffeaprgKRHIIrpseiekieWDTWTcvlgPTCEGIWPAHSDitDVNAA 1199
Cdd:cd00200    88 TLTGHTSYVSSVAFSPDGRIL---SSSSRD--------KTIKV---------WDVET----GKCLTTLRGHTD--WVNSV 141
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 223005862 1200 SLTKDCSLLATGDDFGFVKLFSyPVKGQhaRFKKYVGHSAHVTNVRWlHNDSVLLTVGGADTALMIW 1266
Cdd:cd00200   142 AFSPDGTFVASSSQDGTIKLWD-LRTGK--CVATLTGHTGEVNSVAF-SPDGEKLLSSSSDGTIKLW 204
WD40 COG2319
WD40 repeat [General function prediction only];
1123-1629 6.46e-09

WD40 repeat [General function prediction only];


Pssm-ID: 441893 [Multi-domain]  Cd Length: 403  Bit Score: 60.31  E-value: 6.46e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1123 GASSYITHIDWDSRGKLLQVNSGAREQLFFEA-PRGKRHIIRPSEIEKIE-WDtwtcVLGPTCEGIWPAHSDitDVNAAS 1200
Cdd:COG2319    54 GAGDLTLLLLDAAAGALLATLLGHTAAVLSVAfSPDGRLLASASADGTVRlWD----LATGLLLRTLTGHTG--AVRSVA 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1201 LTKDCSLLATGDDFGFVKLFSYPVKGQHARFKkyvGHSAHVTNVRWlHNDSVLLTVGGADTALMIWtrefvgtqesklvd 1280
Cdd:COG2319   128 FSPDGKTLASGSADGTVRLWDLATGKLLRTLT---GHSGAVTSVAF-SPDGKLLASGSDDGTVRLW-------------- 189
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1281 seesdtdveedggydsdvarekaidyttkiyavsiremegtkphqqlkevsveerppvsraapqpeklqknnitkkkklv 1360
Cdd:COG2319       --------------------------------------------------------------------------------
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1361 eelaldhvfgyrgfdcrnnlhylndgadiifhtaaagivqNLSTGSQSFYLE-HTDDILCLTVNQHPKYrnVVATSQIGT 1439
Cdd:COG2319   190 ----------------------------------------DLATGKLLRTLTgHTGAVRSVAFSPDGKL--LASGSADGT 227
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1440 tpsIHIWDAMTKHTLSMLRcFHSKGVNYINFSATGKLLVSVGVDpeHTITVWRWQEGAKVASRGGHLERIFVVEFRPDSD 1519
Cdd:COG2319   228 ---VRLWDLATGKLLRTLT-GHSGSVRSVAFSPDGRLLASGSAD--GTVRLWDLATGELLRTLTGHSGGVNSVAFSPDGK 301
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1520 TqFVSVGV-KHMKFWTLAGSALLykKGVIGSLGAAkmqtmLSVAFGANNLT-FTGAINGDVYVWK-DHFLIRLVAKAHTG 1596
Cdd:COG2319   302 L-LASGSDdGTVRLWDLATGKLL--RTLTGHTGAV-----RSVAFSPDGKTlASGSDDGTVRLWDlATGELLRTLTGHTG 373
                         490       500       510
                  ....*....|....*....|....*....|....
gi 223005862 1597 PVFTMyTTLRDG-LIVTGGkerptkEGGAVKLWD 1629
Cdd:COG2319   374 AVTSV-AFSPDGrTLASGS------ADGTVRLWD 400
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
1699-1830 3.23e-05

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 47.38  E-value: 3.23e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1699 KDLFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAI-GMKNGEFVILLVNSLKVWGKKRDRKSAiQDIR 1777
Cdd:COG3391    80 RRLYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLYVaDSGNGRVSVIDTATGKVVATIPVGAGP-HGIA 158
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 223005862 1778 ISPDNRFLAVGSSE-HTVDFY----DLTQGTNLNRIgyckDIPSFVIQMDFSADGKYI 1830
Cdd:COG3391   159 VDPDGKRLYVANSGsNTVSVIvsviDTATGKVVATI----PVGGGPVGVAVSPDGRRL 212
COG4946 COG4946
Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown] ...
368-488 5.55e-05

Uncharacterized N-terminal domain of tricorn protease, contains WD40 repeats [Function unknown];


Pssm-ID: 443973 [Multi-domain]  Cd Length: 1072  Bit Score: 48.50  E-value: 5.55e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  368 AVRSVAFSPDGSQLA-LGMKDGSF-IVLR--VRDMTEVVHIKDRKEVIHEMKFSPDGSYLAVGSNDGPVDVYAVA-QRYK 442
Cdd:COG4946   344 RERLPAWSPDGKSIAyFSDASGEYeLYIApaDGSGEPKQLTLGDLGRVFNPVWSPDGKKIAFTDNRGRLWVVDLAsGKVR 423
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 223005862  443 KIGEcSKSLSFITHIDWSLDSKYL---QTNDGAGERLF-YRMPSGK--PLTS 488
Cdd:COG4946   424 KVDT-DGYGDGISDLAWSPDSKWLaysKPGPNQLSQIFlYDVETGKtvQLTD 474
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
1691-1802 1.87e-04

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 45.07  E-value: 1.87e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862 1691 WGLATHPSKD-LFISASNDGTARIWDLADKKLLNKVSLGHAARCAAYSPDGEMVAIGMKNGEFVILLV-----NSLKVWg 1764
Cdd:COG3391   113 RGLAVDPDGGrLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVSVIVsvidtATGKVV- 191
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 223005862 1765 KKRDRKSAIQDIRISPDNRFLAV--------GSSEHTVDFYDLTQG 1802
Cdd:COG3391   192 ATIPVGGGPVGVAVSPDGRRLYVanrgsntsNGGSNTVSVIDLATL 237
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1682-1715 2.58e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 2.58e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 223005862   1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:smart00320    8 LKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
105-136 3.40e-04

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 39.99  E-value: 3.40e-04
                            10        20        30
                    ....*....|....*....|....*....|..
gi 223005862    105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:smart00320   11 HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
1682-1715 4.24e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 39.64  E-value: 4.24e-04
                           10        20        30
                   ....*....|....*....|....*....|....
gi 223005862  1682 IDGHmEGEIWGLATHPSKDLFISASNDGTARIWD 1715
Cdd:pfam00400    7 LEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 pfam00400
WD domain, G-beta repeat;
315-353 7.88e-04

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 38.87  E-value: 7.88e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 223005862   315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:pfam00400    2 KLLKTLEGH-TGSVTSLAFSPDGKLLASGSDDGTVKVWD 39
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
760-802 1.25e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 38.06  E-value: 1.25e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 223005862    760 TLKCLSLLKGqHQRGVCALDFSADGKCLVSVGLDdfHSIVFWD 802
Cdd:smart00320    1 SGELLKTLKG-HTGPVTSVAFSPDGKYLASGSDD--GTIKLWD 40
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
315-353 1.81e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 37.68  E-value: 1.81e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 223005862    315 KPMLILQGHcEGELWALALHPKKPLAVTGSDDRSVRLWS 353
Cdd:smart00320    3 ELLKTLKGH-TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
WD40 pfam00400
WD domain, G-beta repeat;
105-136 1.95e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 37.71  E-value: 1.95e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 223005862   105 HTHGVACLAFDSDGQRLASVGLDakNTVCIWD 136
Cdd:pfam00400   10 HTGSVTSLAFSPDGKLLASGSDD--GTVKVWD 39
ANAPC4_WD40 pfam12894
Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped ...
1734-1798 2.54e-03

Anaphase-promoting complex subunit 4 WD40 domain; Apc4 contains an N-terminal propeller-shaped WD40 domain. The N-terminus of Afi1 serves to stabilize the union between Apc4 and Apc5, both of which lie towards the bottom-front of the APC,


Pssm-ID: 403945 [Multi-domain]  Cd Length: 91  Bit Score: 38.80  E-value: 2.54e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 223005862  1734 AAYSPDGEMVAIGMKNGEFVILLVNSLKVWGKKRD-RKSAIQDIRISPDNRFLAVGSSEHTVDFYD 1798
Cdd:pfam12894    1 MSWCPTMDLIALATEDGELLLHRLNWQRVWTLSPDkEDLEVTSLAWRPDGKLLAVGYSDGTVRLLD 66
WD40 pfam00400
WD domain, G-beta repeat;
1921-1955 2.95e-03

WD domain, G-beta repeat;


Pssm-ID: 459801 [Multi-domain]  Cd Length: 39  Bit Score: 37.32  E-value: 2.95e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 223005862  1921 KRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1955
Cdd:pfam00400    5 KTLEGHTGSVTSLAFSPDGKLLAS-GSDDGTVKVW 38
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
339-430 5.00e-03

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 40.83  E-value: 5.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  339 LAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQL-ALGMKDGSFIVLRVRDMTEVVHIKDRKEViHEMKFS 417
Cdd:COG3391    82 LYVANSGSGRVSVIDLATGKVVATIPVGGGPRGLAVDPDGGRLyVADSGNGRVSVIDTATGKVVATIPVGAGP-HGIAVD 160
                          90
                  ....*....|...
gi 223005862  418 PDGSYLAVGSNDG 430
Cdd:COG3391   161 PDGKRLYVANSGS 173
WD40 smart00320
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ...
1916-1955 5.57e-03

WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.


Pssm-ID: 197651 [Multi-domain]  Cd Length: 40  Bit Score: 36.52  E-value: 5.57e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 223005862   1916 KFAKHKRYFGHSAHVTNIRFSYDDKYVVStGGDDCSVFVW 1955
Cdd:smart00320    1 SGELLKTLKGHTGPVTSVAFSPDGKYLAS-GSDDGTIKLW 39
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
329-425 6.43e-03

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 40.45  E-value: 6.43e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 223005862  329 WALALHPK-KPLAVTGSDDRSVRLWSLADHALIARCNMEEAVRSVAFSPDGSQLALGMKDGSFI-----VLRVRDMTEVV 402
Cdd:COG3391   113 RGLAVDPDgGRLYVADSGNGRVSVIDTATGKVVATIPVGAGPHGIAVDPDGKRLYVANSGSNTVsvivsVIDTATGKVVA 192
                          90       100
                  ....*....|....*....|...
gi 223005862  403 HIkDRKEVIHEMKFSPDGSYLAV 425
Cdd:COG3391   193 TI-PVGGGPVGVAVSPDGRRLYV 214
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
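
The per-hit summary lines in this report (the ones beginning with "Pssm-ID:") carry the bit score and E-value that the E-value threshold above applies to. The following is a minimal sketch, not part of the CD-Search output, of how such a line might be parsed and checked against the threshold. The names SUMMARY_RE, parse_summary and passes_threshold are hypothetical, and treating the cutoff as inclusive (E-value <= 0.01) is an assumption.

```python
import re

# Matches per-hit summary lines as they appear in this report, e.g.
# "Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 47.38  E-value: 3.23e-05"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of a summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed rule: keep a hit when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 47.38  E-value: 3.23e-05"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Every hit listed in this report has an E-value below 0.01, which is consistent with the preset threshold.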

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D)200-3.